Journal of Computer Science and Technology, 2019, Vol. 34, Issue (4): 762-774. doi: 10.1007/s11390-019-1941-9

Special Issue: Data Management and Data Mining

• Special Section on Spatio-Temporal Big Data Analytics •

A United Framework for Large-Scale Resource Description Framework Stream Processing

Hong Fang1, Bo Zhao2,3, Xiao-Wang Zhang2,3,*, Member, CCF, Xuan-Xing Yang2,3   

    1 College of Arts and Sciences, Shanghai Polytechnic University, Shanghai 201209, China;
    2 College of Intelligence and Computing, Tianjin University, Tianjin 300350, China;
    3 Tianjin Key Laboratory of Cognitive Computing and Application, Tianjin 300350, China
  • Received: 2019-01-15 Revised: 2019-05-09 Online: 2019-07-11 Published: 2019-07-11
  • Contact: Xiao-Wang Zhang
  • Supported by:
    This paper is supported by the National Key Research and Development Program of China under Grant No. 2017YFC0908401, and the National Natural Science Foundation of China under Grant No. 61672377. Xiao-Wang Zhang is supported by the program of Peiyang Young Scholars of China under Grant No. 2019XRX-0032.

Resource Description Framework (RDF) streams are useful for modeling spatio-temporal data. In this paper, we propose LRSP, a framework for large-scale RDF stream processing that handles general continuous queries over large-scale RDF streams. First, we propose a formalization, named CT-SPARQL, that represents general continuous queries in a unified, unambiguous way. Second, based on this formalization, we propose LRSP to process continuous queries in a common white-box way by separating RDF stream processing, query parsing, and query execution. Finally, we implement LRSP and evaluate it against popular continuous query engines on benchmark datasets and real-world datasets. Owing to the architecture of LRSP, many efficient RDF query engines (both centralized and distributed) can be directly employed to process continuous queries. The experimental results show that LRSP achieves higher performance, especially when processing large-scale real-world data.

Key words: resource description framework (RDF) stream; continuous query; united framework; stream processing; large-scale RDF stream processing (LRSP)

