Ming-Dong Zhu, De-Rong Shen, Kou Yue, Tie-Zheng Nie, Ge Yu. A Framework for Supporting Tree-Like Indexes on the Chord Overlay[J]. Journal of Computer Science and Technology, 2013, 28(6): 962-972. DOI: 10.1007/s11390-013-1391-8
Citation:
Ming-Dong Zhu, De-Rong Shen, Kou Yue, Tie-Zheng Nie, Ge Yu. A Framework for Supporting Tree-Like Indexes on the Chord Overlay[J]. Journal of Computer Science and Technology, 2013, 28(6): 962-972. DOI: 10.1007/s11390-013-1391-8
Ming-Dong Zhu, De-Rong Shen, Kou Yue, Tie-Zheng Nie, Ge Yu. A Framework for Supporting Tree-Like Indexes on the Chord Overlay[J]. Journal of Computer Science and Technology, 2013, 28(6): 962-972. DOI: 10.1007/s11390-013-1391-8
Citation:
Ming-Dong Zhu, De-Rong Shen, Kou Yue, Tie-Zheng Nie, Ge Yu. A Framework for Supporting Tree-Like Indexes on the Chord Overlay[J]. Journal of Computer Science and Technology, 2013, 28(6): 962-972. DOI: 10.1007/s11390-013-1391-8
College of Information Science and Engineering, Northeastern University, Shenyang 110004, China
Funds: This research was supported by the National Basic Research 973 Program of China under Grant No. 2012CB316201, the National Natural Science Foundation of China under Grant Nos. 60973021, 61033007, 61003060, and the Fundamental Research Funds for the Central Universities of China under Grant No. N100704001.
With the explosive growth of data, to support efficient data management including queries and updates, the database system is expected to provide tree-like indexes, such as R-tree, M-tree, B+-tree, according to different types of data. In the distributed environment, the indexes have to be scattered across the compute nodes to improve reliability and scalability. Indexes can speed up queries, but they incur maintenance cost when updates occur. In the distributed environment, each compute node maintains a subset of an index tree, so keeping the communication cost small is more crucial, or else it occupies lots of network bandwidth and the scalability and availability of the database system are affected. Further, to achieve the reliability and scalability for queries, several replicas of the index are needed, but keeping the replicas consistent is not straightforward. In this paper, we propose a framework supporting tree-like indexes, based on Chord overlay, which is a popular P2P structure. The framework dynamically tunes the number of replicas of index to balance the query cost and the update cost. Several techniques are designed to improve the efficiency of updates without the cost of performance of the queries. We implement M-tree and R-tree in our framework, and extensive experiments on reallife and synthetic datasets verify the efficiency and scalability of our framework.