? 一种用于提高数据去重系统选择性恢复性能的高耐久性固态硬盘读缓存
Journal of Computer Science and Technology
Quick Search in JCST
 Advanced Search 
      Home | PrePrint | SiteMap | Contact Us | Help
 
Indexed by   SCIE, EI ...
Bimonthly    Since 1986
Journal of Computer Science and Technology 2018, Vol. 33 Issue (1) :58-78    DOI: 10.1007/s11390-018-1808-5
Computer Architecture and Systems << Previous Articles | Next Articles >>
一种用于提高数据去重系统选择性恢复性能的高耐久性固态硬盘读缓存
Jian Liu1,2,3, Yun-Peng Chai2,3,*, Member, CCF, Xiao Qin4, Senior Member, IEEE, Yao-Hong Liu2,3
1 Division of Computer Science and Engineering, Louisiana State University, Baton Rouge, LA 70803, U.S.A;
2 Key Laboratory of Data Engineering and Knowledge Engineering, Ministry of Education of China, Beijing 100872, China;
3 School of Information, Renmin University of China, Beijing 100872, China;
4 Shelby Center for Engineering Technology, Department of Computer Science and Software Engineering Samuel Ginn College of Engineering, Auburn University, Auburn, AL 36849-5347, U.S.A
Endurable SSD-Based Read Cache for Improving the Performance of Selective Restore from Deduplication Systems
Jian Liu1,2,3, Yun-Peng Chai2,3,*, Member, CCF, Xiao Qin4, Senior Member, IEEE, Yao-Hong Liu2,3
1 Division of Computer Science and Engineering, Louisiana State University, Baton Rouge, LA 70803, U.S.A;
2 Key Laboratory of Data Engineering and Knowledge Engineering, Ministry of Education of China, Beijing 100872, China;
3 School of Information, Renmin University of China, Beijing 100872, China;
4 Shelby Center for Engineering Technology, Department of Computer Science and Software Engineering Samuel Ginn College of Engineering, Auburn University, Auburn, AL 36849-5347, U.S.A

摘要
参考文献
相关文章
Download: [PDF 1427KB]  
摘要 数据去重技术已经被广泛应用于企业级存储和云存储系统。为了应对数据去重系统的选择性恢复操作所面临的性能挑战,我们可以用基于固态硬盘的读缓存来动态地缓存那些非常热的数据来加快恢复速度。但是经典缓存算法(例如LRU和LFU)需要频繁的进行缓存数据的更新,这样会极大地缩短固态硬盘的使用寿命,同时也会降低其I/O处理速度。为了解决这个问题,我们提出了一种新的方法LOP-Cache来增加长期热门(LOP)数据在所有写入固态硬盘的缓存数据中的比例,因此能显著提高固态硬盘的耐久性,并提高I/O性能。LOP-Cache能够让LOP数据在固态硬盘缓存中停留较长时间从而减少缓存替换的次数。另外,它也能够阻止那些冷的或者不必要的数据进入固态硬盘缓存。我们在一个真实的数据去重原型系统中实现了LOP-Cache算法,从而评估它的性能。我们的实验结果表明,LOP-Cache能够利用一个较小的固态硬盘缓存(其空间为去重后数据的5.56%)将选择性恢复操作的平均延迟降低了37.3%。重要的是,LOP-Cache算法还能够将固态硬盘的寿命延长到原来的9.77倍。这些证据显示LOP-Cache算法能够为数据去重系统提供一个高性价比的固态硬盘读缓存来提高选择性恢复操作的性能。
关键词数据去重   固态硬盘   闪存   缓存   持久性     
Abstract: Deduplication has been commonly used in both enterprise storage systems and cloud storage. To overcome the performance challenge for the selective restore operations of deduplication systems, solid-state-drive-based (i.e., SSD-based) read cache can be deployed for speeding up by caching popular restore contents dynamically. Unfortunately, frequent data updates induced by classical cache schemes (e.g., LRU and LFU) significantly shorten SSDs' lifetime while slowing down I/O processes in SSDs. To address this problem, we propose a new solution-LOP-Cache-to greatly improve the write durability of SSDs as well as I/O performance by enlarging the proportion of long-term popular (LOP) data among data written into SSD-based cache. LOP-Cache keeps LOP data in the SSD cache for a long time period to decrease the number of cache replacements. Furthermore, it prevents unpopular or unnecessary data in deduplication containers from being written into the SSD cache. We implemented LOP-Cache in a prototype deduplication system to evaluate its performance. Our experimental results indicate that LOP-Cache shortens the latency of selective restore by an average of 37.3% at the cost of a small SSD-based cache with only 5.56% capacity of the deduplicated data. Importantly, LOP-Cache improves SSDs' lifetime by a factor of 9.77. The evidence shows that LOP-Cache offers a cost-efficient SSD-based read cache solution to boost performance of selective restore for deduplication systems.
Keywordsdata deduplication   solid state drive (SSD)   flash   cache   endurance     
Received 2016-12-06;
本文基金:

This work is supported by the Natural Science Foundation of Beijing under Grant No. 4172031, the Fundamental Research Funds for the Central Universities of China, and the Research Funds of Renmin University of China under Grant No. 16XNLQ02. Xiao Qin's work is supported by the U.S. National Science Foundation under Grant Nos. ⅡS-1618669, CCF-0845257 (CAREER), CNS-0917137, CNS-0757778, CCF-0742187, CNS-0831502, CNS-0855251, and OCI-0753305. Xiao Qin's study is also supported by the Programme of Introducing Talents of Discipline to Universities (111 Project) in China under Grant No. B07038.

通讯作者: Yun-Peng Chai     Email: ypchai@ruc.edu.cn
About author: Jian Liu received his B.E. degree in electronic information engineering from China Agricultural University, Beijing, in 2012, and his M.E. degree in computer science and technology from National Computer System Engineering Research Institute of China, Beijing, in 2015. He is currently a Ph.D. student majored in computer science at Louisiana State University, Baton Rouge. His research interests include data deduplication, SSD-based storage systems, etc.
引用本文:   
Jian Liu, Yun-Peng Chai, Xiao Qin, Yao-Hong Liu.一种用于提高数据去重系统选择性恢复性能的高耐久性固态硬盘读缓存[J]  Journal of Computer Science and Technology , 2018,V33(1): 58-78
Jian Liu, Yun-Peng Chai, Xiao Qin, Yao-Hong Liu.Endurable SSD-Based Read Cache for Improving the Performance of Selective Restore from Deduplication Systems[J]  Journal of Computer Science and Technology, 2018,V33(1): 58-78
链接本文:  
http://jcst.ict.ac.cn:8080/jcst/CN/10.1007/s11390-018-1808-5
Copyright 2010 by Journal of Computer Science and Technology