Journal of Computer Science and Technology
Journal of Computer Science and Technology, 2017, Vol. 32, Issue 1: 26-40. DOI: 10.1007/s11390-017-1680-8
Computer Architecture and Systems
A Lookahead Read Cache: Improving Read Performance for Deduplication Backup Storage
Dongchul Park, Ziqi Fan, Young Jin Nam, and David H. C. Du, Fellow, IEEE
Department of Computer Science and Engineering, University of Minnesota-Twin Cities, Minneapolis, MN 55455, U.S.A.

Abstract: Data deduplication (dedupe for short) is a specialized data compression technique. It has been widely adopted, particularly in backup storage systems, to save both backup time and storage space. Consequently, most dedupe research has focused on improving dedupe write performance. However, the read performance of dedupe backup storage is also a crucial issue for storage recovery. This paper designs a new read cache for dedupe storage in backup applications that improves read performance by exploiting a special characteristic: the read sequence is the same as the write sequence. For better cache utilization, the cache looks ahead at future references within a moving window and evicts the victims with the fewest future accesses. To further improve read cache performance, it also maintains a small log buffer that judiciously caches data chunks that will be accessed in the future. Extensive experiments with real-world backup workloads demonstrate that the proposed read cache scheme improves read performance by up to 64.3%.
Keywords: deduplication, dedupe, read cache, backup
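
The eviction idea described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function name `simulate_lookahead_cache`, the parameters `capacity` and `window`, and the tie-breaking rule are all assumptions made for illustration, and the log buffer mentioned in the abstract is omitted. The sketch only assumes that the full read sequence is known in advance, which the abstract states holds for backup restores.

```python
from collections import Counter


def simulate_lookahead_cache(read_sequence, capacity, window):
    """Return the number of cache hits for a known read sequence.

    Hypothetical sketch: on a miss with a full cache, evict the cached
    item that has the fewest references within the next `window` reads.
    """
    cache = set()
    hits = 0
    for i, item in enumerate(read_sequence):
        if item in cache:
            hits += 1
            continue
        if len(cache) >= capacity:
            # Count references to each item inside the lookahead window.
            future = Counter(read_sequence[i + 1 : i + 1 + window])
            # Assumption: ties are broken by the smallest item id so that
            # runs are deterministic; the paper may use a different rule.
            victim = min(cache, key=lambda c: (future[c], c))
            cache.remove(victim)
        cache.add(item)
    return hits


if __name__ == "__main__":
    reads = list("ABCABDA")
    print(simulate_lookahead_cache(reads, capacity=2, window=4))
    print(simulate_lookahead_cache(reads, capacity=2, window=0))
```

With a window of 0 the policy has no future knowledge and falls back to the arbitrary tie-break, so comparing the two calls gives a rough feel for how much the lookahead helps on a given sequence.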
Received: 2016-03-11
Funding:

This work is partially supported by the National Science Foundation Awards of USA under Grant Nos. 121756, 1305237, 142191 and 1439622.

About the author: Dongchul Park is currently a research scientist in the Memory Solutions Laboratory (MSL) at Samsung Semiconductor Inc. in San Jose, California. He received his Ph.D. degree in computer science and engineering from the University of Minnesota-Twin Cities, Minneapolis, in 2012, where he was a member of the Center for Research in Intelligent Storage (CRIS) group, advised by Professor David H. C. Du. His research interests focus on storage system design and applications, including non-volatile memories, in-storage computing, big data processing, Hadoop MapReduce, data deduplication, key-value stores, cloud computing, and shingled magnetic recording (SMR) technology.
Cite this article:
Dongchul Park, Ziqi Fan, Young Jin Nam, David H. C. Du. A Lookahead Read Cache: Improving Read Performance for Deduplication Backup Storage [J]. Journal of Computer Science and Technology, 2017, V32(1): 26-40
Article link:
http://jcst.ict.ac.cn:8080/jcst/CN/10.1007/s11390-017-1680-8