? A Lookahead Read Cache: Improving Read Performance for Deduplication Backup Storage
Journal of Computer Science and Technology
Quick Search in JCST
 Advanced Search 
      Home | PrePrint | SiteMap | Contact Us | FAQ
 
Indexed by   SCIE, EI ...
Bimonthly    Since 1986
Journal of Computer Science and Technology 2017, Vol. 32 Issue (1) :26-40    DOI: 10.1007/s11390-017-1680-8
Computer Architecture and Systems Current Issue | Archive | Adv Search << Previous Articles | Next Articles >>
A Lookahead Read Cache: Improving Read Performance for Deduplication Backup Storage
Dongchul Park, Ziqi Fan, Young Jin Nam, and David H. C. Du, Fellow, IEEE
Department of Computer Science and Engineering, University of Minnesota-Twin Cities, Minneapolis, MN 55455, U.S.A

Abstract
Reference
Related Articles
Download: [PDF 843KB]     Export: BibTeX or EndNote (RIS)  
Abstract Data deduplication (dedupe for short) is a special data compression technique. It has been widely adopted to save backup time as well as storage space, particularly in backup storage systems. Therefore, most dedupe research has primarily focused on improving dedupe write performance. However, backup storage dedupe read performance is also a crucial problem for storage recovery. This paper designs a new dedupe storage read cache for backup applications that improves read performance by exploiting a special characteristic:the read sequence is the same as the write sequence. Consequently, for better cache utilization, by looking ahead for future references within a moving window, it evicts victims from the cache having the smallest future access. Moreover, to further improve read cache performance, it maintains a small log buffer to judiciously cache future access data chunks. Extensive experiments with real-world backup workloads demonstrate that the proposed read cache scheme improves read performance by up to 64.3%
Articles by authors
Dongchul Park
Ziqi Fan
Young Jin Nam
David H. C. Du
Keywordsdeduplication   dedupe   read cache   backup     
Received 2016-03-11;
Fund:

This work is partially supported by the National Science Foundation Awards of USA under Grant Nos. 121756, 1305237, 142191 and 1439622.

About author: Dongchul Park is currently a research scientist in Memory Solutions Laboratory (MSL) at Samsung Semiconductor Inc. in San Jose, California. He received his Ph.D. degree in computer science and engineering at the University of Minnesota-Twin Cities, Minneapolis, in 2012, and was a member of Center for Research in Intelligent Storage (CRIS) group under the advice of Professor David H. C. Du. His research interests focus on storage system design and applications including non-volatile memories, in-storage computing, big data processing, Hadoop MapReduce, data deduplication, key-value store, cloud computing, and shingled magnetic recording (SMR) technology.
Cite this article:   
Dongchul Park, Ziqi Fan, Young Jin Nam, David H. C. Du.A Lookahead Read Cache: Improving Read Performance for Deduplication Backup Storage[J]  Journal of Computer Science and Technology, 2017,V32(1): 26-40
URL:  
http://jcst.ict.ac.cn:8080/jcst/EN/10.1007/s11390-017-1680-8
Copyright 2010 by Journal of Computer Science and Technology