›› 2013,Vol. 28 ›› Issue (6): 1025-1044.doi: 10.1007/s11390-013-1395-4

所属专题: Computer Architecture and Systems

• Special Section on Selected Paper from NPC 2011 • 上一篇    下一篇

一种时序的位置感知页面映射的闪存转换层

Youngjae Kim1, Aayush Gupta2, and Bhuvan Urgaonkar3, Senior Member, ACM, IEEE   

  • 收稿日期:2013-01-03 修回日期:2013-05-16 出版日期:2013-11-05 发布日期:2013-11-05
  • 作者简介:Youngjae Kim is a computer science R&D staff member for the National Center for Computational Sciences at Oak Ridge National Laboratory, USA. He received the Ph.D. degree in computer science and engineering from the Department of Computer Science and Engineering of the Pennsylvania State University, USA, in 2009. His research interests include operating systems, parallel I/O and file systems, and storage systems.

A Temporal Locality-Aware Page-Mapped Flash Translation Layer

Youngjae Kim1, Aayush Gupta2, and Bhuvan Urgaonkar3, Senior Member, ACM, IEEE   

  1. 1 National Center for Computational Sciences, Oak Ridge National Laboratory, Oak Ridge, TN 37831, U.S.A.;
    2 IBM Almaden Research Center, San Jose, CA 95120, U.S.A.;
    3 Department of Computer Science and Engineering, Pennsylvania State University, University Park, PA 16802, U.S.A.
  • Received:2013-01-03 Revised:2013-05-16 Online:2013-11-05 Published:2013-11-05
  • Supported by:

    This research was funded in part by the Natural Science Foundation of U.S. under Grant Nos. CCF-0811670, CNS-0720456, a gift from Cisco System, Inc. and partially through the Office of Science of the U.S. Department of Energy under Contract No. DE-AC05-00OR22725.

在企业规模环境下更好地利用闪存,必须处理随机写性能差的问题。我们分析了性能差的重要原因之一:闪存转换层(FTL)的设计,以实现虚拟到物理的地址转换并隐藏闪存的写前擦除特点。我们提出了一种完整的核心FTL引擎设计的模式转换,提出了一种基于需求的闪存转换层(DFTL)。DFTL能选择性地缓存页面层地址匹配。基于一种闪存仿真框架FlashSim,我们在实际企业规模工作负载下的实验评价证明了DFTL在企业规模存储系统中的可用性:改进了性能;减少了垃圾收集工作量;相对目前最流行的实现方式:混合FTL模式,提供了更好的超负载行为。例如:从一个在大规模金融所的OLTP应用的显著地随机写主导的I/O轨迹,显示了平均反应时间相对混合FTL模式的78%的改进(源于垃圾收集器操作的3折缩减)。即便对于著名的读主导的TPC-H基准,DFTL虽然引入了额外的负担, 仍然改进了56%的反应时间。有趣的是,当基于DFTL的SSD上的写回缓存被使用,DFTL甚至优于基于页面的FTL模式,在金融轨迹中改进了72%的反应时间。

Abstract: The poor performance of random writes has been a cause of major concern which needs to be addressed to better utilize the potential of flash in enterprise-scale environments. We examine one of the important causes of this poor performance: the design of the flash translation layer (FTL) which performs the virtual-to-physical address translations and hides the erase-before-write characteristics of flash. We propose a complete paradigm shift in the design of the core FTL engine from the existing techniques with our Demand-Based Flash Translation Layer (DFTL) which selectively caches pagelevel address mappings. Our experimental evaluation using FlashSim with realistic enterprise-scale workloads endorses the utility of DFTL in enterprise-scale storage systems by demonstrating: 1) improved performance, 2) reduced garbage collection overhead and 3) better overload behavior compared with hybrid FTL schemes which are the most popular implementation methods. For example, a predominantly random-write dominant I/O trace from an OLTP application running at a large financial institution shows a 78% improvement in average response time (due to a 3-fold reduction in operations of the garbage collector), compared with the hybrid FTL scheme. Even for the well-known read-dominant TPC-H benchmark, for which DFTL introduces additional overheads, we improve system response time by 56%. Moreover, interestingly, when write-back cache on DFTL-based SSD is enabled, DFTL even outperforms the page-based FTL scheme, improving their response time by 72% in Financial trace.

[1] Gurumurthi S, Sivasubramaniam A, Natarajan V. Disk drive roadmap from the thermal perspective: A case for dynamic thermal management. In Proc. the 32nd International Symposium on Computer Architecture, June 2005, pp.38-49.

[2] Kim Y, Gurumurthi S, Sivasubramaniam A. Understanding the performance-temperature interactions in disk I/O of server workloads. In Proc. the 12th International Symposium on High-Performance Computer Architecture (HPCA), February 2006, pp.176-186.

[3] Mallary M, Torabi A, Benakli M. One terabit per square inch perpendicular recording conceptual design. IEEE Transactions on Magnetics, 2002, 38(4): 1719-1724.

[4] Chen J, Moon J. Detection signal-to-noise ratio versus bit cell aspect ratio at high areal densities. IEEE Transactions on Magnetics, 2001, 37(3): 1157-1167.

[5] Gulati A, Merchant A, Varman P J. pClock: An arrival curve based approach for QoS guarantees in shared storage systems. In Proc. the ACM SIGMETRICS Conference on Measurement and Modeling of Computer Systems, June 2007, pp.1324.

[6] Tehrani S, Slaughter J M, Chen E, Durlam M, Shi J, DeHerren M. Progress and outlook for MRAM technology. IEEE Transactions on Magnetics, 1999, 35(5): 2814-2819.

[7] Shimada Y. FeRAM: Next generation challenges and future directions. In Proc. the IEEE International Symposium on Applications of Ferroelectric, May 2007.

[8] Leventhal A. Flash storage memory. Communications of the ACM, 2008, 51(7): 47-51.

[9] Lee S, Moon B. Design of flash-based DBMS: An in-page logging approach. In Proc. the International Conference on Management of Data (SIGMOD), August 2007, pp.55-66.

[10] Kim H, Ahn S. BPLRU: A buffer management scheme for improving random writes in flash storage. In Proc. the 6th USENIX Conference on File and Storage Technologies (FAST), February 2008, pp.239-252.

[11] Small-block vs. large-block NAND flash devices. Technical Report, TN-29-07, Micron. http://www.micron.com/products/nand/technotes, Jan. 2013.

[12] Lee S, Park D, Chung T, Lee D, Park S, Song H. A log buffer based flash translation layer using fully associative sector translation. ACM Transactions on Embedded Computing Systems, 2007, 6(3): Article No.18.

[13] Kim Y, Taurus B, Gupta A, Urgaonkar B. FlashSim: A Simulator for NAND flash-based solid-state drives. In Proc. the International Conference on Advances in System Simulation (SIMUL), September 2009, pp.125-131.

[14] Gupta A, Kim Y, Urgaonkar B. DFTL: A flash translation layer employing demand-based selective caching of page-level address mappings. In Proc. the 14th International Conference on Architectural Support for Programming Languages and Operating System (ASPLOS), March 2009, pp.229-240.

[15] Hennessy J, Patterson D. Computer Architecture: A Quantitative Approach. San Francisco, USA: Morgan Kaufmann Publishers Inc., 2006.

[16] Kim J, Kim J M, Noh S H, Min S, Cho Y. A space-efficient flash translation layer for compactflash systems. IEEE Transactions on Consumer Electronics, 2002, 48(2): 366-375.

[17] Chung T, Park D, Park S, Lee D, Lee S, Song H. System software for flash memory: A survey. In Proc. the International Conference on Embedded and Ubiquitous Computing, August 2006, pp.394-404.

[18] Kang J, Jo H, Kim J, Lee J. A superblock-based flash translation layer for NAND flash memory. In Proc. the 6th International Conference on Embedded Software (EMSOFT), October 2006, pp.161-170.

[19] Lee S, Shin D, Kim Y, Kim J. LAST: Locality-aware sector translation for NAND flash memory-based storage systems. ACM SIGOPS Operating Systems Review, 2008, 42(6): 3642.

[20] Karedla R, Love J S, Wherry B G. Caching strategies to improve disk system performance. IEEE Transactions on Computer (TC), 1994, 27(3): 38-46.

[21] Kawaguchi A, Nishioka S, Motoda H. A flash-memory based file system. In Proc. the Winter 1995 USENIX Technical Conference, Jan. 1995, pp.155-164.

[22] Bucy J S, Ganger G R. The DiskSim simulation environment version 3.0 reference manual. CMU, January 2003.

[23] Ban A. Flash file system. United States Patent 5404485, April 4, 1995.

[24] Zhang J, Sivasubramaniam A, Franke H, Gautam N, Zhang Y, Nagar S. Synthesizing representative I/O workloads for TPC-H. In Proc. the 10th International Symposium on High Performance Computer Architecture (HPCA), Feb. 2004, pp.142-151.

[25] Park D, Debnath B, Du D H C. A workload-aware adaptive hybrid flash translation layer with an efficient caching strategy. In Proc. the 19th IEEE Annual International Symposium on Modelling, Analysis, and Simulation of Computer and Telecommunication Systems, July 2011, pp.248-255.

[26] Budilovsky E, Toledo S, Zuck A. Prototyping a highperformance low-cost solid-state disk. In Proc. the 4th Annual International Conference on Systems and Storage, May 30-June 1, 2011, Article No. 13.

[27] Choudhuri S, Givargis T. Performance improvement of block based NAND flash translation layer. In Proc. the 15th IEEE/ACM International Conference on Hardware/Software Codesign and System Synthesis (CODES+ISSS), September 30-October 3, 2007, pp.257-262.

[28] Wu C H, Kuo T W, Chang L P. An efficient B-tree layer implementation for flash-memory storage systems. ACM Trans. Embed. Comput. Syst., 2007, 6(3): Article No. 19.

[29] Kwon H, Kim E, Choi J, Lee D, Noh S H. Janus-FTL: Finding the optimal point on the spectrum between page and block mapping schemes. In Proc. the 10th ACM International Conference on Embedded Software, Oct. 2010, pp.169-178.
No related articles found!
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
[1] 张钹; 张铃;. Statistical Heuristic Search[J]. , 1987, 2(1): 1 -11 .
[2] 孟力明; 徐晓飞; 常会友; 陈光熙; 胡铭曾; 李生;. A Tree-Structured Database Machine for Large Relational Database Systems[J]. , 1987, 2(4): 265 -275 .
[3] 林琦; 夏培肃;. The Design and Implementation of a Very Fast Experimental Pipelining Computer[J]. , 1988, 3(1): 1 -6 .
[4] 孙成政; 慈云桂;. A New Method for Describing the AND-OR-Parallel Execution of Logic Programs[J]. , 1988, 3(2): 102 -112 .
[5] 张焰; 何继潮;. Data Dependencies in Database with Incomplete Information[J]. , 1988, 3(2): 131 -138 .
[6] 张福炎; 蔡士杰; 王曙; 葛如顶;. The Human-Computer Dialogue Management of FCAD System[J]. , 1988, 3(3): 221 -227 .
[7] 张钹; 张恬; 张建伟; 张铃;. Motion Planning for Robots with Topological Dimension Reduction Method[J]. , 1990, 5(1): 1 -16 .
[8] 王鼎兴; 郑纬民; 杜晓黎; 郭毅可;. On the Execution Mechanisms of Parallel Graph Reduction[J]. , 1990, 5(4): 333 -346 .
[9] 蔡士杰; 张福炎;. A Fast Algorithm for Polygon Operations[J]. , 1991, 6(1): 91 -96 .
[10] 周权; 魏道政;. A Complete Critical Path Algorithm for Test Generation of Combinational Circuits[J]. , 1991, 6(1): 74 -82 .
版权所有 © 《计算机科学技术学报》编辑部
本系统由北京玛格泰克科技发展有限公司设计开发 技术支持:support@magtech.com.cn
总访问量: