Gfarm/BB—Gfarm File System for Node-Local Burst Buffer

    Abstract: Burst buffers have become a major component in meeting the I/O performance requirements of bursty HPC traffic. This paper proposes Gfarm/BB, a file system for burst buffers that efficiently exploits node-local storage systems. Although node-local storage improves storage performance, it is available only during the job allocation. Gfarm/BB therefore needs to provide better access and metadata performance while being constructed on demand before job execution. To improve read and write performance, it exploits file descriptor passing and remote direct memory access (RDMA). Because it is a temporary file system, it improves metadata performance by omitting persistency and redundancy. With RDMA, write and read bandwidths improve by 1.7x and 2.2x, respectively, compared with IP over InfiniBand (IPoIB). Directory creation reaches 14 700 operations per second, which is 13.4x faster than the fully persistent and redundant case. Constructing Gfarm/BB takes 0.31 seconds using 2 nodes. The IOR benchmark and the ARGOT-IO application I/O benchmark show scalable performance improvement from exploiting the locality of node-local storage. Compared with BeeOND, Gfarm/BB shows 2.6x and 2.4x better performance in the IOR write and read benchmarks, respectively, and 2.5x better performance in ARGOT-IO.

     
