Kai Zhang, Feng Chen, Xiaoning Ding, Yin Huai, Rubao Lee, Tian Luo, Kaibo Wang, Yuan Yuan, Xiaodong Zhang. Hetero-DB: Next Generation High-Performance Database Systems by Best Utilizing Heterogeneous Computing and Storage Resources[J]. Journal of Computer Science and Technology, 2015, 30(4): 657-678. DOI: 10.1007/s11390-015-1553-y

Hetero-DB: Next Generation High-Performance Database Systems by Best Utilizing Heterogeneous Computing and Storage Resources

Abstract: With recent advances in hardware technology, new general-purpose high-performance devices such as the graphics processing unit (GPU) and the solid state drive (SSD) have been widely adopted. For applications with massive data parallelism, a GPU may offer an order of magnitude higher throughput than a multicore CPU. An SSD, in turn, offers much higher I/O throughput and lower latency than a traditional hard disk drive (HDD). These devices can significantly boost the performance of many applications, so the database community has been actively working to adopt them in database systems. However, the performance benefit cannot be easily reaped if the new hardware is used improperly. In this paper, we propose Hetero-DB, a high-performance database system whose design and implementation exploit both the characteristics of database systems and the special properties of the new hardware devices. Hetero-DB develops a GPU-aware query execution engine with GPU device memory management and a query scheduling mechanism to support concurrent query execution. Furthermore, for the SSD-HDD hybrid storage system, we redesign the storage engine by organizing HDD and SSD into a two-level caching hierarchy. To best utilize the hybrid storage devices, Hetero-DB identifies the semantic information that is critical for storage I/O and passes it to the storage manager, which has great potential to improve storage efficiency and performance. Hetero-DB aims to maximize the performance benefits of GPU and SSD, and demonstrates their effectiveness in the design of next-generation database systems.

     
