We use cookies to improve your experience with our site.
GeRong, FengXizhou, ZouPengfei, AllenTyler. 功率受限的高性能计算范式[J]. 计算机科学技术学报, 2023, 38(1): 87-102. DOI: 10.1007/s11390-023-2885-7
引用本文: GeRong, FengXizhou, ZouPengfei, AllenTyler. 功率受限的高性能计算范式[J]. 计算机科学技术学报, 2023, 38(1): 87-102. DOI: 10.1007/s11390-023-2885-7
Ge R, Feng X, Zou P et al. The paradigm of power bounded high-performance computing. JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY 38(1): 87−102 Jan. 2023. DOI: 10.1007/s11390-023-2885-7.
Citation: Ge R, Feng X, Zou P et al. The paradigm of power bounded high-performance computing. JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY 38(1): 87−102 Jan. 2023. DOI: 10.1007/s11390-023-2885-7.

功率受限的高性能计算范式

The Paradigm of Power Bounded High-Performance Computing

  • 摘要: 现代计算机系统越来越受到从单个组件到数据中心的多层可用或允许功率的限制。为了应对这一现实,有必要了解功率界限如何影响性能,特别是对于从高端节点构建的系统,每个节点都包含多个耗电组件。由于在节点或组件上放置不适当的功率限制会导致严重的性能损失,因此在给定总功率预算的情况下,必须协调节点和组件之间的功率分配,从而实现所需的性能。在该篇论文中,作者描述了功率受限的高性能计算范式,该范式认为协调功率受限分配是计算机系统性能分析和优化的关键因素。作者将此范式应用于 CPU 和 GPU 计算的跨多层的功率协调问题。通过几个案例研究,作者展示了如何应用和平衡功率协调原则,并使其适应工作负载、硬件技术和可用总功率的相互作用,从而提高性能。

     

    Abstract: Modern computer systems are increasingly bounded by the available or permissible power at multiple layers from individual components to data centers. To cope with this reality, it is necessary to understand how power bounds impact performance, especially for systems built from high-end nodes, each consisting of multiple power hungry components. Because placing an inappropriate power bound on a node or a component can lead to severe performance loss, coordinating power allocation among nodes and components is mandatory to achieve desired performance given a total power budget. In this article, we describe the paradigm of power bounded high-performance computing, which considers coordinated power bound assignment to be a key factor in computer system performance analysis and optimization. We apply this paradigm to the problem of power coordination across multiple layers for both CPU and GPU computing. Using several case studies, we demonstrate how the principles of balanced power coordination can be applied and adapted to the interplay of workloads, hardware technology, and the available total power for performance improvement.

     

/

返回文章
返回