We use cookies to improve your experience with our site.
庄毅敏, 胡杏, 陈小兵, 支天. DyPipe: 基于动态软流水方法加速动态神经网络[J]. 计算机科学技术学报, 2023, 38(4): 899-910. DOI: 10.1007/s11390-021-1161-y
引用本文: 庄毅敏, 胡杏, 陈小兵, 支天. DyPipe: 基于动态软流水方法加速动态神经网络[J]. 计算机科学技术学报, 2023, 38(4): 899-910. DOI: 10.1007/s11390-021-1161-y
Zhuang YM, Hu X, Chen XB et al. DyPipe: A holistic approach to accelerating dynamic neural networks with dynamic pipelining. JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY 38(4): 899−910 July 2023. DOI: 10.1007/s11390-021-1161-y.
Citation: Zhuang YM, Hu X, Chen XB et al. DyPipe: A holistic approach to accelerating dynamic neural networks with dynamic pipelining. JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY 38(4): 899−910 July 2023. DOI: 10.1007/s11390-021-1161-y.

DyPipe: 基于动态软流水方法加速动态神经网络

DyPipe: A Holistic Approach to Accelerating Dynamic Neural Networks with Dynamic Pipelining

  • 摘要: 动态神经网络具有强大的复杂网络模型的表达能力,在近年来逐渐被自然语言处理等领域采用。但是,现有的基于神经网络加速器的优化工作通常关注于静态神经网络的优化,在编译时利用静态调度等方式实现对静态计算图的优化。目前仍缺乏一种系统完整的方法用于在神经网络加速器上高效执行动态神经网络,使得神经网络加速器应用场景受限。在本文中,我们分析了动态神经网络在神经网络加速器上的执行过程,并发现了动态特征会影响加速器软流水调度的效率。基于这一发现,我们提出了DyPipe,一种基于动态软流水的方法用于加速动态神经网络在神经网络加速器上的执行速度。DyPipe利用神经网络加速器的片上存储,设计了上下文缓存模块。该模块可以在运行时保存流水信息,并通过运行时调度器实现动态软流水调度。同时,DyPipe设计了特殊的编程接口,通过对算子运算逻辑的分块并绑定硬件模块,从而减少运行时开销。本文的实验证明,DyPipe相比于静态调度方式能够在动态神经网络上取得1.7倍的提升并且在静态神经网络上仍能保持96%以上的执行效率。本文提出的DyPipe方法能够在引入较小运行时开销的条件下,显著提升动态神经网络的执行效率,使得神经网络加速器能够高效地部署在更多的应用场景。

     

    Abstract: Dynamic neural network (NN) techniques are increasingly important because they facilitate deep learning techniques with more complex network architectures. However, existing studies, which predominantly optimize the static computational graphs by static scheduling methods, usually focus on optimizing static neural networks in deep neural network (DNN) accelerators. We analyze the execution process of dynamic neural networks and observe that dynamic features introduce challenges for efficient scheduling and pipelining in existing DNN accelerators. We propose DyPipe, a holistic approach to optimizing dynamic neural network inferences in enhanced DNN accelerators. DyPipe achieves significant performance improvements for dynamic neural networks while it introduces negligible overhead for static neural networks. Our evaluation demonstrates that DyPipe achieves 1.7x speedup on dynamic neural networks and maintains more than 96% performance for static neural networks.

     

/

返回文章
返回