基于组的高性能迭代增量处理框架

余辉; 姜新宇; 赵进; 齐豪; 张宇; 廖小飞; 刘海坤; 毛伏兵; 金海

doi:10.1007/s11390-022-2101-1

基于组的高性能迭代增量处理框架

Toward High-Performance Delta-Based Iterative Processing with a Group-Based Approach

摘要

摘要: 许多系统采用基于增量的迭代执行模型来支持分布式平台上的迭代算法。然而，由于迭代算法需要一次又一次地重复处理相同的大规模数据，直到结果满足用户给定的收敛或停止条件，因此迭代算法在分布式平台上仍然需要很长时间才能收敛。并且，对于大规模迭代算法，现有的同步方迭代方案需要使用同步屏障机制将整个执行流程划分为数个迭代周期，因此存在收敛速度慢和负载不平衡的问题，而现有的异步方法虽然消除了同步屏障，但是同时处理相邻图顶点会导致共享内存的读写冲突，导致过多的冗余通信和计算成本。本文提出一种基于组的迭代执行模型 Aiter-R，以及一种高效的启发式调度算法，允许现有迭代算法自适应地在同步与异步迭代方案中选择最佳的性能平衡，以达到最大效率以有效地支持分布式平台上的大规模基于增量的迭代算法。实验结果表明，Aiter-R 在同步和异步策略之间取得了很好的平衡，并且优于现有主流解决方案。与现有的异步和同步模型相比，它分别减少了高达 54.1% 和 84.6% 的执行时间。

Abstract: Many systems have been built to employ the delta-based iterative execution model to support iterative algorithms on distributed platforms by exploiting the sparse computational dependencies between data items of these iterative algorithms in a synchronous or asynchronous approach. However, for large-scale iterative algorithms, existing synchronous solutions suffer from slow convergence speed and load imbalance, because of the strict barrier between iterations; while existing asynchronous approaches induce excessive redundant communication and computation cost as a result of being barrier-free. In view of the performance trade-off between these two approaches, this paper designs an efficient execution manager, called Aiter-R, which can be integrated into existing delta-based iterative processing systems to efficiently support the execution of delta-based iterative algorithms, by using our proposed group-based iterative execution approach. It can efficiently and correctly explore the middle ground of the two extremes. A heuristic scheduling algorithm is further proposed to allow an iterative algorithm to adaptively choose its trade-off point so as to achieve the maximum efficiency. Experimental results show that Aiter-R strikes a good balance between the synchronous and asynchronous policies and outperforms state-of-the-art solutions. It reduces the execution time by up to 54.1% and 84.6% in comparison with existing asynchronous and the synchronous models, respectively.

HTML全文

参考文献()

施引文献

资源附件()