We use cookies to improve your experience with our site.
陈荔城, 陈明宇, 阮元, 黄永兵, 崔泽汉, 卢天越, 包云岗. 一种消息式内存系统[J]. 计算机科学技术学报, 2014, 29(2): 255-272. DOI: 10.1007/s11390-014-1428-7
引用本文: 陈荔城, 陈明宇, 阮元, 黄永兵, 崔泽汉, 卢天越, 包云岗. 一种消息式内存系统[J]. 计算机科学技术学报, 2014, 29(2): 255-272. DOI: 10.1007/s11390-014-1428-7
Li-Cheng Chen, Ming-Yu Chen, Yuan Ruan, Yong-Bing Huang, Ze-Han Cui, Tian-Yue Lu, Yun-Gang Bao. MIMS:Towards a Message Interface Based Memory System[J]. Journal of Computer Science and Technology, 2014, 29(2): 255-272. DOI: 10.1007/s11390-014-1428-7
Citation: Li-Cheng Chen, Ming-Yu Chen, Yuan Ruan, Yong-Bing Huang, Ze-Han Cui, Tian-Yue Lu, Yun-Gang Bao. MIMS:Towards a Message Interface Based Memory System[J]. Journal of Computer Science and Technology, 2014, 29(2): 255-272. DOI: 10.1007/s11390-014-1428-7

一种消息式内存系统

MIMS:Towards a Message Interface Based Memory System

  • 摘要: 在多核及大数据时代,传统的基于同步总线接口的内存系统会限制很多创新的实现而面临着很多挑战(或瓶颈)。在本文中,我们主张在内存系统中使用消息式接口来替换传统的同步总线接口。为此本文提出了一种新型的消息式内存系统命名为MIMS。MIMS的主要创新在于处理器和内存系统之间通过一种统一的灵活消息包接口进行通信,而不再使用严格的同步时序协议。每个消息包除了能够支持集成多个访存请求、操作命令外,还可以集成额外的上层语义信息。通过在片外增加一种本地的缓冲调度器硬件使内存系统更加智能和主动,缓冲调度器负责接收片上内存控制器发送过来的访存请求消息包,解析消息包,执行访存请求调度,构造响应包返回给处理器,以及在本地执行一些特殊命令。另外消息包中的上层语义信息,可用于指导缓冲调度器的优化。消息式内存接口具有很好的可扩展性,不仅可以很自然地将之前的针对内存体系结构的各种优化方法集成进来,还提供了新的优化机会,如多访存请求的地址压缩和连续地址访存请求的合并。基于时钟精确的消息式内存模拟器上的实验结果表明,在16核配置下,为访存请求集成访存粒度信息,消息式内存可以提高系统性能53.21%,降低功耗时间积55.90%。进一步消息式内存还可以提高内存带宽有效利用率62.42%,并降低平均访存延迟51%。

     

    Abstract: The decades-old synchronous memory bus interface has restricted many innovations in the memory system, which is facing various challenges (or walls) in the era of multi-core and big data. In this paper, we argue that a message-based interface should be adopted to replace the traditional bus-based interface in the memory system. A novel message interface based memory system called MIMS is proposed. The key innovation of MIMS is that processors communicate with the memory system through a universal and flexible message packet interface. Each message packet is allowed to encapsulate multiple memory requests (or commands) and additional semantic information. The memory system is more intelligent and active by equipping with a local buffer scheduler, which is responsible for processing packets, scheduling memory requests, preparing responses, and executing specific commands with the help of semantic information. Under the MIMS framework, many previous innovations on memory architecture as well as new optimization opportunities such as address compression and continuous requests combination can be naturally incorporated. The experimental results on a 16-core cycle-detailed simulation system show that: with accurate granularity message, MIMS can improve system performance by 53.21% and reduce energy delay product (EDP) by 55.90%. Furthermore, it can improve effective bandwidth utilization by 62.42% and reduce memory access latency by 51% on average.

     

/

返回文章
返回