Reinventing Memory System Design for Many-Accelerator Architecture

Ying Wang; Lei Zhang; Yin-He Han; Hua-Wei Li

doi:10.1007/s11390-014-1429-6

Ying Wang, Lei Zhang, Yin-He Han, Hua-Wei Li. Reinventing Memory System Design for Many-Accelerator ArchitectureJ. Journal of Computer Science and Technology, 2014, 29(2): 273-280. DOI: 10.1007/s11390-014-1429-6

Citation:

Reinventing Memory System Design for Many-Accelerator Architecture

Abstract

Abstract

The many-accelerator architecture, mostly composed of general-purpose cores and accelerator-like function units (FUs), becomes a great alternative to homogeneous chip multiprocessors (CMPs) for its superior power-effciency. However, the emerging many-accelerator processor shows a much more complicated memory accessing pattern than general purpose processors (GPPs) because the abundant on-chip FUs tend to generate highly-concurrent memory streams with distinct locality and bandwidth demand. The disordered memory streams issued by diverse accelerators exhibit a mutual-interference behavior and cannot be effciently handled by the orthodox main memory interface that provides an inflexible data fetching mode. Unlike the traditional DRAM memory, our proposed Aggregation Memory System (AMS) can function adaptively to the characterized memory streams from different FUs, because it provides the FUs with different data fetching sizes and protects their locality in memory access by intelligently interleaving their data to memory devices through sub-rank binding. Moreover, AMS can batch the requests without sub-rank conflict into a read burst with our optimized memory scheduling policy. Experimental results from trace-based simulation show both conspicuous performance boost and energy saving brought by AMS.

FullText(HTML)

References (24)

Relative Articles

Supplements (0)

Cited By

Reinventing Memory System Design for Many-Accelerator Architecture

Abstract

Catalog

Export File

Citation

Format

Content