Landing Stencil Code on Godson-T

Hui-Min Cui; Lei Wang; Dong-Rui Fan; Xiao-Bing Feng

doi:10.1007/s11390-010-1069-4

Hui-Min Cui, Lei Wang, Dong-Rui Fan, Xiao-Bing Feng. Landing Stencil Code on Godson-T. Journal of Computer Science and Technology, 2010, 25(4): 886-894. DOI: 10.1007/s11390-010-1069-4

Abstract

The advent of multi-core/many-core chip technology offers both an extraordinary opportunity and a profound challenge. In particular, computer architects and system software designers have a unique opportunity to introduce new architectural features together with adequate compiler technology; combined, the two may have a profound impact. This paper presents a case study, using the 1-D Jacobi computation, of compiler-amenable performance optimization techniques on the many-core architecture Godson-T. Two distinctive features of Godson-T are chosen for this study: 1) chip-level globally addressable memory, in particular the scratchpad memories (SPMs) local to the processing cores; 2) fine-grain memory-based synchronization (e.g., full-empty bits). Leveraging state-of-the-art performance optimization methods for 1-D stencil parallelization (e.g., timed tiling and its variants), we developed and implemented a number of many-core-based optimizations for Godson-T. Our experimental study shows good performance in both execution-time speedup and scalability, validates the value of globally accessible SPM and the fine-grain synchronization mechanism (full-empty bits) on Godson-T, and provides some useful guidelines for future compiler technology for many-core chip architectures.
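For readers unfamiliar with the kernel named in the abstract, the following is a minimal serial sketch of a 1-D Jacobi computation. The abstract does not give the exact stencil, so the 3-point average, the fixed boundary cells, and the function name `jacobi_1d` are all assumptions made for illustration; it shows only the baseline sweep and does not attempt the tiling, SPM placement, or full-empty-bit synchronization studied in the paper.

```python
def jacobi_1d(a, steps):
    """Run `steps` Jacobi sweeps of a 3-point averaging stencil over `a`.

    Assumptions of this sketch (not taken from the paper): the stencil is an
    unweighted 3-point average, and the boundary cells a[0] and a[-1] are
    held fixed across all time steps.
    """
    cur = list(a)  # values at time step t
    nxt = list(a)  # values at time step t + 1 (boundaries already in place)
    for _ in range(steps):
        # Every interior point reads only the previous time step, so all
        # points within one sweep are independent of each other.
        for i in range(1, len(cur) - 1):
            nxt[i] = (cur[i - 1] + cur[i] + cur[i + 1]) / 3.0
        cur, nxt = nxt, cur  # swap buffers instead of copying
    return cur


print(jacobi_1d([0.0, 3.0, 0.0], 1))  # [0.0, 1.0, 0.0]
```

Each sweep depends on the complete result of the previous one. That dependence across time steps is what tiling schemes for stencils restructure, and it is where synchronization between neighbouring cores becomes necessary in a parallel version.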
