Journal of Computer Science and Technology
Quick Search in JCST
 Advanced Search 
      Home | PrePrint | SiteMap | Contact Us | FAQ
 
Indexed by   SCIE, EI ...
Bimonthly    Since 1986
Journal of Computer Science and Technology 2013, Vol. 28 Issue (1) :72-89    DOI: 10.1007/s11390-013-1313-9
Architecture and VLSI Design Current Issue | Archive | Adv Search << Previous Articles | Next Articles >>
Design and Implementation of an Extended Collectives Library for Unified Parallel C
Carlos Teijeiro1, Student Member, IEEE, Guillermo L. Taboada1, Juan Touriño1, Senior Member, IEEE, Member, ACM, Ramón Doallo1, Member, IEEE, José C. Mouriño2, Damián A. Mallón3, and Brian Wibecan4
1. Computer Architecture Group, University of A Coruña, A Coruña 15071, Spain;
2. Galicia Supercomputing Center, Santiago de Compostela 15705, Spain;
3. Jülich Supercomputing Centre, Institute for Advanced Simulation, Forschungszentrum Jülich, Jülich D-52425, Germany;
4. Industry Standard Servers Group, Hewlett-Packard Company, Montgomery, Alabama 36117, U.S.A.

Abstract
Reference
Related Articles
Download: [PDF 2418KB]     Export: BibTeX or EndNote (RIS)  
Abstract Unified Parallel C (UPC) is a parallel extension of ANSI C based on the Partitioned Global Address Space (PGAS) programming model, which provides a shared memory view that simplifies code development while it can take advantage of the scalability of distributed memory architectures. Therefore, UPC allows programmers to write parallel applications on hybrid shared/distributed memory architectures, such as multi-core clusters, in a more productive way, accessing remote memory by means of different high-level language constructs, such as assignments to shared variables or collective primitives. However, the standard UPC collectives library includes a reduced set of eight basic primitives with quite limited functionality. This work presents the design and implementation of extended UPC collective functions that overcome the limitations of the standard collectives library, allowing, for example, the use of a specific source and destination thread or defining the amount of data transferred by each particular thread. This library fulfills the demands made by the UPC developers community and implements portable algorithms, independent of the specific UPC compiler/runtime being used. The use of a representative set of these extended collectives has been evaluated using two applications and four kernels as case studies. The results obtained confirm the suitability of the new library to provide easier programming without trading off performance, thus achieving high productivity in parallel programming to harness the performance of hybrid shared/distributed memory architectures in high performance computing.
Articles by authors
Carlos Teijeiro
Guillermo L. Taboada
Juan Tourino
Ramón Doallo
José C. Mourino
Damián A. Mallón
Brian Wibecan
KeywordsUnified Parallel C   collective operation   programmability   partitioned global address space   high performance computing     
Received 2012-02-08;
Fund:

This work was funded by Hewlett-Packard (Project "Improving UPC Usability and Performance in Constellation Systems: Imple- mentation/Extensions of UPC Libraries"), and partially supported by the Ministry of Science and Innovation of Spain under Project No. TIN2010-16735 and the Galician Government (Consolidation of Competitive Research Groups, Xunta de Galicia ref. 2010/6).

Cite this article:   
Carlos Teijeiro, Guillermo L. Taboada, Juan Touriño, Ramón Doallo, José C. Mouriño, Damián A. Mallón, and Brian Wibecan.Design and Implementation of an Extended Collectives Library for Unified Parallel C[J]  Journal of Computer Science and Technology, 2013,V28(1): 72-89
URL:  
http://jcst.ict.ac.cn:8080/jcst/EN/10.1007/s11390-013-1313-9
Copyright 2010 by Journal of Computer Science and Technology