UPC++: A High-Performance Communication Framework for Asynchronous Computation

Cited by: 26
Authors
Bachan, John [1]
Baden, Scott B. [1]
Hofmeyr, Steven [1]
Jacquelin, Mathias [1]
Kamil, Amir [1, 2]
Bonachea, Dan [1]
Hargrove, Paul H. [1]
Ahmed, Hadia [1]
Affiliations
[1] Lawrence Berkeley National Laboratory, Computational Research Division, Berkeley, CA 94720, USA
[2] University of Michigan, Ann Arbor, MI 48109, USA
Keywords
Asynchronous; PGAS; RMA; RPC; Exascale
DOI
10.1109/IPDPS.2019.00104
Chinese Library Classification (CLC) number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
UPC++ is a C++ library that supports high-performance computation via an asynchronous communication framework. This paper describes a new incarnation that differs substantially from its predecessor, and we discuss the reasons for our design decisions. We present new design features, including future-based asynchrony management, distributed objects, and generalized Remote Procedure Call (RPC). We show microbenchmark performance results demonstrating that one-sided Remote Memory Access (RMA) in UPC++ is competitive with MPI-3 RMA; on a Cray XC40, UPC++ delivers up to a 25% improvement in the latency of blocking RMA put, and up to a 33% bandwidth improvement in an RMA throughput test. We showcase the benefits of UPC++ with irregular applications through a pair of application motifs, a distributed hash table and a sparse solver component. Our distributed hash table in UPC++ delivers near-linear weak scaling up to 34816 cores of a Cray XC40. Our UPC++ implementation of the sparse solver component shows robust strong scaling up to 2048 cores, where it outperforms variants communicating using MPI by up to 3.1x. UPC++ encourages the use of aggressive asynchrony in low-overhead RMA and RPC, improving programmer productivity and delivering high performance in irregular applications.
Pages: 963-973
Page count: 11
Related papers
50 in total
  • [1] High-performance dataflow computing in hybrid memory systems with UPC++ DepSpawn
    Fraguela, Basilio B.
    Andrade, Diego
    JOURNAL OF SUPERCOMPUTING, 2021, 77 (07): 7676-7689
  • [2] A Simple Multithreaded C++ Framework for High-Performance Data Acquisition Systems
    Ingles, Rolando
    Perek, Piotr
    Orlikowski, Mariusz
    Napieralski, Andrzej
    2015 22ND INTERNATIONAL CONFERENCE MIXED DESIGN OF INTEGRATED CIRCUITS & SYSTEMS (MIXDES), 2015: 153-157
  • [3] ECF: A C++ framework for evolutionary computation
    Jakobovic, Domagoj
    Durasevic, Marko
    Picek, Stjepan
    Gasperov, Bruno
    SOFTWAREX, 2024, 27
  • [4] The New UPC++ DepSpawn High Performance Library for Data-Flow Computing with Hybrid Parallelism
    Fraguela, Basilio B.
    Andrade, Diego
    COMPUTATIONAL SCIENCE - ICCS 2022, PT I, 2022: 761-774
  • [5] Enabling Performance Efficient Runtime Support for Hybrid MPI+UPC++ Programming Models
    Hashmi, Jahanzeb Maqbool
    Hamidouche, Khaled
    Panda, Dhabaleswar K.
    PROCEEDINGS OF 2016 IEEE 18TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING AND COMMUNICATIONS; IEEE 14TH INTERNATIONAL CONFERENCE ON SMART CITY; IEEE 2ND INTERNATIONAL CONFERENCE ON DATA SCIENCE AND SYSTEMS (HPCC/SMARTCITY/DSS), 2016: 1180-1187
  • [6] HPTT: A High-Performance Tensor Transposition C++ Library
    Springer, Paul
    Su, Tong
    Bientinesi, Paolo
    ARRAY'17: PROCEEDINGS OF THE 4TH ACM SIGPLAN INTERNATIONAL WORKSHOP ON LIBRARIES, LANGUAGES, AND COMPILERS FOR ARRAY PROGRAMMING, 2017: 56-62
  • [7] From C/C++ Code to High-Performance Dataflow Circuits
    Josipovic, Lana
    Guerrieri, Andrea
    Ienne, Paolo
    IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2022, 41 (07): 2142-2155
  • [8] Abstractions for C++ code optimizations in parallel high-performance applications
    Klepl, Jiri
    Smelko, Adam
    Rozsypal, Lukas
    Krulis, Martin
    PARALLEL COMPUTING, 2024, 121
  • [9] RcppArmadillo: Accelerating R with high-performance C++ linear algebra
    Eddelbuettel, Dirk
    Sanderson, Conrad
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2014, 71: 1054-1063
  • [10] DCA++ project: Sustainable and scalable development of a high-performance research code
    Hahner, U. R.
    Balduzzi, G.
    Doak, P. W.
    Maier, T. A.
    Solca, R.
    Schulthess, T. C.
    XXX IUPAP CONFERENCE ON COMPUTATIONAL PHYSICS, 2019, 1290