Design and Implementation of Broadcast Algorithms for Extreme-Scale Systems

Cited by: 3
Authors
Shamis, Pavel [1]
Graham, Richard [1]
Venkata, Manjunath Gorentla [1]
Ladd, Joshua [1]
Affiliations
[1] Oak Ridge Natl Lab, Oak Ridge, TN USA
Keywords
MPI; Scalability; Hierarchy; Broadcast; Algorithm;
DOI
10.1109/CLUSTER.2011.17
Chinese Library Classification (CLC) number
TP3 [Computing technology, computer technology];
Discipline classification code
0812;
Abstract
The scalability and performance of collective communication operations limit the scalability and performance of many scientific applications. This paper presents two new blocking and nonblocking Broadcast algorithms for communicators with arbitrary communication topology, and studies their performance. These algorithms benefit from increased concurrency and a reduced memory footprint, making them suitable for use on large-scale systems. Measuring small, medium, and large data Broadcasts on a Cray XT5, using 24,576 MPI processes, the Cheetah algorithms outperform the native MPI on that system by 51%, 69%, and 9%, respectively, at the same process count. These results demonstrate an algorithmic approach to implementing the important class of collective communications that is high-performing, scalable, and uses resources in a scalable manner.
Pages: 74-83
Page count: 10