The Pitfalls of Provisioning Exascale Networks: A Trace Replay Analysis for Understanding Communication Performance

被引:3
|
作者
Kenny, Joseph P. [1 ]
Sargsyan, Khachik [1 ]
Knight, Samuel [1 ]
Michelogiannakis, George [2 ]
Wilke, Jeremiah J. [1 ]
机构
[1] Sandia Natl Labs, 7011 East Ave, Livermore, CA 94551 USA
[2] Lawrence Berkeley Natl Lab, 1 Cyclotron Rd, Berkeley, CA USA
关键词
OPTIMIZATION; SIMULATION; MPI;
D O I
10.1007/978-3-319-92040-5_14
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Data movement is considered the main performance concern for exascale, including both on-node memory and off-node network communication. Indeed, many application traces show significant time spent in MPI calls, potentially indicating that faster networks must be provisioned for scalability. However, equating MPI times with network communication delays ignores synchronization delays and software overheads independent of network hardware. Using point-to-point protocol details, we explore the decomposition of MPI time into communication, synchronization and software stack components using architecture simulation. Detailed validation using Bayesian inference is used to identify the sensitivity of performance to specific latency/bandwidth parameters for different network protocols and to quantify associated uncertainties. The inference combined with trace replay shows that synchronization and MPI software stack overhead are at least as important as the network itself in determining time spent in communication routines.
引用
收藏
页码:269 / 288
页数:20
相关论文
共 50 条
  • [21] Performance analysis and admission control in multimedia communication networks
    Paschalidis, IC
    PROCEEDINGS OF THE 36TH IEEE CONFERENCE ON DECISION AND CONTROL, VOLS 1-5, 1997, : 1786 - 1791
  • [22] Performance Analysis of Routing Protocols for UAV Communication Networks
    Tan, Xiaopeng
    Zuo, Zhen
    Su, Shaojing
    Guo, Xiaojun
    Sun, Xiaoyong
    Jiang, Deng
    IEEE ACCESS, 2020, 8 : 92212 - 92224
  • [23] PERFORMANCE ANALYSIS OF METEOR BURST COMMUNICATION-NETWORKS
    MCGURL, AK
    1989 IEEE MILITARY COMMUNICATIONS CONFERENCE, VOLS 1-3: BRIDGING THE GAP : INTEROPERABILITY, SURVIVABILITY, SECURITY, 1989, : 408 - 412
  • [24] METHOD OF MOMENTS FOR ANALYSIS OF A SWITCHED COMMUNICATION NETWORKS PERFORMANCE
    KUCZURA, A
    BAJAJ, D
    IEEE TRANSACTIONS ON COMMUNICATIONS, 1977, 25 (02) : 185 - 193
  • [25] Understanding knowledge workers' job performance: a perspective of online and offline communication networks
    Gao, Fei
    Wang, Junwei
    Wang, Shiquan
    ENTERPRISE INFORMATION SYSTEMS, 2019, 13 (01) : 107 - 131
  • [26] Composite performance and availability analysis of wireless communication networks
    Ma, Y
    Han, JJ
    Trivedi, KS
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2001, 50 (05) : 1216 - 1223
  • [27] How Expensive is Consistency? Performance Analysis of Consistent Rate Provisioning to Mobile Users in Cellular Networks
    Mehmeti, Fidan
    Rosenberg, Catherine
    IEEE TRANSACTIONS ON MOBILE COMPUTING, 2019, 18 (05) : 1098 - 1115
  • [28] Understanding Probabilistic Cognitive Relaying Communication with Experimental Implementation and Performance Analysis
    Khandakar, Amith
    Mohamed, Amr Mahmoud Salem
    SENSORS, 2019, 19 (01)
  • [29] Performance analysis of multicast algorithms for mobile satellite communication networks
    Thomas, RW
    Raines, RA
    Baldwin, RO
    Temple, MA
    COMPUTER COMMUNICATIONS, 2002, 25 (11-12) : 1085 - 1093
  • [30] PERFORMANCE ANALYSIS OF INTEGRATED COMMUNICATION AND CONTROL-SYSTEM NETWORKS
    HALEVI, Y
    RAY, A
    JOURNAL OF DYNAMIC SYSTEMS MEASUREMENT AND CONTROL-TRANSACTIONS OF THE ASME, 1990, 112 (03): : 365 - 371