The influence of operating systems on the performance of collective operations at extreme scale

被引:0
|
作者
Beckman, Pete [1 ]
Iskra, Kamil [1 ]
Yoshii, Kazutomo [1 ]
Coghlan, Susan [1 ]
机构
[1] Argonne Natl Lab, Div Math & Comp Sci, 9700 S Cass Ave, Argonne, IL 60439 USA
关键词
microbenchmark; noise; petascale; synchronicity;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
We investigate operating system noise, which we identify as one of the main reasons for a lack of synchronicity ill parallel applications. Using a microbenchmark, we measure the noise on several contemporary platforms and find that, even with a general-purpose operating system, noise can be limited if certain precautions are taken. We. then inject artificially generated noise into a massively parallel system and measure its influence on the performance of collective operations. Our experiments indicate that on extreme-scale platforms, the performance is correlated with the largest interruption to the application, even if the probability of such an interruption is extremely small. We demonstrate that synchronizing the noise can significantly reduce its negative influence.
引用
收藏
页码:81 / +
页数:3
相关论文
共 50 条
  • [1] mOS: An Architecture for Extreme-Scale Operating Systems
    Wisniewski, Robert W.
    Inglett, Todd
    Keppel, Pardo
    Murty, Ravi
    Riesen, Rolf
    PROCEEDINGS OF THE 4TH INTERNATIONAL WORKSHOP ON RUNTIME AND OPERATING SYSTEMS FOR SUPERCOMPUTERS, ROSS 2014, 2014,
  • [2] Cathode performance: The influence of design, operations, and operating conditions
    Harald A. Øye
    Barry J. Welch
    JOM, 1998, 50 : 18 - 23
  • [3] Cathode performance: The influence of design, operations, and operating conditions
    Oye, Harald A.
    Welch, Barry J.
    JOM, 1998, 50 (02): : 18 - 23
  • [4] Cathode performance: The influence of design, operations, and operating conditions
    Oye, HA
    Welch, BJ
    JOM-JOURNAL OF THE MINERALS METALS & MATERIALS SOCIETY, 1998, 50 (02): : 18 - 23
  • [5] Accurately measuring collective operations at massive scale
    Hoefler, Torsten
    Schneider, Timo
    Lumsdaine, Andrew
    2008 IEEE INTERNATIONAL SYMPOSIUM ON PARALLEL & DISTRIBUTED PROCESSING, VOLS 1-8, 2008, : 3174 - +
  • [6] Improving the performance of collective operations in MPICH
    Thakur, Rajeev
    Gropp, William D.
    2003, Springer Verlag (2840):
  • [7] Improving the performance of collective operations in MPICH
    Thakur, R
    Gropp, WD
    RECENT ADVANCES IN PARALLEL VIRTUAL MACHINE AND MESSAGE PASSING INTERFACE, 2003, 2840 : 257 - 267
  • [8] Performance analysis of MPI collective operations
    Pješivac-Grbović, Jelena
    Angskun, Thara
    Bosilca, George
    Fagg, Graham E.
    Gabriel, Edgar
    Dongarra, Jack J.
    Cluster Computing, 2007, 10 (02) : 127 - 143
  • [9] Performance analysis of MPI collective operations
    Jelena Pješivac-Grbović
    Thara Angskun
    George Bosilca
    Graham E. Fagg
    Edgar Gabriel
    Jack J. Dongarra
    Cluster Computing, 2007, 10 (2) : 127 - 143
  • [10] Performance analysis of MPI collective operations
    Pjesivac-Grbovic, Jelena
    Angskun, Thara
    Bosilca, George
    Fagg, Graham E.
    Gabriel, Edgar
    Dongarra, Jack J.
    CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2007, 10 (02): : 127 - 143