Performance Models for Communication in Collective I/O Operations

被引:1
|
作者
Jha, Shweta [1 ]
Gabriel, Edgar [1 ]
机构
[1] Univ Houston, Dept Comp Sci, Parallel Software Technol Lab, Houston, TX 77204 USA
基金
美国国家科学基金会;
关键词
D O I
10.1109/CCGRID.2017.31
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Many large scale scientific applications spend a significant amount of time in file I/O operations. Collective I/O APIs provide higher level abstractions of I/O across a group of processes. They often reduce the time spent in file I/O by reorganizing data across processes to match the layout of the data on the file system. In this paper we present performance models for the communication occurring in collective write operations as a first step towards developing a full and accurate mode of collective I/O operations. The models derived in this paper take both the application data decomposition and the file domain partitioning strategies used by the I/O library into account. We discuss properties of our performance models and demonstrate using LogGP parameters derived on multiple platforms their impact on the performance of collective I/O operations. The paper further provides comparison to actual measurements performed on an InfiniBand cluster. Our results indicate a good overall match between predicted and observed behavior.
引用
收藏
页码:982 / 991
页数:10
相关论文
共 50 条
  • [1] Design and Evaluation of Nonblocking Collective I/O Operations
    Venkatesan, Vishwanath
    Chaarawi, Mohamad
    Gabriel, Edgar
    Hoefler, Torsten
    RECENT ADVANCES IN THE MESSAGE PASSING INTERFACE, 2011, 6960 : 90 - +
  • [2] Network Performance Aware MPI Collective Communication Operations in the Cloud
    Gong, Yifan
    He, Bingsheng
    Zhong, Jianlong
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2015, 26 (11) : 3079 - 3089
  • [3] Automatically Selecting the Number of Aggregators for Collective I/O Operations
    Chaarawi, Mohamad
    Gabriel, Edgar
    2011 IEEE INTERNATIONAL CONFERENCE ON CLUSTER COMPUTING (CLUSTER), 2011, : 428 - 437
  • [4] Communication modeling of heterogeneous networks of workstations for performance characterization of collective operations
    Banikazemi, M
    Sampathkumar, J
    Prabhu, S
    Panda, DK
    Sadayappan, P
    (HCW '99) - EIGHTH HETEROGENEOUS COMPUTING WORKSHOP, PROCEEDINGS, 1999, : 125 - 133
  • [5] Using Triggered Operations to Offload Collective Communication Operations
    Hemmert, K. Scott
    Barrett, Brian
    Underwood, Keith D.
    RECENT ADVANCES IN THE MESSAGE PASSING INTERFACE, 2010, 6305 : 249 - +
  • [6] Optimization of collective communication operations in MPICH
    Thakur, R
    Rabenseifner, R
    Gropp, W
    INTERNATIONAL JOURNAL OF HIGH PERFORMANCE COMPUTING APPLICATIONS, 2005, 19 (01): : 49 - 66
  • [7] On Overlapping Communication and File I/O in Collective Write Operation
    Feki, Raafat
    Gabriel, Edgar
    2020 IEEE 34TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS (IPDPSW 2020), 2020, : 1044 - 1051
  • [8] cFireworks: a Tool for Measuring the Communication Costs in Collective I/O
    Cha, Kwangho
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2014, 5 (08) : 192 - 197
  • [9] Collective buffering: Improving parallel I/O performance
    Nitzberg, B
    Lo, V
    SIXTH IEEE INTERNATIONAL SYMPOSIUM ON HIGH PERFORMANCE DISTRIBUTED COMPUTING, PROCEEDINGS, 1997, : 148 - 157
  • [10] Improving collective I/O performance using threads
    Dickens, Phillip M.
    Thakur, Rajeev
    Proceedings of the International Parallel Processing Symposium, IPPS, 1999, : 38 - 45