Design Alternatives for Implementing Fence Synchronization in MPI-2 One-sided Communication for InfiniBand Clusters

被引:0
|
作者
Santhanaraman, G. [1 ]
Gangadharappa, T. [1 ]
Narravula, S. [1 ]
Mamidala, A. [1 ]
Panda, D. K. [1 ]
机构
[1] Ohio State Univ, Dept Comp Sci & Engn, Columbus, OH 43210 USA
来源
2009 IEEE INTERNATIONAL CONFERENCE ON CLUSTER COMPUTING AND WORKSHOPS | 2009年
关键词
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Scientific computing has seen an immense growth in recent years. The Message Passing Interface (MPI) has become the de-facto standard for parallel programming model for distributed memory systems. As the system scale increases, application writers often try to increase the overlap of computation and communication. The MPI-2 standard expanded MPI to include one-sided communication semantics that has the potential for overlapping computation with communication. In this model synchronization between processes needs to be done explicitly to ensure completion before using the data. Fence is one of the mechanisms of providing such synchronization in the one-sided model. In this paper, we study a set of different alternatives for designing the fence synchronization mechanisms. We analyze the various trade-offs of these designs on networks like InfiniBand that provide Remote Direct Memory Access (RDMA) capabilities. We propose a novel design for implementing fence synchronization that uses RDMA write with Immediate mechanism (Fence-Imm-RI) provided by InfiniBand networks. We then characterize the performance of different designs with various one-sided communication pattern micro-benchmarks for both latency as well as overlap capability. The new Fence-Imm-RI scheme performs the best in scenarios that require low synchronization overhead as well as good overlap capability (close to 90% overlap for large messages) as opposed to the other designs that can provide either low synchronization overhead or good overlap capability.
引用
收藏
页码:394 / 402
页数:9
相关论文
共 49 条
  • [41] ONE-SIDED VERSUS 2-SIDED COMMUNICATION UNDER INDIRECT AWARENESS OF PERSUASIVE INTENT
    ROSNOW, RL
    PUBLIC OPINION QUARTERLY, 1968, 32 (01) : 95 - 101
  • [42] Exploring one-sided communication and synchronization on a non-cache-coherent many-core architecture
    Christgau, Steffen
    Schnor, Bettina
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2017, 29 (15):
  • [43] Evaluation of MPI's one-sided communication mechanism for short-range molecular dynamics on the Origin2000
    Matthey, T
    Hansen, JP
    APPLIED PARALLEL COMPUTING, PROCEEDINGS: NEW PARADIGMS FOR HPC IN INDUSTRY AND ACADEMIA, 2001, 1947 : 356 - 365
  • [44] COMPARATIVE-ANALYSIS OF THE RELATIVE EFFECTIVENESS OF ONE-SIDED AND 2-SIDED COMMUNICATION FOR CONTRASTING PRODUCTS
    GOLDEN, LL
    ALPERT, MI
    JOURNAL OF ADVERTISING, 1987, 16 (01) : 18 - &
  • [45] RELATIVE EFFICACY OF A ONE-SIDED VS 2-SIDED COMMUNICATION IN A SIMULATED GOVERNMENT EVALUATION OF PROPOSALS
    DYCUS, RD
    PSYCHOLOGICAL REPORTS, 1976, 38 (03) : 787 - 790
  • [46] Design of non-fragile synchronization of one-sided Lipschitz Lur'e differential inclusion system
    Shi M.-J.
    Huang J.
    Chen L.
    Han Z.-Z.
    Huang, Jun (cauchyhot@163.com), 1654, South China University of Technology (34): : 1654 - 1661
  • [47] ON DOMINATING THE X2-TEST IN THE CASE OF MULTIVARIATE NORMAL-DISTRIBUTION WITH ONE-SIDED ALTERNATIVES
    ALRAWWASH, HM
    SANKHYA-THE INDIAN JOURNAL OF STATISTICS SERIES B, 1990, 52 : 174 - 182
  • [48] Implementing the one-sided Jacobi method on a 2D/3D mesh multicomputer
    Royo, D
    Valero-García, M
    González, A
    PARALLEL COMPUTING, 2001, 27 (09) : 1253 - 1271
  • [49] Optimized Non-contiguous MPI Datatype Communication for GPU Clusters: Design, Implementation and Evaluation with MVAPICH2
    Wang, Hao
    Potluri, Sreeram
    Luo, Miao
    Singh, Ashish Kumar
    Ouyang, Xiangyong
    Sur, Sayantan
    Panda, Dhabaleswar K.
    2011 IEEE INTERNATIONAL CONFERENCE ON CLUSTER COMPUTING (CLUSTER), 2011, : 308 - 316