COMMUNICATION OPTIMIZATIONS FOR IRREGULAR SCIENTIFIC COMPUTATIONS ON DISTRIBUTED-MEMORY ARCHITECTURES

被引:75
|
作者
DAS, R
UYSAL, M
SALTZ, J
HWANG, YS
机构
[1] Department of Computer Science, University of Maryland, College Park
[2] UMIACS, University of Maryland, College Park
关键词
D O I
10.1006/jpdc.1994.1104
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
This paper describes a number of optimizations that can be used to support the efficient execution of irregular problems on distributed memory parallel machines. These primitives (1) coordinate interprocessor data movement, (2) manage the storage of, and access to, copies of off-processor data, (3) minimize interprocessor communication requirements, and (4) support a shared name space. We present a detailed performance and scalability analysis of the communication primitives. This performance and scalability analysis is carried out using a workload generator, kernels from real applications, and a large unstructured adaptive application (the molecular dynamics code CHARMM). (C) 1994 Academic Press, Inc.
引用
收藏
页码:462 / 478
页数:17
相关论文
共 50 条
  • [1] Communication Lower Bounds for Distributed-Memory Computations
    Scquizzato, Michele
    Silvestri, Francesco
    31ST INTERNATIONAL SYMPOSIUM ON THEORETICAL ASPECTS OF COMPUTER SCIENCE (STACS 2014), 2014, 25 : 627 - 638
  • [2] IRREGULAR PERSONALIZED COMMUNICATION ON DISTRIBUTED-MEMORY MACHINES
    RANKA, S
    WANG, JC
    KUMAR, M
    JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 1995, 25 (01) : 58 - 71
  • [3] Probabilistic communication optimizations and parallelization for distributed-memory systems
    Mehofer, E
    Scholz, B
    NINTH EUROMICRO WORKSHOP ON PARALLEL AND DISTRIBUTED PROCESSING, PROCEEDINGS, 2001, : 186 - 192
  • [4] COMPILING FOR DISTRIBUTED-MEMORY ARCHITECTURES
    ROGERS, A
    PINGALI, K
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 1994, 5 (03) : 281 - 298
  • [5] A NEW PROGRAM TRANSFORMATION TO MINIMIZE COMMUNICATION IN DISTRIBUTED-MEMORY ARCHITECTURES
    OBOYLE, M
    HEDAYAT, GA
    LECTURE NOTES IN COMPUTER SCIENCE, 1992, 605 : 347 - 362
  • [6] Parallelizing RRT on Distributed-Memory Architectures
    Devaurs, Didier
    Simeon, Thierry
    Cortes, Juan
    2011 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2011, : 2261 - 2266
  • [7] Parallel ILP for distributed-memory architectures
    Nuno A. Fonseca
    Ashwin Srinivasan
    Fernando Silva
    Rui Camacho
    Machine Learning, 2009, 74 : 257 - 279
  • [8] Code Generation for Distributed-Memory Architectures
    Zhao, Jie
    Zhao, Rongcai
    Xu, Jinchen
    COMPUTER JOURNAL, 2016, 59 (01): : 119 - 132
  • [9] Parallel ILP for distributed-memory architectures
    Fonseca, Nuno A.
    Srinivasan, Ashwin
    Silva, Fernando
    Camacho, Rui
    MACHINE LEARNING, 2009, 74 (03) : 257 - 279
  • [10] IMPLEMENTING AN IRREGULAR APPLICATION ON A DISTRIBUTED-MEMORY MULTIPROCESSOR
    CHAKRABARTI, S
    YELICK, K
    SIGPLAN NOTICES, 1993, 28 (07): : 169 - 178