Performance instrumentation and compiler optimizations for MPI/OpenMP applications

Cited by: 0
Authors
Hernandez, Oscar [1]
Song, Fengguang [2]
Chapman, Barbara [1]
Dongarra, Jack [2]
Mohr, Bernd [3]
Moore, Shirley [2]
Wolf, Felix [3]
Affiliations
[1] Univ Houston, Dept Comp Sci, Houston, TX 77204 USA
[2] Univ Tennessee, Dept Comp Sci, Knoxville, TN 37996 USA
[3] Forschungszentrum Julich, ZAM, D-52425 Julich, Germany
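Funding
US National Science Foundation
Keywords
DOI
Not available
Chinese Library Classification number
TP3 [Computing technology, computer technology]
Discipline classification code
0812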
Abstract
This article describes how integrating the OpenUH OpenMP compiler with the KOJAK performance analysis tool can help developers of OpenMP and hybrid codes optimize their applications with as little user intervention as possible. In particular, we (i) describe how the compiler's ability to automatically instrument user code down to the flow-graph level can improve the localization of performance problems and (ii) outline how the performance feedback provided by KOJAK will direct the compiler's optimization decisions in the future. To demonstrate our methodology, we present experimental results showing how the causes of a performance slowdown in the ASPCG benchmark could be identified.
Pages: 267 / +
Page count: 3