An efficient OpenMP runtime system for hierarchical architectures

Authors
Thibault, Samuel [1]
Broquedis, Francois [1]
Goglin, Brice [1]
Namyst, Raymond [1]
Wacrenier, Pierre-Andre [1]
Affiliation
[1] LaBRI, INRIA Futurs, F-33405 Talence, France
Keywords
OpenMP; nested parallelism; hierarchical thread scheduling; bubbles; multi-core; NUMA; SMP
DOI
Not available
Chinese Library Classification number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
Exploiting the full computational power of ever deeper hierarchical multiprocessor machines requires a very careful distribution of threads and data across the underlying non-uniform architecture. The emergence of multi-core chips and NUMA machines makes it important to minimize the number of remote memory accesses, to favor cache affinities, and to guarantee fast completion of synchronization steps. By using the BubbleSched platform as a threading backend for the GOMP OpenMP compiler, we are able to easily transpose affinities of thread teams into scheduling hints using abstractions called bubbles. We then propose a scheduling strategy suited to nested OpenMP parallelism. Preliminary performance evaluations show a significant improvement in speedup on a typical NAS OpenMP benchmark application.
Pages: 161-172
Number of pages: 12