Improving parallel executions by increasing task granularity in task-based runtime systems using acyclic DAG clustering

Cited by: 5
Authors
Bramas, Berenger [1, 2]
Ketterlin, Alain [1, 2, 3]
Affiliations
[1] Inria Nancy Grand Est, CAMUS, Nancy, France
[2] ICube, ICPS Team, Illkirch Graffenstaden, France
[3] Univ Strasbourg, Strasbourg, France
Keywords
Task-based; Graph; DAG; Clustering; Partitioning; Algorithm; OpenMP
DOI
10.7717/peerj-cs.247
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
The task-based approach is a parallelization paradigm in which an algorithm is transformed into a directed acyclic graph of tasks: the vertices are computational elements extracted from the original algorithm, and the edges are the dependencies between them. During execution, managing the dependencies adds an overhead that can become significant when the computational cost of the tasks is low. One way to reduce the makespan is to aggregate tasks so that each is heavier and there are fewer of them, which lowers the relative weight of the overhead. In this paper, we study an existing clustering/partitioning strategy to speed up the parallel execution of a task-based application. We add two heuristics to this algorithm and perform an in-depth study on a large graph set. In addition, we propose a new model to estimate the execution duration and use it to choose the proper granularity. We show that this strategy speeds up a real numerical application by a factor of 7 on a multi-core system.
Pages: 1 / 26
Page count: 26
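The idea in the abstract, aggregating tasks so that a fixed per-task management overhead is paid less often, can be illustrated with a minimal Python sketch. It is not the clustering algorithm or the execution model studied in the paper. It assumes the simplest acyclicity-preserving rule (merge a task with its only successor when it is also that successor's only predecessor), a hypothetical constant `overhead` per task, and the critical-path length as a lower bound on the makespan with unlimited workers. All function names are illustrative.

```python
def topological_order(succ):
    """Kahn's algorithm; raises ValueError if the graph has a cycle."""
    indeg = {v: 0 for v in succ}
    for v in succ:
        for w in succ[v]:
            indeg[w] += 1
    ready = [v for v in succ if indeg[v] == 0]
    order = []
    while ready:
        v = ready.pop()
        order.append(v)
        for w in succ[v]:
            indeg[w] -= 1
            if indeg[w] == 0:
                ready.append(w)
    if len(order) != len(succ):
        raise ValueError("graph contains a cycle")
    return order


def merge_chains(succ, cost):
    """Contract every edge u->v where v is u's only successor and
    u is v's only predecessor. Such a contraction cannot create a cycle."""
    pred = {v: [] for v in succ}
    for v in succ:
        for w in succ[v]:
            pred[w].append(v)
    leader = {}  # task -> representative task of its cluster
    for v in topological_order(succ):
        p = pred[v]
        if len(p) == 1 and len(succ[p[0]]) == 1:
            leader[v] = leader[p[0]]
        else:
            leader[v] = v
    new_cost, new_succ = {}, {}
    for v in succ:
        c = leader[v]
        new_cost[c] = new_cost.get(c, 0) + cost[v]
        new_succ.setdefault(c, set())
    for v in succ:
        for w in succ[v]:
            if leader[v] != leader[w]:
                new_succ[leader[v]].add(leader[w])
    return {c: sorted(ws) for c, ws in new_succ.items()}, new_cost


def critical_path(succ, cost, overhead):
    """Longest path when every task pays `overhead` on top of its cost."""
    longest_from = {}
    for v in reversed(topological_order(succ)):
        tail = max((longest_from[w] for w in succ[v]), default=0)
        longest_from[v] = cost[v] + overhead + tail
    return max(longest_from.values())


# Toy graph (made up for illustration): a chain a->b->c feeding a diamond.
succ = {"a": ["b"], "b": ["c"], "c": ["d", "e"],
        "d": ["f"], "e": ["f"], "f": []}
cost = {v: 1 for v in succ}
overhead = 1  # hypothetical per-task management cost

coarse_succ, coarse_cost = merge_chains(succ, cost)
fine_length = critical_path(succ, cost, overhead)
coarse_length = critical_path(coarse_succ, coarse_cost, overhead)
print(len(succ), fine_length, len(coarse_succ), coarse_length)
```

On this toy graph the chain a, b, c collapses into one task, so two overhead payments disappear from the critical path. Chain merging never reduces the available parallelism; more aggressive clustering, as considered in the paper, trades parallelism against overhead, which is why a model for choosing the granularity is needed.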