Improving parallel executions by increasing task granularity in task-based runtime systems using acyclic DAG clustering

被引:5
|
作者
Bramas, Berenger [1 ,2 ]
Ketterlin, Alain [1 ,2 ,3 ]
机构
[1] Inria Nancy Grand Est, CAMUS, Nancy, France
[2] ICube, ICPS Team, Illkirch Graffenstaden, France
[3] Univ Strasbourg, Strasbourg, France
关键词
Task-based; Graph; DAG; Clustering; Partitioning; ALGORITHM; OPENMP;
D O I
10.7717/peerj-cs.247
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The task-based approach is a parallelization paradigm in which an algorithm is transformed into a direct acyclic graph of tasks: the vertices are computational elements extracted from the original algorithm and the edges are dependencies between those. During the execution, the management of the dependencies adds an overhead that can become significant when the computational cost of the tasks is low. A possibility to reduce the makespan is to aggregate the tasks to make them heavier, while having fewer of them, with the objective of mitigating the importance of the overhead. In this paper, we study an existing clustering/partitioning strategy to speed up the parallel execution of a task-based application. We provide two additional heuristics to this algorithm and perform an in-depth study on a large graph set. In addition, we propose a new model to estimate the execution duration and use it to choose the proper granularity. We show that this strategy allows speeding up a real numerical application by a factor of 7 on a multi-core system.
引用
收藏
页码:1 / 26
页数:26
相关论文
共 46 条
  • [21] A scalable clustering-based task scheduler for homogeneous processors using DAG partitioning
    Ozkaya, M. Yusuf
    Benoit, Anne
    Ucar, Bora
    Herrmann, Julien
    Catalyurek, Umit, V
    2019 IEEE 33RD INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM (IPDPS 2019), 2019, : 155 - 165
  • [22] Using Intrinsic Performance Counters to Assess Efficiency in Task-based Parallel Applications
    Grubel, Patricia
    Kaiser, Hartmut
    Huck, Kevin
    Cook, Jeanine
    2016 IEEE 30TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS (IPDPSW), 2016, : 1692 - 1701
  • [23] Performance Measurements Within Asynchronous Task-Based Runtime Systems: A Double White Dwarf Merger as an Application
    Diehl, Patrick
    Marcello, Dominic
    Amini, Parsa
    Kaiser, Hartmut
    Shiber, Sagiv
    Clayton, Geoffrey C.
    Frank, Juhan
    Daiss, Gregor
    Pfluger, Dirk
    Eder, David
    Koniges, Alice
    Huck, Kevin
    COMPUTING IN SCIENCE & ENGINEERING, 2021, 23 (03) : 73 - 81
  • [24] Approximation Proofs of a Fast and Efficient List Scheduling Algorithm for Task-Based Runtime Systems on Multicores and GPUs
    Beaumont, Olivier
    Eyraud-Dubois, Lionel
    Kumar, Suraj
    2017 31ST IEEE INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM (IPDPS), 2017, : 768 - 777
  • [25] THE PRACTICALITY OF USING THE TASK-BASED INSTRUCTION——IMPROVING STUDENTS’UNDERSTANDING OF STRESS AND TONE
    顾敏
    读与写(教育教学刊), 2008, (04) : 5+8 - 5
  • [26] Using Intra-Core Loop-Task Accelerators to Improve the Productivity and Performance of Task-Based Parallel Programs
    Kim, Ji
    Jiang, Shunning
    Torng, Christopher
    Wang, Moyang
    Srinath, Shreesha
    Ilbeyi, Berkin
    Al-Hawaj, Khalid
    Batten, Christopher
    50TH ANNUAL IEEE/ACM INTERNATIONAL SYMPOSIUM ON MICROARCHITECTURE (MICRO), 2017, : 759 - 773
  • [27] Towards seismic wave modeling on heterogeneous many-core architectures using task-based runtime system
    Martinez, Victor
    Michea, David
    Dupros, Fabrice
    Aumage, Olivier
    Thibault, Samuel
    Aochi, Hideo
    Navaux, Philippe O. A.
    2015 27TH INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE AND HIGH PERFORMANCE COMPUTING (SBAC-PAD), 2015, : 1 - 8
  • [28] An Efficient Technique for Virtual Machine Clustering and Communications Using Task-Based Scheduling in Cloud Computing
    Saravanakumar, C.
    Geetha, M.
    Kumar, S. Manoj
    Manikandan, S.
    Arun, C.
    Srivatsan, K.
    SCIENTIFIC PROGRAMMING, 2021, 2021
  • [29] Cpp-Taskflow: Fast Task-based Parallel Programming using Modern C plus
    Huang, Tsung-Wei
    Lin, Chun-Xun
    Guo, Guannan
    Wong, Martin
    2019 IEEE 33RD INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM (IPDPS 2019), 2019, : 986 - 995
  • [30] The TaPaSCo Open-Source Toolflow for the Automated Composition of Task-Based Parallel Reconfigurable Computing Systems
    Heinz, Carsten
    Hofmann, Jaco
    Korinth, Jens
    Sommer, Lukas
    Weber, Lukas
    Koch, Andreas
    JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2021, 93 (05): : 545 - 563