Improving parallel executions by increasing task granularity in task-based runtime systems using acyclic DAG clustering

被引:5
|
作者
Bramas, Berenger [1 ,2 ]
Ketterlin, Alain [1 ,2 ,3 ]
机构
[1] Inria Nancy Grand Est, CAMUS, Nancy, France
[2] ICube, ICPS Team, Illkirch Graffenstaden, France
[3] Univ Strasbourg, Strasbourg, France
关键词
Task-based; Graph; DAG; Clustering; Partitioning; ALGORITHM; OPENMP;
D O I
10.7717/peerj-cs.247
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The task-based approach is a parallelization paradigm in which an algorithm is transformed into a direct acyclic graph of tasks: the vertices are computational elements extracted from the original algorithm and the edges are dependencies between those. During the execution, the management of the dependencies adds an overhead that can become significant when the computational cost of the tasks is low. A possibility to reduce the makespan is to aggregate the tasks to make them heavier, while having fewer of them, with the objective of mitigating the importance of the overhead. In this paper, we study an existing clustering/partitioning strategy to speed up the parallel execution of a task-based application. We provide two additional heuristics to this algorithm and perform an in-depth study on a large graph set. In addition, we propose a new model to estimate the execution duration and use it to choose the proper granularity. We show that this strategy allows speeding up a real numerical application by a factor of 7 on a multi-core system.
引用
收藏
页码:1 / 26
页数:26
相关论文
共 46 条
  • [41] OKCM: improving parallel task scheduling in high-performance computing systems using online learning
    Li, Jingbo
    Zhang, Xingjun
    Han, Li
    Ji, Zeyu
    Dong, Xiaoshe
    Hu, Chenglong
    JOURNAL OF SUPERCOMPUTING, 2021, 77 (06): : 5960 - 5983
  • [42] OKCM: improving parallel task scheduling in high-performance computing systems using online learning
    Jingbo Li
    Xingjun Zhang
    Li Han
    Zeyu Ji
    Xiaoshe Dong
    Chenglong Hu
    The Journal of Supercomputing, 2021, 77 : 5960 - 5983
  • [43] Runtime Task Scheduling for FPGA-Based Embedded Systems Using Just-in-Time Bitstream Prefetching
    Duhamel, Alexis
    Pillement, Sebastien
    IEEE ACCESS, 2024, 12 : 184303 - 184310
  • [44] Balancing task allocation in multi-robot systems using K-means clustering and auction based mechanisms
    Eango, Murugappan
    Nachiappan, Subramanian
    Tiwari, Manoj Kumar
    EXPERT SYSTEMS WITH APPLICATIONS, 2011, 38 (06) : 6486 - 6491
  • [45] A statistical, task-based evaluation method for three-dimensional x-ray breast imaging systems using variable-background phantoms
    Park, Subok
    Jennings, Robert
    Liu, Haimo
    Badano, Aldo
    Myers, Kyle
    MEDICAL PHYSICS, 2010, 37 (12) : 6253 - 6270
  • [46] Improving the Sensitivity of Task-Based Multi-Echo Functional Magnetic Resonance Imaging via T2* Mapping Using Synthetic Data-Driven Deep Learning
    Zhao, Yinghe
    Yang, Qinqin
    Qian, Shiting
    Dong, Jiyang
    Cai, Shuhui
    Chen, Zhong
    Cai, Congbo
    BRAIN SCIENCES, 2024, 14 (08)