Runtime scheduling of dynamic parallelism on accelerator-based multi-core systems

被引:9
|
作者
Blagojevic, Filip [1 ,2 ]
Nikolopoulos, Dimitrios S. [1 ,2 ]
Stamatakis, Alexandros [3 ]
Antonopoulos, Christos D. [4 ]
Curtis-Maury, Matthew [1 ,2 ]
机构
[1] Virginia Tech, Dept Comp Sci, Blacksburg, VA 24061 USA
[2] Virginia Tech, Ctr High End Comp Syst, Blacksburg, VA 24061 USA
[3] Ecole Polytech Fed Lausanne, Sch Comp & Commun Sci, CH-1015 Lausanne, Switzerland
[4] Univ Thessaly, Dept Comp & Commun Engn, Volos 38221, Greece
基金
美国国家科学基金会;
关键词
heterogeneous multi-core processors; accelerator-based parallel architectures; runtime systems for parallel programming; Cell broadband engine;
D O I
10.1016/j.parco.2007.09.004
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
We explore runtime mechanisms and policies for scheduling dynamic multi-grain parallelism on heterogeneous multicore processors. Heterogeneous multi-core processors integrate conventional cores that run legacy codes with specialized cores that serve as computational accelerators. The term multi-grain parallelism refers to the exposure of multiple dimensions of parallelism from within the runtime system, so as to best exploit a parallel architecture with heterogeneous computational capabilities between its cores and execution units. We investigate user-level schedulers that dynamically "rightsize" the dimensions and degrees of parallelism on the cell broadband engine. The schedulers address the problem of mapping application-specific concurrency to an architecture with multiple hardware layers of parallelism, without requiring programmer intervention or sophisticated compiler support. We evaluate recently introduced schedulers for event-driven execution and utilization-driven dynamic multi-grain parallelization on Cell. We also present a new scheduling scheme for dynamic multi-grain parallelism, S-MGPS, which uses sampling of dominant execution phases to converge to the optimal scheduling algorithm. We evaluate S-MGPS on an IBM Cell BladeCenter with two realistic bioinformatics applications that infer large phylogenies. S-MGPS performs within 2-10% of the optimal scheduling algorithm in these applications, while exhibiting low overhead and little sensitivity to application-dependent parameters. (c) 2007 Elsevier B.V. All rights reserved.
引用
收藏
页码:700 / 719
页数:20
相关论文
共 50 条
  • [1] H.264/AVC inter prediction on accelerator-based multi-core systems
    Rafael Rodríguez-Sánchez
    José Luis Martínez
    Gerardo Fernández-Escribano
    José Luis Sánchez
    José Manuel Claver
    Multimedia Tools and Applications, 2013, 66 : 361 - 381
  • [2] H.264/AVC inter prediction on accelerator-based multi-core systems
    Rodriguez-Sanchez, Rafael
    Luis Martinez, Jose
    Fernandez-Escribano, Gerardo
    Luis Sanchez, Jose
    Manuel Claver, Jose
    MULTIMEDIA TOOLS AND APPLICATIONS, 2013, 66 (03) : 361 - 381
  • [3] Runtime Mapping and Scheduling for Energy Efficiency in Heterogeneous Multi-Core Systems
    Silva, Bruno
    Delbem, Alexandre
    Bona, Vanderlei
    Diniz, Pedro C.
    2015 INTERNATIONAL CONFERENCE ON RECONFIGURABLE COMPUTING AND FPGAS (RECONFIG), 2015,
  • [4] DAG Scheduling and Analysis on Multi-Core Systems by Modelling Parallelism and Dependency
    Zhao, Shuai
    Dai, Xiaotian
    Bate, Iain
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2022, 33 (12) : 4019 - 4038
  • [5] Dynamic Power Scheduling for VM-based Multi-core Systems
    Liang, Jhe-Ming
    Zhan, Ren-Hao
    Chen, Wei-Mei
    2012 IEEE 4TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING TECHNOLOGY AND SCIENCE (CLOUDCOM), 2012,
  • [6] Improved parallelism and scheduling in multi-core software routers
    Egi, Norbert
    Iannaccone, Gianluca
    Manesh, Maziar
    Mathy, Laurent
    Ratnasamy, Sylvia
    JOURNAL OF SUPERCOMPUTING, 2013, 63 (01): : 294 - 322
  • [7] Improved parallelism and scheduling in multi-core software routers
    Norbert Egi
    Gianluca Iannaccone
    Maziar Manesh
    Laurent Mathy
    Sylvia Ratnasamy
    The Journal of Supercomputing, 2013, 63 : 294 - 322
  • [8] Scheduling Multi-tenant Cloud Workloads on Accelerator-based Systems
    Sengupta, Dipanjan
    Goswami, Anshuman
    Schwan, Karsten
    Pallavi, Krishna
    SC14: INTERNATIONAL CONFERENCE FOR HIGH PERFORMANCE COMPUTING, NETWORKING, STORAGE AND ANALYSIS, 2014, : 513 - 524
  • [9] Scheduling OR-parallelism in YapOr and ThOr on Multi-Core Machines
    Dutra, Ines
    Rocha, Ricardo
    Costa, Vitor Santos
    Silva, Fernando
    Santos, Joao
    2012 IEEE 26TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS & PHD FORUM (IPDPSW), 2012, : 1581 - 1590
  • [10] Research on dynamic optimization method of embedded multi-core performance based on runtime
    Huang Y.
    Fang X.
    Zeng Y.
    Huang Z.
    Guo H.
    Recent Patents on Engineering, 2021, 15 (03) : 356 - 365