Runtime scheduling of dynamic parallelism on accelerator-based multi-core systems

被引：9

作者：

Blagojevic, Filip ^{[1
,2
]}

Nikolopoulos, Dimitrios S. ^{[1
,2
]}

Stamatakis, Alexandros ^{[3
]}

Antonopoulos, Christos D. ^{[4
]}

Curtis-Maury, Matthew ^{[1
,2
]}

机构：

[1] Virginia Tech, Dept Comp Sci, Blacksburg, VA 24061 USA

[2] Virginia Tech, Ctr High End Comp Syst, Blacksburg, VA 24061 USA

[3] Ecole Polytech Fed Lausanne, Sch Comp & Commun Sci, CH-1015 Lausanne, Switzerland

[4] Univ Thessaly, Dept Comp & Commun Engn, Volos 38221, Greece

来源：

PARALLEL COMPUTING | 2007年 / 33卷 / 10-11期

基金：

美国国家科学基金会;

关键词：

heterogeneous multi-core processors; accelerator-based parallel architectures; runtime systems for parallel programming; Cell broadband engine;

D O I：

10.1016/j.parco.2007.09.004

中图分类号：

TP301 [理论、方法];

学科分类号：

081202 ;

摘要：

We explore runtime mechanisms and policies for scheduling dynamic multi-grain parallelism on heterogeneous multicore processors. Heterogeneous multi-core processors integrate conventional cores that run legacy codes with specialized cores that serve as computational accelerators. The term multi-grain parallelism refers to the exposure of multiple dimensions of parallelism from within the runtime system, so as to best exploit a parallel architecture with heterogeneous computational capabilities between its cores and execution units. We investigate user-level schedulers that dynamically "rightsize" the dimensions and degrees of parallelism on the cell broadband engine. The schedulers address the problem of mapping application-specific concurrency to an architecture with multiple hardware layers of parallelism, without requiring programmer intervention or sophisticated compiler support. We evaluate recently introduced schedulers for event-driven execution and utilization-driven dynamic multi-grain parallelization on Cell. We also present a new scheduling scheme for dynamic multi-grain parallelism, S-MGPS, which uses sampling of dominant execution phases to converge to the optimal scheduling algorithm. We evaluate S-MGPS on an IBM Cell BladeCenter with two realistic bioinformatics applications that infer large phylogenies. S-MGPS performs within 2-10% of the optimal scheduling algorithm in these applications, while exhibiting low overhead and little sensitivity to application-dependent parameters. (c) 2007 Elsevier B.V. All rights reserved.

引用

页码：700 / 719

页数：20

共 50 条

[1] H.264/AVC inter prediction on accelerator-based multi-core systems
Rafael Rodríguez-Sánchez
José Luis Martínez
Gerardo Fernández-Escribano
José Luis Sánchez
José Manuel Claver
Multimedia Tools and Applications, 2013, 66 : 361 - 381
[2] H.264/AVC inter prediction on accelerator-based multi-core systems
Rodriguez-Sanchez, Rafael
Luis Martinez, Jose
Fernandez-Escribano, Gerardo
Luis Sanchez, Jose
Manuel Claver, Jose
MULTIMEDIA TOOLS AND APPLICATIONS, 2013, 66 (03) : 361 - 381
[3] Runtime Mapping and Scheduling for Energy Efficiency in Heterogeneous Multi-Core Systems
Silva, Bruno
Delbem, Alexandre
Bona, Vanderlei
Diniz, Pedro C.
2015 INTERNATIONAL CONFERENCE ON RECONFIGURABLE COMPUTING AND FPGAS (RECONFIG), 2015,
[4] DAG Scheduling and Analysis on Multi-Core Systems by Modelling Parallelism and Dependency
Zhao, Shuai
Dai, Xiaotian
Bate, Iain
IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2022, 33 (12) : 4019 - 4038
[5] Dynamic Power Scheduling for VM-based Multi-core Systems
Liang, Jhe-Ming
Zhan, Ren-Hao
Chen, Wei-Mei
2012 IEEE 4TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING TECHNOLOGY AND SCIENCE (CLOUDCOM), 2012,
[6] Improved parallelism and scheduling in multi-core software routers
Egi, Norbert
Iannaccone, Gianluca
Manesh, Maziar
Mathy, Laurent
Ratnasamy, Sylvia
JOURNAL OF SUPERCOMPUTING, 2013, 63 (01): : 294 - 322
[7] Improved parallelism and scheduling in multi-core software routers
Norbert Egi
Gianluca Iannaccone
Maziar Manesh
Laurent Mathy
Sylvia Ratnasamy
The Journal of Supercomputing, 2013, 63 : 294 - 322
[8] Scheduling Multi-tenant Cloud Workloads on Accelerator-based Systems
Sengupta, Dipanjan
Goswami, Anshuman
Schwan, Karsten
Pallavi, Krishna
SC14: INTERNATIONAL CONFERENCE FOR HIGH PERFORMANCE COMPUTING, NETWORKING, STORAGE AND ANALYSIS, 2014, : 513 - 524
[9] Scheduling OR-parallelism in YapOr and ThOr on Multi-Core Machines
Dutra, Ines
Rocha, Ricardo
Costa, Vitor Santos
Silva, Fernando
Santos, Joao
2012 IEEE 26TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS & PHD FORUM (IPDPSW), 2012, : 1581 - 1590
[10] Research on dynamic optimization method of embedded multi-core performance based on runtime
Huang Y.
Fang X.
Zeng Y.
Huang Z.
Guo H.
Recent Patents on Engineering, 2021, 15 (03) : 356 - 365

← 1 2 3 4 5 →