Strategies for maximizing utilization on multi-CPU and multi-GPU heterogeneous architectures

被引:0
|
作者
Angeles Navarro
Antonio Vilches
Francisco Corbera
Rafael Asenjo
机构
[1] University of Malaga,Department of Computer Architecture
[2] Universidad de Málaga,Andalucía Tech, Department of Computer Architecture
来源
关键词
Heterogeneous computing; Dynamic scheduling; Adaptive partitioning; Task parallelism; Oversubscription; Synchronization;
D O I
暂无
中图分类号
学科分类号
摘要
This paper explores the possibility of efficiently executing a single application using multicores simultaneously with multiple GPU accelerators under a parallel task programming paradigm. In particular, we address the challenge of extending a parallel_for template to allow its exploitation on heterogeneous architectures. Due to the asymmetry of the computing resources, we propose in this work a dynamic scheduling strategy coupled with an adaptive partitioning scheme that resizes chunks to prevent underutilization and load imbalance of CPUs and GPUs. In this paper we also address the problem of the underutilization of the CPU core where a host thread operates. To solve it, we propose two different approaches: (1) a collaborative host thread strategy, in which the host thread, instead of busy-waiting for the GPU to complete, it carries out useful chunk processing; and (2) a host thread blocking strategy combined with oversubscription, that delegates on the OS the duty of scheduling threads to available CPU cores in order to guarantee that all cores are doing useful work. Using two benchmarks we evaluate the overhead introduced by our scheduling and partitioning algorithms, finding that it is negligible. We also evaluate the efficiency of the strategies proposed finding that allowing oversubscription controlled by the OS can be beneficial under certain scenarios.
引用
收藏
页码:756 / 771
页数:15
相关论文
共 50 条
  • [41] Parallel Singular Value Decomposition on Heterogeneous Multi-core and Multi-GPU Platforms
    Feng, Xiaowen
    Jin, Hai
    Zheng, Ran
    Zhu, Lei
    2014 NINTH INTERNATIONAL CONFERENCE ON DIGITAL INFORMATION MANAGEMENT (ICDIM), 2014, : 45 - 50
  • [42] Optimizing Multi-GPU Parallelization Strategies for Deep Learning Training
    Pal, Saptadeep
    Ebrahimi, Eiman
    Zulfiqar, Arslan
    Fu, Yaosheng
    Zhang, Victor
    Migacz, Szymon
    Nellans, David
    Gupta, Puneet
    IEEE MICRO, 2019, 39 (05) : 91 - 101
  • [43] Modelling Multi-GPU Systems
    Spampinato, Daniele G.
    Elster, Anne C.
    Natvig, Thorvald
    PARALLEL COMPUTING: FROM MULTICORES AND GPU'S TO PETASCALE, 2010, 19 : 562 - 569
  • [44] MAPREDUCE IMPLEMENTATION WITH MULTI-GPU
    Chen, Yi
    Chen, Su
    Jiang, Hai
    INTERNATIONAL SYMPOSIUM ON COMPUTER SCIENCE & TECHNOLOGY: PROCEEDINGS, 2012, : 21 - 25
  • [45] Multi-GPU Graph Analytics
    Pan, Yuechao
    Wang, Yangzihao
    Wu, Yuduo
    Yang, Carl
    Owens, John D.
    2017 31ST IEEE INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM (IPDPS), 2017, : 479 - 490
  • [46] Multi-CPU architecture speeds ray tracing
    Gauvin, Michael
    Scott, Donald
    LASER FOCUS WORLD, 2007, 43 (03): : 49 - +
  • [47] A Novel Multi-CPU/GPU Collaborative Computing Framework for SGD-based Matrix Factorization
    Huang, Yizhi
    Yin, Yanlong
    Liu, Yan
    He, Shuibing
    Bai, Yang
    Li, Renfa
    50TH INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING, 2021,
  • [48] New multi-GPU implementation for smoothed particle hydrodynamics on heterogeneous clusters
    Dominguez, J. M.
    Crespo, A. J. C.
    Valdez-Balderas, D.
    Rogers, B. D.
    Gomez-Gesteira, M.
    COMPUTER PHYSICS COMMUNICATIONS, 2013, 184 (08) : 1848 - 1860
  • [49] Parallel Generation of Digitally Reconstructed Radiographs on Heterogeneous Multi-GPU Workstations
    Abdellah, Marwan
    Abdelaziz, Asem
    Ali, Eslam
    Abdelaziz, Sherief
    Sayed, Abdelrahman
    Owis, Mohamed I.
    Eldeib, Ayman
    2016 38TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY (EMBC), 2016, : 3953 - 3956
  • [50] MG-Join: A Scalable Join for Massively Parallel Multi-GPU Architectures
    Paul, Johns
    Lu, Shengliang
    He, Bingsheng
    Lau, Chiew Tong
    SIGMOD '21: PROCEEDINGS OF THE 2021 INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2021, : 1413 - 1425