Consumer Level Multi-GPU Systems Utilization, Efficiency, and Optimization

被引:0
|
作者
Ross, John Brandon [1 ]
机构
[1] Univ Alabama, Huntsville, AL 35899 USA
关键词
Graphics Processing Unit; GPU; Multi-GPU; Multi-Card; High Performance Computing; General Purpose GPU;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
This paper focuses on the basic techniques that a common programmer can use to utilize multiple consumer level GPUs, commonly available to them at home or through purchase, to assist them in massively parallel computation in their work. It lays out an experimental framework to determine the best and easiest styles of programming for multiple GPUs to use, without actually using the more complex and available NVIDIA multi-GPU templates and techniques.
引用
收藏
页数:8
相关论文
共 50 条
  • [31] Efficient breadth first search on multi-GPU systems
    Mastrostefano, Enrico
    Bernaschi, Massimo
    JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2013, 73 (09) : 1292 - 1305
  • [32] Dynamic load balancing on heterogeneous multi-GPU systems
    Acosta, Alejandro
    Blanco, Vicente
    Almeida, Francisco
    COMPUTERS & ELECTRICAL ENGINEERING, 2013, 39 (08) : 2591 - 2602
  • [33] Tensor Movement Orchestration in Multi-GPU Training Systems
    Lin, Shao-Fu
    Chen, Yi-Jung
    Cheng, Hsiang-Yun
    Yang, Chia-Lin
    2023 IEEE INTERNATIONAL SYMPOSIUM ON HIGH-PERFORMANCE COMPUTER ARCHITECTURE, HPCA, 2023, : 1140 - 1152
  • [34] Gossip: Efficient Communication Primitives for Multi-GPU Systems
    Kobus, Robin
    Juenger, Daniel
    Hundt, Christian
    Schmidt, Bertil
    PROCEEDINGS OF THE 48TH INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING (ICPP 2019), 2019,
  • [35] Solving Multiple Tridiagonal Systems on a Multi-GPU Platform
    Dieguez, Adrian P.
    Amor, Margarita
    Doallo, Ramon
    2018 26TH EUROMICRO INTERNATIONAL CONFERENCE ON PARALLEL, DISTRIBUTED, AND NETWORK-BASED PROCESSING (PDP 2018), 2018, : 759 - 763
  • [36] Multi-GPU based on multicriteria optimization for motion estimation system
    Carlos Garcia
    Guillermo Botella
    Fermin Ayuso
    Manuel Prieto
    Francisco Tirado
    EURASIP Journal on Advances in Signal Processing, 2013
  • [37] Multi-GPU implementation of a VMAT treatment plan optimization algorithm
    Tian, Zhen
    Peng, Fei
    Folkerts, Michael
    Tan, Jun
    Jia, Xun
    Jiang, Steve B.
    MEDICAL PHYSICS, 2015, 42 (06) : 2841 - 2852
  • [38] Adjoint Lattice Boltzmann for topology optimization on multi-GPU architecture
    Laniewski-Wollk, L.
    Rokicki, J.
    COMPUTERS & MATHEMATICS WITH APPLICATIONS, 2016, 71 (03) : 833 - 848
  • [39] Multi-GPU based on multicriteria optimization for motion estimation system
    Garcia, Carlos
    Botella, Guillermo
    Ayuso, Fermin
    Prieto, Manuel
    Tirado, Francisco
    EURASIP JOURNAL ON ADVANCES IN SIGNAL PROCESSING, 2013,
  • [40] HPSM: A Programming Framework for Multi-CPU and Multi-GPU Systems
    Lima, Joao V. F.
    Di Domenico, Daniel
    2017 INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE AND HIGH PERFORMANCE COMPUTING WORKSHOPS (SBAC-PADW), 2017, : 31 - 36