Benchmarking multi-GPU applications on modern multi-GPU integrated systems

被引：1

作者：

Bernaschi, Massimo ^{[1
]}

Agostini, Elena ^{[2
]}

Rossetti, Davide ^{[2
]}

机构：

[1] CNR, I-00185 Rome, Italy

[2] NVIDIA, Santa Clara, CA USA

来源：

CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE | 2021年 / 33卷 / 14期

关键词：

approximate inverse; DGX-1; GPUDirec; POWER9; spin; LINEAR-SYSTEMS;

D O I：

10.1002/cpe.5470

中图分类号：

TP31 [计算机软件];

学科分类号：

081202 ; 0835 ;

摘要：

GPUs are very powerful computing accelerators that are often employed in single-device configuration. However, there is a steadily growing interest in using multiple GPUs in a concurrent way both to overcome the memory limitations of the single device and to further reduce execution times. Until recently, communication among GPUs had been carried out mainly by using networking technologies originally devised for standard CPUs with the CPU playing an active role in the communication. However, new alternatives start to be available in which a moderate number of GPUs are directly connected each other by means of proprietary technologies. We present the results of a set of experiments aimed at assessing the performance of some of these hardware/software platforms using a particularly challenging application as a benchmark. We release its source code to facilitate people interested in reproducing or extending our results.

引用

页数：15

共 50 条

[1] Modelling Multi-GPU Systems
Spampinato, Daniele G.
Elster, Anne C.
Natvig, Thorvald
PARALLEL COMPUTING: FROM MULTICORES AND GPU'S TO PETASCALE, 2010, 19 : 562 - 569
[2] Understanding Scalability of Multi-GPU Systems
Feng, Yuan
Jeon, Hyeran
15TH WORKSHOP ON GENERAL PURPOSE PROCESSING USING GPU, GPGPU 2023, 2023, : 36 - 37
[3] Evaluating Multi-GPU Sorting with Modern Interconnects
Maltenberger, Tobias
Ilic, Ivan
Tolovski, Ilin
Rabl, Tilmann
PROCEEDINGS OF THE 2022 INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA (SIGMOD '22), 2022, : 1795 - 1809
[4] Data Parallel Skeletons for GPU Clusters and Multi-GPU Systems
Ernsting, Steffen
Kuchen, Herbert
APPLICATIONS, TOOLS AND TECHNIQUES ON THE ROAD TO EXASCALE COMPUTING, 2012, 22 : 509 - 518
[5] MAPREDUCE IMPLEMENTATION WITH MULTI-GPU
Chen, Yi
Chen, Su
Jiang, Hai
INTERNATIONAL SYMPOSIUM ON COMPUTER SCIENCE & TECHNOLOGY: PROCEEDINGS, 2012, : 21 - 25
[6] Multi-GPU Graph Analytics
Pan, Yuechao
Wang, Yangzihao
Wu, Yuduo
Yang, Carl
Owens, John D.
2017 31ST IEEE INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM (IPDPS), 2017, : 479 - 490
[7] Suffix Array Construction on Multi-GPU Systems
Bueren, Florian
Juenger, Daniel
Kobus, Robin
Hundt, Christian
Schmidt, Bertil
HPDC'19: PROCEEDINGS OF THE 28TH INTERNATIONAL SYMPOSIUM ON HIGH-PERFORMANCE PARALLEL AND DISTRIBUTED COMPUTING, 2019, : 183 - 194
[8] Multi-GPU codes for spin systems simulations
Bernaschi, M.
Fatica, M.
Parisi, G.
Parisi, L.
COMPUTER PHYSICS COMMUNICATIONS, 2012, 183 (07) : 1416 - 1421
[9] Accelerating MapReduce framework on multi-GPU systems
Jiang, Hai
Chen, Yi
Qiao, Zhi
Li, Kuan-Ching
Ro, WonWoo
Gaudiot, Jean-Luc
CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2014, 17 (02): : 293 - 301
[10] Accelerating MapReduce framework on multi-GPU systems
Hai Jiang
Yi Chen
Zhi Qiao
Kuan-Ching Li
WonWoo Ro
Jean-Luc Gaudiot
Cluster Computing, 2014, 17 : 293 - 301

← 1 2 3 4 5 →