Adaptation of Algorithms for efficient execution on GPUs

被引:0
|
作者
Bulavintsev, Vadim G. [1 ]
Zhdanov, Dmitry D. [2 ]
机构
[1] Delft Univ Technol, Delft, Netherlands
[2] ITMO Univ, St Petersburg, Russia
来源
OPTICAL DESIGN AND TESTING XI | 2021年 / 11895卷
关键词
GPU; SIMD; control flow graph; loop optimization; DPLL; resnet;
D O I
10.1117/12.2601619
中图分类号
O43 [光学];
学科分类号
070207 ; 0803 ;
摘要
We propose a generalized method for adapting and optimizing algorithms for efficient execution on modern graphics processing units (GPU). The method consists of several steps. First, build a control flow graph (CFG) of the algorithm. Next, transform the CFG into a tree of loops and merge non-parallelizable loops into parallelizable ones. Finally, map the resulting loops tree to the tree of GPU computational units, unrolling the algorithm's loops as necessary for the match. The method provides a convenient and robust mental framework and strategy for GPU code optimization. We demonstrate the method by adapting a backtracking search algorithm to the GPU platform and building an optimized implementation of the ResNeXt-50 neural network.
引用
收藏
页数:8
相关论文
共 50 条
  • [21] Efficient Execution of Dynamic Programming Algorithms on Apache Spark
    Javanmard, Mohammad Mahdi
    Ahmad, Zafar
    Zola, Jaroslaw
    Pouchet, Louis-Noel
    Chowdhury, Rezaul
    Harrison, Robert
    2020 IEEE INTERNATIONAL CONFERENCE ON CLUSTER COMPUTING (CLUSTER 2020), 2020, : 337 - 348
  • [22] Heterogeneous Isolated Execution for Commodity GPUs
    Jang, Insu
    Tang, Adrian
    Kim, Taehoon
    Sethumadhavan, Simha
    Huh, Jaehyuk
    TWENTY-FOURTH INTERNATIONAL CONFERENCE ON ARCHITECTURAL SUPPORT FOR PROGRAMMING LANGUAGES AND OPERATING SYSTEMS (ASPLOS XXIV), 2019, : 455 - 468
  • [23] Graviton: Trusted Execution Environments on GPUs
    Volos, Stavros
    Vaswani, Kapil
    Bruno, Rodrigo
    PROCEEDINGS OF THE 13TH USENIX SYMPOSIUM ON OPERATING SYSTEMS DESIGN AND IMPLEMENTATION, 2018, : 681 - 696
  • [24] Autotuning of configuration for program execution in GPUs
    Balaiah, Thanasekhar
    Parthasarathi, Ranjani
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2020, 32 (09):
  • [25] On the Correctness of the SIMT Execution Model of GPUs
    Habermaier, Axel
    Knapp, Alexander
    PROGRAMMING LANGUAGES AND SYSTEMS, 2012, 7211 : 316 - 335
  • [26] TAMING VOTING ALGORITHMS ON GPUS FOR AN EFFICIENT CONNECTED COMPONENT ANALYSIS ALGORITHM
    Lemaitre, Florian
    Hennequin, Arthur
    Lacassagne, Lionel
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 7903 - 7907
  • [27] Efficient GNSS Signal Acquisition with Massive Parallel Algorithms using GPUs
    Pany, T.
    Riedl, B.
    Winkel, J.
    PROCEEDINGS OF THE 23RD INTERNATIONAL TECHNICAL MEETING OF THE SATELLITE DIVISION OF THE INSTITUTE OF NAVIGATION (ION GNSS 2010), 2010, : 1889 - 1895
  • [28] Morph Algorithms on GPUs
    Nasre, Rupesh
    Burtscher, Martin
    Pingali, Keshav
    ACM SIGPLAN NOTICES, 2013, 48 (08) : 147 - 156
  • [29] Efficient Execution of Service Composition for Content Adaptation in Pervasive Computing
    Fawaz, Yaser
    Berhe, Girma
    Brunie, Lionel
    Scuturici, Vasile-Marian
    Coquil, David
    INTERNATIONAL JOURNAL OF DIGITAL MULTIMEDIA BROADCASTING, 2008, 2008
  • [30] An efficient execution of Monte Carlo simulation based on delta-tracking method using GPUs
    Okubo, Takuya
    Endo, Tomohiro
    Yamamoto, Akio
    JOURNAL OF NUCLEAR SCIENCE AND TECHNOLOGY, 2017, 54 (01) : 30 - 38