Adaptation of Algorithms for efficient execution on GPUs

被引:0
|
作者
Bulavintsev, Vadim G. [1 ]
Zhdanov, Dmitry D. [2 ]
机构
[1] Delft Univ Technol, Delft, Netherlands
[2] ITMO Univ, St Petersburg, Russia
来源
OPTICAL DESIGN AND TESTING XI | 2021年 / 11895卷
关键词
GPU; SIMD; control flow graph; loop optimization; DPLL; resnet;
D O I
10.1117/12.2601619
中图分类号
O43 [光学];
学科分类号
070207 ; 0803 ;
摘要
We propose a generalized method for adapting and optimizing algorithms for efficient execution on modern graphics processing units (GPU). The method consists of several steps. First, build a control flow graph (CFG) of the algorithm. Next, transform the CFG into a tree of loops and merge non-parallelizable loops into parallelizable ones. Finally, map the resulting loops tree to the tree of GPU computational units, unrolling the algorithm's loops as necessary for the match. The method provides a convenient and robust mental framework and strategy for GPU code optimization. We demonstrate the method by adapting a backtracking search algorithm to the GPU platform and building an optimized implementation of the ResNeXt-50 neural network.
引用
收藏
页数:8
相关论文
共 50 条
  • [31] LINEAR-ARRAY FOR EFFICIENT EXECUTION OF PARTITIONED MATRIX ALGORITHMS
    MORENO, JH
    LANG, T
    REAL-TIME SIGNAL PROCESSING XII, 1989, 1154 : 102 - 117
  • [32] Efficient parallel execution of genetic algorithms on Epiphany manycore processor
    Faber, Lukasz
    Boryczko, Krzysztof
    PROCEEDINGS OF THE 2016 FEDERATED CONFERENCE ON COMPUTER SCIENCE AND INFORMATION SYSTEMS (FEDCSIS), 2016, 8 : 865 - 872
  • [33] iGPU: Exception Support and Speculative Execution on GPUs
    Menon, Jaikrishnan
    de Kruijf, Marc
    Sankaralingam, Karthikeyan
    2012 39TH ANNUAL INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE (ISCA), 2012, : 72 - 83
  • [34] Software Pipelined Execution of Stream Programs on GPUs
    Udupa, Abhishek
    Govindarajan, R.
    Thazhuthaveetil, Matthew J.
    CGO 2009: INTERNATIONAL SYMPOSIUM ON CODE GENERATION AND OPTIMIZATION, PROCEEDINGS, 2009, : 200 - 209
  • [35] Performance and Power Prediction for Concurrent Execution on GPUs
    Moolchandani, Diksha
    Kumar, Anshul
    Sarangi, Smruti R.
    ACM TRANSACTIONS ON ARCHITECTURE AND CODE OPTIMIZATION, 2022, 19 (03)
  • [36] Adaptation of System Dynamics Model Execution Algorithms for Cloud-based Environment
    Mulyukin, Alexey
    Perl, Ivan
    PROCEEDINGS OF THE 2018 22ND CONFERENCE OF OPEN INNOVATIONS ASSOCIATION (FRUCT), 2018, : 179 - 189
  • [37] Split-bucket partition (SBP): a novel execution model for top-K and selection algorithms on GPUs
    Yang, Yiqing
    Zhang, Guoyin
    Wu, Yanxia
    Zhao, Zhixiang
    Fu, Yan
    JOURNAL OF SUPERCOMPUTING, 2024, 80 (11): : 15122 - 15160
  • [38] Analysis of Classic Algorithms on GPUs
    Ma, Lin
    Chamberlain, Roger D.
    Agrawal, Kunal
    2014 INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING & SIMULATION (HPCS), 2014, : 65 - 73
  • [39] Incremental Timeline-Based Planning for Efficient Plan Execution and Adaptation
    De Benedictis, Riccardo
    Beraldo, Gloria
    Cesta, Amedeo
    Cortellessa, Gabriella
    AIXIA 2022 - ADVANCES IN ARTIFICIAL INTELLIGENCE, 2023, 13796 : 225 - 240
  • [40] Scheduling Algorithms for Efficient Execution of Stream Workflow Applications in Multicloud Environments
    Barika, Mutaz
    Garg, Saurabh
    Chan, Andrew
    Calheiros, Rodrigo N.
    IEEE TRANSACTIONS ON SERVICES COMPUTING, 2022, 15 (02) : 860 - 875