From Loop Fusion to Kernel Fusion: A Domain-Specific Approach to Locality Optimization

被引:0
|
作者
Qiao, Bo [1 ]
Reiche, Oliver [1 ]
Hannig, Frank [1 ]
Teich, Juergen [1 ]
机构
[1] Friedrich Alexander Univ Erlangen Nurnberg FAU, Erlangen, Germany
来源
PROCEEDINGS OF THE 2019 IEEE/ACM INTERNATIONAL SYMPOSIUM ON CODE GENERATION AND OPTIMIZATION (CGO '19) | 2019年
关键词
ALGORITHM;
D O I
10.5281/zenodo.2240193
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Optimizing data-intensive applications such as image processing for GPU targets with complex memory hierarchies requires to explore the tradeoffs among locality, parallelism, and computation. Loop fusion as one of the classical optimization techniques has been proven effective to improve locality at the function level. Algorithms in image processing are increasing their complexities and generally consist of many kernels in a pipeline. The inter-kernel communications are intensive and exhibit another opportunity for locality improvement at the system level. The scope of this paper is an optimization technique called kernel fusion for data locality improvement. We present a formal description of the problem by defining an objective function for locality optimization. By transforming the fusion problem to a graph partitioning problem, we propose a solution based on the minimum cut technique to search fusible kernels recursively. In addition, we develop an analytic model to quantitatively estimate potential locality improvement by incorporating domain-specific knowledge and architecture details. The proposed technique is implemented in an image processing DSL and source-to-source compiler called Hipacc, and evaluated over six image processing applications on three Nvidia GPUs. A geometric mean speedup of up to 2.52 can be observed in our experiments(1).
引用
收藏
页码:242 / 253
页数:12
相关论文
共 50 条
  • [31] Reading motivation: A domain-specific approach to motivation
    Wigfield, A
    EDUCATIONAL PSYCHOLOGIST, 1997, 32 (02) : 59 - 68
  • [32] An approach for the systematic development of domain-specific languages
    Strembeck, Mark
    Zdun, Uwe
    SOFTWARE-PRACTICE & EXPERIENCE, 2009, 39 (15): : 1253 - 1292
  • [33] A domain-specific language approach to programmable networks
    Muller, G
    Lawall, JL
    Thibault, S
    Jensen, REV
    IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART C-APPLICATIONS AND REVIEWS, 2003, 33 (03): : 370 - 381
  • [34] An Approach of Defining Domain Constraints for Domain-Specific Modeling Language
    Jiang, Tao
    Zhou, Weihong
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2021, 35 (09)
  • [35] An Approach to Mine Business Rule Intents from Domain-specific Documents
    Bhattacharyya, Abhidip
    Chittimalli, Pavan Kumar
    Naik, Ravindra
    PROCEEDINGS OF THE 10TH INNOVATIONS IN SOFTWARE ENGINEERING CONFERENCE, 2017, : 96 - 106
  • [36] Investigation into the distinct structural features of the internal fusion loop within the LASV fusion domain
    Pennington, Hallie N.
    Shi, Zoe W.
    Lee, Jinwoo
    BIOPHYSICAL JOURNAL, 2023, 122 (03) : 499A - 499A
  • [37] Incorporating domain-specific heuristics in a particle swarm optimization approach to the quadratic assignment problem
    Ayah M. Helal
    Ashraf M. Abdelbar
    Memetic Computing, 2014, 6 : 241 - 254
  • [38] Incorporating domain-specific heuristics in a particle swarm optimization approach to the quadratic assignment problem
    Helal, Ayah M.
    Abdelbar, Ashraf M.
    MEMETIC COMPUTING, 2014, 6 (04) : 241 - 254
  • [39] Domain-Specific Optimization of Signal Recognition Targeting FPGAs
    Demertzi, Melina
    Diniz, Pedro C.
    Hall, Mary W.
    Gilbert, Anna C.
    Wang, Yi
    ACM TRANSACTIONS ON RECONFIGURABLE TECHNOLOGY AND SYSTEMS, 2011, 4 (02)
  • [40] UTILIZING DOMAIN-SPECIFIC INFORMATION FOR THE OPTIMIZATION OF LOGISTICS NETWORKS
    Rabe, Markus
    Ammouriova, Majsa
    Schmitt, Dominik
    2018 WINTER SIMULATION CONFERENCE (WSC), 2018, : 2873 - 2884