From Loop Fusion to Kernel Fusion: A Domain-Specific Approach to Locality Optimization

被引:0
|
作者
Qiao, Bo [1 ]
Reiche, Oliver [1 ]
Hannig, Frank [1 ]
Teich, Juergen [1 ]
机构
[1] Friedrich Alexander Univ Erlangen Nurnberg FAU, Erlangen, Germany
来源
PROCEEDINGS OF THE 2019 IEEE/ACM INTERNATIONAL SYMPOSIUM ON CODE GENERATION AND OPTIMIZATION (CGO '19) | 2019年
关键词
ALGORITHM;
D O I
10.5281/zenodo.2240193
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Optimizing data-intensive applications such as image processing for GPU targets with complex memory hierarchies requires to explore the tradeoffs among locality, parallelism, and computation. Loop fusion as one of the classical optimization techniques has been proven effective to improve locality at the function level. Algorithms in image processing are increasing their complexities and generally consist of many kernels in a pipeline. The inter-kernel communications are intensive and exhibit another opportunity for locality improvement at the system level. The scope of this paper is an optimization technique called kernel fusion for data locality improvement. We present a formal description of the problem by defining an objective function for locality optimization. By transforming the fusion problem to a graph partitioning problem, we propose a solution based on the minimum cut technique to search fusible kernels recursively. In addition, we develop an analytic model to quantitatively estimate potential locality improvement by incorporating domain-specific knowledge and architecture details. The proposed technique is implemented in an image processing DSL and source-to-source compiler called Hipacc, and evaluated over six image processing applications on three Nvidia GPUs. A geometric mean speedup of up to 2.52 can be observed in our experiments(1).
引用
收藏
页码:242 / 253
页数:12
相关论文
共 50 条
  • [21] Memory requirement optimization with loop fusion and loop shifting
    Hu, QB
    Palkovic, M
    Kjeldsberg, PG
    PROCEEDINGS OF THE EUROMICRO SYSTEMS ON DIGITAL SYSTEM DESIGN, 2004, : 272 - 278
  • [22] Probing human beta(1)- and beta(2)-adrenoceptors with domain-specific fusion protein antibodies
    Jahns, R
    Siegmund, C
    Jahns, V
    Reilander, H
    Maidhof, A
    MullerEsterl, W
    Lohse, MJ
    Boege, F
    EUROPEAN JOURNAL OF PHARMACOLOGY, 1996, 316 (01) : 111 - 121
  • [23] Probing human beta(1)- and beta(2)-adrenoceptors with domain-specific fusion protein antibodies
    Jahns, R
    Siegmund, C
    Jahns, V
    Reilander, H
    Maidhof, A
    MullerEsterl, W
    Lohse, MJ
    Boege, F
    EUROPEAN JOURNAL OF PHARMACOLOGY, 1997, 334 (01) : 115 - 126
  • [24] Scalan: A Framework for Domain-Specific Hotspot Optimization
    Slesarenko, Alexander
    Romanov, Alexey
    FHPC'15 PROCEEDINGS OF THE 4TH ACM SIGPLAN WORKSHOP ON FUNCTIONAL HIGH-PERFORMANCE COMPUTING, 2015, : 54 - 54
  • [25] Kernel-Based Learning for Domain-Specific Relation Extraction
    Basili, Roberto
    Giannone, Cristina
    Del Vescovo, Chiara
    Moschitti, Alessandro
    Naggar, Paolo
    AI (ASTERISK) IA 2009: EMERGENT PERSPECTIVES IN ARTIFICIAL INTELLIGENCE, 2009, 5883 : 161 - +
  • [26] Domain-specific optimization strategy for skeleton programs
    Emoto, Kento
    Matsuzaki, Kiminori
    Hu, Zhenjiang
    Takeichi, Masato
    EURO-PAR 2007 PARALLEL PROCESSING, PROCEEDINGS, 2007, 4641 : 705 - +
  • [27] A DEEP LEARNING APPROACH TO MULTIPLE KERNEL FUSION
    Song, Huan
    Thiagarajan, Jayaraman J.
    Sattigeri, Prasanna
    Ramamurthy, Karthikeyan Natesan
    Spanias, Andreas
    2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 2292 - 2296
  • [28] The domain-specific approach of working memory training
    Peng, Peng
    Swanson, H. Lee
    DEVELOPMENTAL REVIEW, 2022, 65
  • [29] A Domain-Specific Modeling Approach for Gestural Interaction
    Deshayes, Romuald
    2013 IEEE SYMPOSIUM ON VISUAL LANGUAGES AND HUMAN-CENTRIC COMPUTING (VL/HCC 2013), 2013, : 181 - 182
  • [30] Domain-Specific Approach to Software Development for Microcontrollers
    Sedov, Boris
    Pakharev, Sergey
    Syschikov, Alexey
    Ivanov, Vera
    PROCEEDINGS OF THE 17TH CONFERENCE OF OPEN INNOVATIONS ASSOCIATION FRUCT, 2015, : 179 - 185