Bayesian Optimization for auto-tuning GPU kernels

Cited by: 10
Authors
Willemsen, Floris-Jan [1]
van Nieuwpoort, Rob [1]
van Werkhoven, Ben [2]
Affiliations
[1] Univ Amsterdam, Netherlands eSci Ctr, Amsterdam, Netherlands
[2] Netherlands eSci Ctr, Amsterdam, Netherlands
Funding
Dutch Research Council;
Keywords
Optimization; Bayesian Optimization; autotuning; GPU Computing; machine learning;
DOI
10.1109/PMBS54543.2021.00017
Chinese Library Classification number
TP3 [Computing technology, computer technology];
Discipline classification code
0812;
Abstract
Finding optimal parameter configurations for tunable GPU kernels is a non-trivial exercise for large search spaces, even when automated. This poses an optimization task on a nonconvex search space, using an expensive-to-evaluate function with unknown derivative. These characteristics make the problem a good candidate for Bayesian Optimization, which has not been applied to this problem before. However, the application of Bayesian Optimization to this problem is challenging. We demonstrate how to deal with rough, discrete, constrained search spaces that contain invalid configurations. We introduce a novel contextual variance exploration factor, as well as new acquisition functions with improved scalability, combined with an informed acquisition function selection mechanism. By comparing the performance of our Bayesian Optimization implementation on various test cases to the existing search strategies in Kernel Tuner, as well as to other Bayesian Optimization implementations, we demonstrate that our search strategies generalize well and consistently outperform other search strategies by a wide margin.
Pages: 106-117
Page count: 12