Optimizing HPC I/O Performance with Regression Analysis and Ensemble Learning

被引:1
|
作者
Liu, Zhangyu [1 ]
Zhang, Cheng [1 ]
Wu, Huijun [2 ]
Fang, Jianbin [2 ]
Peng, Lin [2 ]
Ye, Guixin
Tang, Zhanyong [1 ]
机构
[1] Northwest Univ, Sch Informat Sci & Technol, Xian, Peoples R China
[2] Natl Univ Def Technol, Coll Comp, Changsha, Peoples R China
基金
中国国家自然科学基金;
关键词
HPC; Parallel I/O; Performance Optimization; Auto-tuning; Ensemble Learning;
D O I
10.1109/CLUSTER52292.2023.00027
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
To improve parallel I/O performance, it is imperative to optimize the adjustable parameters across the different layers of the I/O software stack. Finding an optimal configuration for different scenarios is hampered by the complex interaction dynamics between these parameters and the large parameter space. Previous research efforts have focused on tuning these parameters using independent algorithms; however, these approaches exhibit certain shortcomings such as unstable performance results and delayed convergence rates. This paper introduces OPRAEL, an auto-tuning approach on parallel I/O tasks by ensembles and performance modeling using regression analysis. To test its effectiveness, we applied this approach on the Tianhe-II supercomputer using one well-known I/O benchmark(IOR) and two I/O kernels(S3D-I/O, BT-I/O). Leveraging our experience in predictive modeling, we optimized the tuning of the I/O stack parameters. Our experimental results show a remarkable 10.2X improvement in write performance speedup for the optimization task with BT-I/O and a 500x500x500 input. We also compared the potential of using a single search algorithm versus using reinforcement learning search in the I/O parameter auto-optimization task. Our results show that OPRAEL outperforms the traditional approach, resulting in a maximum 8.4X improvement in write performance for the 128-process IOR optimization.
引用
收藏
页码:234 / 246
页数:13
相关论文
共 50 条
  • [41] Understanding and Optimizing the Performance of Extended Ensemble Algorithms
    Nymeyer, Hugh
    Arthur, David R.
    BIOPHYSICAL JOURNAL, 2010, 98 (03) : 573A - 573A
  • [42] I/O performance analysis of machine learning workloads on leadership scale supercomputer
    Karimi, Ahmad Maroof
    Paul, Arnab K.
    Wang, Feiyi
    PERFORMANCE EVALUATION, 2022, 157
  • [43] Tool for performance tuning and regression analyses of HPC systems and applications
    Merchant, Saumil
    Prabhakar, Giri
    2012 19th International Conference on High Performance Computing, HiPC 2012, 2012,
  • [44] Tool for Performance Tuning and Regression Analyses of HPC Systems and Applications
    Merchant, Saumil
    Prabhakar, Giri
    2012 19TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING (HIPC), 2012,
  • [45] High Performance for Predicting Diabetic Nephropathy Using Stacking Regression of Ensemble Learning Method
    Muflikhah, Lailil
    Nurfansepta, Amira Ghina
    Bachtiar, Fitra Abdurrachman
    Ratnawati, Dian Eka
    INTERNATIONAL JOURNAL OF ONLINE AND BIOMEDICAL ENGINEERING, 2024, 20 (08) : 149 - 164
  • [46] Automatic outlier sample detection based on regression analysis and repeated ensemble learning
    Kaneko, Hiromasa
    CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2018, 177 : 74 - 82
  • [47] An active learning ensemble method for regression tasks
    Fazakis, Nikos
    Kostopoulos, Georgios
    Karlos, Stamatis
    Kotsiantis, Sotiris
    Sgarbas, Kyriakos
    INTELLIGENT DATA ANALYSIS, 2020, 24 (03) : 607 - 623
  • [48] Softmax Regression by Using Unsupervised Ensemble Learning
    Deng, Ping
    Wang, Hongjun
    Horng, Shi-Jinn
    Wang, Dexian
    Zhang, Ji
    Zhou, Hengxue
    2018 9TH INTERNATIONAL CONFERENCE ON PARALLEL ARCHITECTURES, ALGORITHMS AND PROGRAMMING (PAAP 2018), 2018, : 196 - 201
  • [49] Solving regression by learning an ensemble of decision rules
    Dembczynski, Krzysztof
    Kotlowski, Wojciech
    Slowinski, Roman
    ARTIFICIAL INTELLIGENCE AND SOFT COMPUTING - ICAISC 2008, PROCEEDINGS, 2008, 5097 : 533 - 544
  • [50] Negative Correlation Ensemble Learning for Ordinal Regression
    Fernandez-Navarro, Francisco
    Antonio Gutierrez, Pedro
    Hervas-Martinez, Cesar
    Yao, Xin
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2013, 24 (11) : 1836 - 1849