HPC AI500 V3.0: A scalable HPC AI benchmarking framework

被引:0
|
作者
Jiang Z. [1 ,2 ]
Luo C. [1 ]
Gao W. [1 ]
Wang L. [1 ]
Zhan J. [1 ,2 ]
机构
[1] Institute of Computing Technology, Chinese Academy of Sciences, Beijing
[2] University of Chinese Academy of Sciences, Beijing
关键词
Artificial intelligence; Benchmarking; High performance computing; Scalability;
D O I
10.1016/j.tbench.2022.100083
中图分类号
学科分类号
摘要
In recent years, the convergence of High Performance Computing (HPC) and artificial intelligence (AI) makes the community desperately need a benchmark to guide the design of next-generation scalable HPC AI systems. The success of the HPL benchmarks and the affiliated TOP500 ranking indicates that scalability is the fundamental requirement to evaluate HPC systems. However, being scalable in terms of these emerging AI workloads like deep learning (DL) raises nontrivial challenges. This paper formally and systematically analyzes the factor that limits scalability in DL workloads and presents HPC AI500 v3.0, a scalable HPC AI benchmarking framework. The HPC AI500 V3.0 methodology is inspired by bagging, which utilizes the collective wisdom of an ensemble of base models and enables the benchmarks to be adaptively scalable to different scales of HPC systems. We implement HPC AI500 V3.0 in a highly customizable manner, maintaining the space of various optimization from both system and algorithm levels. By reusing the representative workloads in HPC AI500 V2.0, we evaluate HPC AI500 V3.0 on typical HPC systems, and the results show it has near-linear scalability. Furthermore, based on the customizable design, we present a case study to perform a trade-off between AI model quality and its training speed. The source code of HPC AI500 V3.0 is publicly available from the HPC AI500 project homepage https://www.benchcouncil.org/aibench/hpcai500/. © 2022 The Authors
引用
收藏
相关论文
共 50 条
  • [31] Stimulus: Accelerate Data Management for Scientific AI applications in HPC
    Devarajan, Hariharan
    Kougkas, Anthony
    Zheng, Huihuo
    Vishwanath, Venkatram
    Sun, Xian-He
    2022 22ND IEEE/ACM INTERNATIONAL SYMPOSIUM ON CLUSTER, CLOUD AND INTERNET COMPUTING (CCGRID 2022), 2022, : 109 - 118
  • [32] Star-gen: an HPC-AI framework for constructing large-scale computational materials databaseStar-gen: an HPC-AI framework for constructing large-scale...P. Chen et al.
    Pin Chen
    Qing Mo
    Zexin Xu
    Xianwei Zhang
    Yutong Lu
    CCF Transactions on High Performance Computing, 2025, 7 (2) : 85 - 99
  • [33] FASE: A framework for scalable performance prediction of HPC systems and applications
    Grobelny, Eric
    Bueno, David
    Troxel, Ian
    George, Alan D.
    Vetter, Jeffrey S.
    SIMULATION-TRANSACTIONS OF THE SOCIETY FOR MODELING AND SIMULATION INTERNATIONAL, 2007, 83 (10): : 721 - 745
  • [34] Enabling dynamic and intelligent workflows for HPC, data analytics, and AI convergence
    Ejarque, Jorge
    Badia, Rosa M.
    Albertin, Loic
    Aloisio, Giovanni
    Baglione, Enrico
    Becerra, Yolanda
    Boschert, Stefan
    Berlin, Julian R.
    D'Anca, Alessandro
    Elia, Donatello
    Exertier, Francois
    Fiore, Sandro
    Flich, Jose
    Folch, Arnau
    Gibbons, Steven J.
    Koldunov, Nikolay
    Lordan, Francesc
    Lorito, Stefano
    Lovholt, Finn
    Macias, Jorge
    Marozzo, Fabrizio
    Michelini, Alberto
    Monterrubio-Velasco, Marisol
    Pienkowska, Marta
    de la Puente, Josep
    Queralt, Anna
    Quintana-Orti, Enrique S.
    Rodriguez, Juan E.
    Romano, Fabrizio
    Rossi, Riccardo
    Rybicki, Jedrzej
    Kupczyk, Miroslaw
    Selva, Jacopo
    Talia, Domenico
    Tonini, Roberto
    Trunfio, Paolo
    Volpe, Manuela
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2022, 134 : 414 - 429
  • [35] Advancing biomolecular simulation through exascale HPC, AI and quantum computing
    Pyzer-Knapp, Edward O.
    Curioni, Alessandro
    CURRENT OPINION IN STRUCTURAL BIOLOGY, 2024, 87
  • [36] Advancing DSP into HPC, AI, and beyond: challenges, mechanisms, and future directions
    Wang, Yaohua
    Li, Chen
    Liu, Chang
    Liu, Sheng
    Lei, Yuanwu
    Zhang, Jian
    Zhang, Yang
    Guo, Yang
    CCF TRANSACTIONS ON HIGH PERFORMANCE COMPUTING, 2021, 3 (01) : 114 - 125
  • [37] Heterogeneous Integration Technologies: Driving a new era for HPC and AI/ML
    Rice, Rich
    Cao, Lihong
    Advancing Microelectronics, 2020, 47 (03): : 8 - 12
  • [38] Edge HPC Architectures for AI-Based Video Surveillance Applications
    Rossi, Federico
    Saponara, Sergio
    ELECTRONICS, 2024, 13 (09)
  • [40] Hyperparameter optimization of data-driven AI models on HPC systems
    Wulff, Eric
    Girone, Maria
    Pata, Joosep
    20TH INTERNATIONAL WORKSHOP ON ADVANCED COMPUTING AND ANALYSIS TECHNIQUES IN PHYSICS RESEARCH, 2023, 2438