HPC AI500 V3.0: A scalable HPC AI benchmarking framework

被引:0
|
作者
Jiang Z. [1 ,2 ]
Luo C. [1 ]
Gao W. [1 ]
Wang L. [1 ]
Zhan J. [1 ,2 ]
机构
[1] Institute of Computing Technology, Chinese Academy of Sciences, Beijing
[2] University of Chinese Academy of Sciences, Beijing
关键词
Artificial intelligence; Benchmarking; High performance computing; Scalability;
D O I
10.1016/j.tbench.2022.100083
中图分类号
学科分类号
摘要
In recent years, the convergence of High Performance Computing (HPC) and artificial intelligence (AI) makes the community desperately need a benchmark to guide the design of next-generation scalable HPC AI systems. The success of the HPL benchmarks and the affiliated TOP500 ranking indicates that scalability is the fundamental requirement to evaluate HPC systems. However, being scalable in terms of these emerging AI workloads like deep learning (DL) raises nontrivial challenges. This paper formally and systematically analyzes the factor that limits scalability in DL workloads and presents HPC AI500 v3.0, a scalable HPC AI benchmarking framework. The HPC AI500 V3.0 methodology is inspired by bagging, which utilizes the collective wisdom of an ensemble of base models and enables the benchmarks to be adaptively scalable to different scales of HPC systems. We implement HPC AI500 V3.0 in a highly customizable manner, maintaining the space of various optimization from both system and algorithm levels. By reusing the representative workloads in HPC AI500 V2.0, we evaluate HPC AI500 V3.0 on typical HPC systems, and the results show it has near-linear scalability. Furthermore, based on the customizable design, we present a case study to perform a trade-off between AI model quality and its training speed. The source code of HPC AI500 V3.0 is publicly available from the HPC AI500 project homepage https://www.benchcouncil.org/aibench/hpcai500/. © 2022 The Authors
引用
收藏
相关论文
共 50 条
  • [41] Advancing DSP into HPC, AI, and beyond: challenges, mechanisms, and future directions
    Yaohua Wang
    Chen Li
    Chang Liu
    Sheng Liu
    Yuanwu Lei
    Jian Zhang
    Yang Zhang
    Yang Guo
    CCF Transactions on High Performance Computing, 2021, 3 : 114 - 125
  • [42] "Hey CAI" - Conversational AI Enabled User Interface for HPC Tools
    Kousha, Pouya
    Jain, Arpan
    Kolli, Ayyappa
    Sainath, Prasanna
    Subramoni, Hari
    Shafi, Aamir
    Panda, Dhableswar K.
    HIGH PERFORMANCE COMPUTING, ISC HIGH PERFORMANCE 2022, 2022, 13289 : 87 - 108
  • [43] HPC & AI: the challenges and issues related to exascale for numerical modelling and simulation
    Calvin, Christophe
    JOINT INTERNATIONAL CONFERENCE ON SUPERCOMPUTING IN NUCLEAR APPLICATIONS + MONTE CARLO, SNA + MC 2024, 2024, 302
  • [44] Crescoware: A Container-Based Gateway for HPC and AI Applications in the ENEAGRID Infrastructure
    Mariano, Angelo
    D'Amato, Giulio
    Formisano, Giovanni
    Guarnieri, Guido
    Santomauro, Giuseppe
    Migliori, Silvio
    INTELLIGENT COMPUTING, VOL 1, 2022, 506 : 196 - 206
  • [45] Building and Evaluation of Cloud Storage and Datasets Services on AI and HPC Converged Infrastructure
    Tanimura, Yusuke
    Takizawa, Shinichiro
    Ogawa, Hirotaka
    Hamanishi, Takahiro
    2020 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2020, : 1992 - 2001
  • [46] A perspective on digital signal processor based leadership performance accelerator for AI and HPC
    Guo, Yang
    Wang, Yaohua
    Ma, Sheng
    FRONTIERS OF COMPUTER SCIENCE, 2025, 19 (07)
  • [47] Design of AI-Enhanced Drug Lead Optimization Workflow for HPC and Cloud
    Yang, Chih-Chieh
    Domeniconi, Giacomo
    Zhang, Leili
    Cong, Guojing
    2020 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2020, : 5861 - 5863
  • [48] Framework for Enabling Scalable Learning Game AI
    Iuhasz, Gabriel
    Munteanu, Victor Ion
    Negru, Viorel
    ENASE: PROCEEDINGS OF THE 8TH INTERNATIONAL CONFERENCE ON EVALUATION OF NOVEL APPROACHES TO SOFTWARE ENGINEERING, 2013, : 189 - 196
  • [49] IT WON'T BE AI THAT TAKES OVER YOUR JOB AS A DOCTOR: IT WILL BE ANOTHER DOCTOR USING AI': PSYCHIATRY V3.0 (FIT FOR PURPOSE)-EVOLUTION BY CO-DESIGN
    Chapman, M.
    AUSTRALIAN AND NEW ZEALAND JOURNAL OF PSYCHIATRY, 2023, 57 (01): : 177 - 178
  • [50] Building A Scalable Forward Flux Sampling Framework using Big Data and HPC
    DeFever, Ryan S.
    Hanger, Walter
    Sarupria, Sapna
    Kilgannon, Jon
    Apon, Amy W.
    Ngo, Linh B.
    PEARC '19: PROCEEDINGS OF THE PRACTICE AND EXPERIENCE IN ADVANCED RESEARCH COMPUTING ON RISE OF THE MACHINES (LEARNING), 2019,