A Survey on Malleability Solutions for High-Performance Distributed Computing

被引:6
|
作者
Aliaga, Jose, I [1 ]
Castillo, Maribel [1 ]
Iserte, Sergio [1 ]
Martin-Alvarez, Iker [1 ]
Mayo, Rafael [1 ]
机构
[1] Univ Jaume 1, Dept Ingn & Ciencia Comp, Castellon de La Plana 12006, Spain
来源
APPLIED SCIENCES-BASEL | 2022年 / 12卷 / 10期
关键词
exascale; job reconfiguration; MPI; data redistribution; resource management; adaptive workloads; MPI APPLICATIONS; FRAMEWORK;
D O I
10.3390/app12105231
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Maintaining a high rate of productivity, in terms of completed jobs per unit of time, in High-Performance Computing (HPC) facilities is a cornerstone in the next generation of exascale supercomputers. Process malleability is presented as a straightforward mechanism to address that issue. Nowadays, the vast majority of HPC facilities are intended for distributed-memory applications based on the Message Passing (MP) paradigm. For this reason, many efforts are based on the Message Passing Interface (MPI), the de facto standard programming model. Malleability aims to rescale executions on-the-fly, in other words, reconfigure the number and layout of processes in running applications. Process malleability involves resources reallocation within the HPC system, handling processes of the application, and redistributing data among those processes to resume the execution. This manuscript compiles how different frameworks address process malleability, their main features, their integration in resource management systems, and how they may be used in user codes. This paper is a detailed state-of-the-art devised as an entry point for researchers who are interested in process malleability.
引用
收藏
页数:32
相关论文
共 50 条
  • [1] Malleability techniques applications in high-performance computing
    Carretero, Jesus
    Suarez, Estela
    Schulz, Martin
    INTERNATIONAL JOURNAL OF HIGH PERFORMANCE COMPUTING APPLICATIONS, 2024, 38 (02): : 53 - 54
  • [2] Malleability Techniques and Applications in High-Performance Computing (HPCMALL 2022)
    Carretero, Jesus
    Schulz, Martin
    Suarez, Estela
    HIGH PERFORMANCE COMPUTING, ISC HIGH PERFORMANCE 2022 INTERNATIONAL WORKSHOPS, 2022, 13387 : 124 - 128
  • [3] HIGH-PERFORMANCE DISTRIBUTED COMPUTING
    RAGHAVENDRA, CS
    CONCURRENCY-PRACTICE AND EXPERIENCE, 1994, 6 (04): : 231 - 233
  • [4] High-Performance Distributed Computing with Smartphones
    Ishikawa, Nadeem
    Nomura, Hayato
    Yoda, Yuya
    Uetsuki, Osamu
    Fukunaga, Keisuke
    Nagoya, Seiji
    Sawara, Junya
    Ishihata, Hiroaki
    Senoguchi, Junsuke
    EURO-PAR 2023: PARALLEL PROCESSING WORKSHOPS, PT II, EURO-PAR 2023, 2024, 14352 : 229 - 232
  • [5] The Form of High-Performance Computing: A Survey
    Assiroj, Priati
    Warnars, H. L. H. S.
    Kosala, R.
    Ranti, B.
    Supangat, S.
    Kistijantoro, A., I
    Abdurrachman, E.
    2ND INTERNATIONAL CONFERENCE ON INFORMATICS, ENGINEERING, SCIENCE, AND TECHNOLOGY (INCITEST 2019), 2019, 662
  • [6] HIGH-PERFORMANCE DISTRIBUTED COMPUTING - PROMISES AND CHALLENGES
    HARIRI, S
    VARMA, A
    CONCURRENCY-PRACTICE AND EXPERIENCE, 1993, 5 (04): : 233 - 238
  • [7] The BORG distributed architecture for high-performance computing
    Mou, ZG
    Duong, L
    Donuhue, D
    Ku, HC
    APPLICATIONS OF HIGH-PERFORMANCE COMPUTING IN ENGINEERING VI, 2000, 6 : 399 - 408
  • [8] A Survey of Communication Performance Models for High-Performance Computing
    Rico-Gallego, Juan A.
    Diaz-Martin, Juan C.
    Manumachu, Ravi Reddy
    Lastovetsky, Alexey L.
    ACM COMPUTING SURVEYS, 2019, 51 (06) : 1 - 36
  • [9] Web Portals for High-performance Computing: A Survey
    Calegari, Patrice
    Levrier, Marc
    Balczynski, Pawel
    ACM TRANSACTIONS ON THE WEB, 2019, 13 (01)
  • [10] A survey of high-performance computing scaling challenges
    Geist, Al
    Reed, Daniel A.
    INTERNATIONAL JOURNAL OF HIGH PERFORMANCE COMPUTING APPLICATIONS, 2017, 31 (01): : 104 - 113