Optimizing process allocation of parallel programs for heterogeneous clusters

被引:3
|
作者
Ichikawa, Shuichi [1 ,2 ]
Takahashi, Sho [1 ]
Kawai, Yuu [1 ]
机构
[1] Toyohashi Univ Technol, Dept Knowledge Based Informat Engn, Aichi 4418580, Japan
[2] Toyohashi Univ Technol, Intelligent Sensing Syst Res Ctr, Aichi 4418580, Japan
来源
基金
日本学术振兴会;
关键词
heterogeneous cluster; high-performance computing; performance evaluation; multiprocessing; optimization; HIGH-PERFORMANCE; IMPLEMENTATION; MPI;
D O I
10.1002/cpe.1349
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
The performance of a conventional parallel application is often degraded by load-imbalance on heterogeneous clusters. Although it is simple to invoke multiple processes on fast processing elements to alleviate load-imbalance, the optimal process allocation is not obvious. Kishimoto and Ichikawa presented performance models for high-performance Linpack (HPL), with which the sub-optimal configurations of heterogeneous clusters were actually estimated. Their results on HPL are encouraging, whereas their approach is not yet verified with other applications. This study presents some enhancements of Kishimoto's scheme, which are evaluated with four typical scientific applications: computational fluid dynamics (CFD), finite-element method (FEM), HPL (linear algebraic system), and fast Fourier transform (FFT). According to our experiments, our new models (NP-T models) are superior to Kishimoto's models, particularly when the non-negative least squares method is used for parameter extraction. The average errors of the derived models were 0.2% for the CFD benchmark, 2% for the FEM benchmark, 1% for HPL, and 28% for the FFT benchmark. This study also emphasizes the importance of predictability in clusters, listing practical examples derived from our study. Copyright (C) 2008 John Wiley & Sons, Ltd.
引用
收藏
页码:475 / 507
页数:33
相关论文
共 50 条
  • [1] Evaluation of process migration for parallel heterogeneous workstation clusters
    Dantas, MAR
    PROCEEDINGS OF THE HIGH-PERFORMANCE COMPUTING (HPC'98), 1998, : 147 - 150
  • [2] Evaluation of process migration for parallel heterogeneous workstation clusters
    Dantas, MAR
    EURO-PAR '98 PARALLEL PROCESSING, 1998, 1470 : 397 - 400
  • [3] A synthesis of parallel out-of-core sorting programs on heterogeneous clusters
    Cérin, C
    Fkaier, H
    Jemni, M
    CCGRID 2003: 3RD IEEE/ACM INTERNATIONAL SYMPOSIUM ON CLUSTER COMPUTING AND THE GRID, PROCEEDINGS, 2003, : 78 - 85
  • [4] Optimizing Heterogeneous Task Allocation for Edge Compute Micro Clusters Using PSO Metaheuristic
    Alhaizaey, Yousef
    Singer, Jeremy
    Michala, Anna Lito
    2022 SEVENTH INTERNATIONAL CONFERENCE ON FOG AND MOBILE EDGE COMPUTING, FMEC, 2022, : 33 - 40
  • [5] Tarazu: Optimizing MapReduce On Heterogeneous Clusters
    Ahmad, Faraz
    Chakradhar, Srimat
    Raghunathan, Anand
    Vijaykumar, T. N.
    ACM SIGPLAN NOTICES, 2012, 47 (04) : 61 - 74
  • [6] Sparker: Optimizing Spark for Heterogeneous Clusters
    Garg, Nishank
    Janakiram, Dharanipragada
    2018 16TH IEEE INTERNATIONAL CONFERENCE ON CLOUD COMPUTING TECHNOLOGY AND SCIENCE (CLOUDCOM 2018), 2018, : 1 - 8
  • [7] An algebraic framework for optimizing parallel programs
    Satoh, I
    SOFTWARE ENGINEERING FOR PARALLEL AND DISTRIBUTED SYSTEMS - INTERNATIONAL SYMPOSIUM PROCEEDINGS, 1998, : 28 - 38
  • [8] OPTIMIZING PARALLEL PROGRAMS WITH EXPLICIT SYNCHRONIZATION
    KRISHNAMURTHY, A
    YELICK, K
    SIGPLAN NOTICES, 1995, 30 (06): : 196 - 204
  • [9] On a scheme for parallel sorting on heterogeneous clusters
    Cérin, C
    Gaudiot, JL
    FUTURE GENERATION COMPUTER SYSTEMS, 2002, 18 (03) : 353 - 372
  • [10] A parallel computational model for heterogeneous clusters
    Bosque, Jose Luis
    Pastor, Luis
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2006, 17 (12) : 1390 - 1400