Optimizing process allocation of parallel programs for heterogeneous clusters

被引:3
|
作者
Ichikawa, Shuichi [1 ,2 ]
Takahashi, Sho [1 ]
Kawai, Yuu [1 ]
机构
[1] Toyohashi Univ Technol, Dept Knowledge Based Informat Engn, Aichi 4418580, Japan
[2] Toyohashi Univ Technol, Intelligent Sensing Syst Res Ctr, Aichi 4418580, Japan
来源
基金
日本学术振兴会;
关键词
heterogeneous cluster; high-performance computing; performance evaluation; multiprocessing; optimization; HIGH-PERFORMANCE; IMPLEMENTATION; MPI;
D O I
10.1002/cpe.1349
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
The performance of a conventional parallel application is often degraded by load-imbalance on heterogeneous clusters. Although it is simple to invoke multiple processes on fast processing elements to alleviate load-imbalance, the optimal process allocation is not obvious. Kishimoto and Ichikawa presented performance models for high-performance Linpack (HPL), with which the sub-optimal configurations of heterogeneous clusters were actually estimated. Their results on HPL are encouraging, whereas their approach is not yet verified with other applications. This study presents some enhancements of Kishimoto's scheme, which are evaluated with four typical scientific applications: computational fluid dynamics (CFD), finite-element method (FEM), HPL (linear algebraic system), and fast Fourier transform (FFT). According to our experiments, our new models (NP-T models) are superior to Kishimoto's models, particularly when the non-negative least squares method is used for parameter extraction. The average errors of the derived models were 0.2% for the CFD benchmark, 2% for the FEM benchmark, 1% for HPL, and 28% for the FFT benchmark. This study also emphasizes the importance of predictability in clusters, listing practical examples derived from our study. Copyright (C) 2008 John Wiley & Sons, Ltd.
引用
收藏
页码:475 / 507
页数:33
相关论文
共 50 条
  • [41] Static scheduling of dependent parallel tasks on heterogeneous clusters
    Barbosa, J.
    Morais, C.
    Nobrega, R.
    Monteiro, A. P.
    2005 IEEE INTERNATIONAL CONFERENCE ON CLUSTER COMPUTING (CLUSTER), 2006, : 546 - 553
  • [42] Modeling parallel computing performance for heterogeneous workstation clusters
    Shen, Jun
    Zheng, Weimin
    Shuili Fadian Xuebao/Journal of Hydroelectric Engineering, (01): : 193 - 198
  • [43] Improving parallel execution time of sorting on heterogeneous clusters
    Cérin, C
    Koskas, M
    Fkaier, H
    Jemni, M
    16TH SYMPOSIUM ON COMPUTER ARCHITECTURE AND HIGH PERFORMANCE COMPUTING, PROCEEDINGS, 2004, : 180 - 187
  • [44] ExaStamp: A Parallel Framework for Molecular Dynamics on Heterogeneous Clusters
    Cieren, Emmanuel
    Colombet, Laurent
    Pitoiset, Samuel
    Namyst, Raymond
    EURO-PAR 2014: PARALLEL PROCESSING WORKSHOPS, PT II, 2014, 8806 : 121 - 132
  • [45] Experimenting with the implementation of parallel programs on a communication heterogeneous cluster
    Macías, E
    Suárez, A
    Ojeda-Guerra, CN
    Robayna, E
    PDPTA'2001: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED PROCESSING TECHNIQUES AND APPLICATIONS, 2001, : 829 - 835
  • [46] HarpLDA plus : Optimizing Latent Dirichlet Allocation for Parallel Efficiency
    Peng, Bo
    Zhang, Bingjing
    Chen, Langshi
    Avram, Mihai
    Henschel, Robert
    Stewart, Craig
    Zhu, Shaojuan
    Mccallum, Emily
    Smith, Lisa
    Zahniser, Tom
    Omer, Jon
    Qiu, Judy
    2017 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2017, : 243 - 252
  • [47] On the use of HCM to develop a resource allocation algorithm for heterogeneous clusters
    Soares, Thiago Marques
    dos Santos, Rodrigo Weber
    Lobosco, Marcelo
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2018, 30 (24):
  • [48] Simple, efficient allocation of modelling runs on heterogeneous clusters with MPI
    Donato, David I.
    ENVIRONMENTAL MODELLING & SOFTWARE, 2017, 88 : 48 - 57
  • [49] Efficient Online Resource Allocation in Heterogeneous Clusters with Machine Variability
    Xu, Huanle
    Liu, Yang
    Lau, Wing Cheong
    Guo, Jun
    Liu, Alex
    IEEE CONFERENCE ON COMPUTER COMMUNICATIONS (IEEE INFOCOM 2019), 2019, : 478 - 486
  • [50] A unified framework for optimizing communication in data-parallel programs
    Gupta, M
    Schonberg, E
    Srinivasan, H
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 1996, 7 (07) : 689 - 704