Evaluation of an MPI-based Implementation of the Tascell Task-Parallel Language on Massively Parallel Systems

被引:3
|
作者
Muraoka, Daisuke [1 ]
Yasugi, Masahiro [2 ]
Hiraishi, Tasuku [3 ]
Umatani, Seiji [4 ]
机构
[1] Kyushu Inst Technol, Grad Sch Comp Sci & Syst Engn, Kitakyushu, Fukuoka, Japan
[2] Kyushu Inst Technol, Dept Artificial Intelligence, Kitakyushu, Fukuoka, Japan
[3] Kyoto Univ, Acad Ctr Comp & Media Studies, Kyoto 6068501, Japan
[4] Kyoto Univ, Grad Sch Infomat, Kyoto 6068501, Japan
关键词
D O I
10.1109/ICPPW.2016.36
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Tascell is a task parallel language that supports distributed memory environments. The conventional implementation of Tascell realizes inter-node communication with TCP/IP communication via Tascell servers. This implementation is suitable for dynamic addition of computation nodes and wide-area distributed environments. On the other hand, in supercomputer environments, TCP/IP may not be available for inter-node communication and there may be no appropriate places for deploying Tascell servers. In this study, we have developed a server-less implementation of Tascell that realizes inter-node communication with MPI communication in order to evaluate its performance on massively parallel systems. It performs well on four Xeon Phi coprocessors (with 456 workers) and the K computer; for instance, our 19-queens solver achieves a 4615-fold speedup relative to a serial implementation with 7168 workers on the K computer. Our server-less implementation realizes deadlock freedom, although it only requires the two-sided communication paradigm and the MPI_THREAD_FUNNELED support level. On Xeon Phi coprocessors, we compare our implementation with other implementations that employ TCP/IP or the MPI_THREAD_MULTIPLE support level.
引用
收藏
页码:161 / 170
页数:10
相关论文
共 50 条
  • [41] Study on High Performance of MPI-Based Parallel FDTD from WorkStation to Super Computer Platform
    He, Z. L.
    Huang, K.
    Zhang, Y.
    Yan, Y.
    Liang, C. H.
    INTERNATIONAL JOURNAL OF ANTENNAS AND PROPAGATION, 2012, 2012
  • [42] Prediction of wind energy location by parallel programming using MPI-based KMEANS clustering algorithm
    Ponnuswamy, Priya
    Palaniappan, Shabariram Chokkalingam
    ENERGY SOURCES PART A-RECOVERY UTILIZATION AND ENVIRONMENTAL EFFECTS, 2024, 46 (01) : 5451 - 5473
  • [43] A MPI-Based Parallel Pyramid Building Algorithm for Large-Scale Remote Sensing Images
    He, Gaojin
    Xiong, Wei
    Chen, Luo
    Wu, Qiuyun
    Jing, Ning
    2015 23RD INTERNATIONAL CONFERENCE ON GEOINFORMATICS, 2015,
  • [44] Kokkos/Qthreads Task-Parallel Approach to Linear Algebra Based Graph Analytics
    Wolf, Michael M.
    Edwards, H. Carter
    Olivier, Stephen L.
    2016 IEEE HIGH PERFORMANCE EXTREME COMPUTING CONFERENCE (HPEC), 2016,
  • [45] COHESION - A microkernel based Desktop Grid platform for irregular task-parallel applications
    Schulz, Sven
    Blochinger, Wolfgang
    Held, Markus
    Dangelmayr, Clemens
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2008, 24 (05): : 354 - 370
  • [46] Design and Implementation of a Criticality- and Heterogeneity-Aware Runtime System for Task-Parallel Applications
    Han, Myeonggyun
    Park, Jinsu
    Baek, Woongki
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2021, 32 (05) : 1117 - 1132
  • [47] The design and implementation of the massively parallel processor based on the matrix architecture
    Noda, Hideyuki
    Nakajima, Masami
    Dosaka, Katsumi
    Nakata, Kiyoshi
    Higashida, Motoki
    Yamamoto, Osamu
    Mizumoto, Katsuya
    Tanizaki, Tetsushi
    Gyohten, Takayuki
    Okuno, Yoshihiro
    Kondo, Hiroyuki
    Shimazu, Yukihiko
    Arimoto, Kazutami
    IEEE JOURNAL OF SOLID-STATE CIRCUITS, 2007, 42 (01) : 183 - 192
  • [48] Implementation and evaluation of parallel edge detector on cluster of workstations using MPI
    Patel, N
    Mahgoub, IO
    10TH INTERNATIONAL CONFERENCE ON COMPUTER APPLICATIONS IN INDUSTRY AND ENGINEERING, 1997, : 102 - 105
  • [49] Static Compiler Analyses for Application-specific Optimization of Task-Parallel Runtime Systems
    Peter Thoman
    Peter Zangerl
    Thomas Fahringer
    Journal of Signal Processing Systems, 2019, 91 : 303 - 320
  • [50] Static Compiler Analyses for Application-specific Optimization of Task-Parallel Runtime Systems
    Thoman, Peter
    Zangerl, Peter
    Fahringer, Thomas
    JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2019, 91 (3-4): : 303 - 320