Dtree: Dynamic Task Scheduling at Petascale

被引:2
|
作者
Pamnany, Kiran [1 ]
Misra, Sanchit [1 ]
Md, Vasimuddin [2 ]
Liu, Xing [3 ]
Chow, Edmond [4 ]
Aluru, Srinivas [4 ]
机构
[1] Intel Corp, Parallel Comp Lab, Bangalore, Karnataka, India
[2] Indian Inst Technol, Dept Comp Sci & Engn, Bombay, Maharashtra, India
[3] IBM TJ Watson Res Ctr, Yorktown Hts, NY USA
[4] Georgia Inst Technol, Sch Computat Sci & Engn, Atlanta, GA 30332 USA
来源
HIGH PERFORMANCE COMPUTING, ISC HIGH PERFORMANCE 2015 | 2015年 / 9137卷
关键词
Petascale; Dynamic scheduling; Load balance;
D O I
10.1007/978-3-319-20119-1_10
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Irregular applications are challenging to scale on supercomputers due to the difficulty of balancing load across large numbers of nodes. This challenge is exacerbated by the increasing heterogeneity of modern supercomputers in which nodes often contain multiple processors and coprocessors operating at different speeds, and with differing core and thread counts. We present Dtree, a dynamic task scheduler designed to address this challenge. Dtree shows close to optimal results for a class of HPC applications, improving time-to-solution by achieving near-perfect load balance while consuming negligible resources. We demonstrate Dtree's effectiveness on up to 77,824 heterogeneous cores of the TACC Stampede supercomputer with two different petascale HPC applications: ParaBLe, which performs large-scale Bayesian network structure learning, and GTFock, which implements Fock matrix construction, an essential and expensive step in quantum chemistry codes. For ParaBLe, we show improved performance while eliminating the complexity of managing heterogeneity. For GTFock, we match the most recently published performance without using any application-specific optimizations for data access patterns (such as the task distribution design for communication reduction) that enabled that performance. We also show that Dtree can distribute from tens of thousands to hundreds of millions of irregular tasks across up to 1024 nodes with minimal overhead, while balancing load to within 2% of optimal.
引用
收藏
页码:122 / 138
页数:17
相关论文
共 50 条
  • [21] Dynamic Scheduling Strategy for Testing Task in Cloud Computing
    Lou, Yang
    Zhang, Tao
    Yan, Jing
    Li, Kun
    Jiang, Yechun
    Wang, Haipeng
    Cheng, Jing
    2014 6TH INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND COMMUNICATION NETWORKS, 2014, : 633 - 636
  • [22] Dynamic task-level voltage scheduling optimizations
    Barnett, JA
    IEEE TRANSACTIONS ON COMPUTERS, 2005, 54 (05) : 508 - 520
  • [23] Dynamic real-time task scheduling on hypercubes
    Mohapatra, P
    JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 1997, 46 (01) : 91 - 100
  • [24] Dynamic task scheduling using a directed neural network
    Tripathy, Binodini
    Dash, Smita
    Padhy, Sasmita Kumari
    JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2015, 75 : 101 - 106
  • [25] The Dynamic Delay Scheduling Algorithm Based on Task Classification
    Zhao, Bing
    Liu, Yun
    Shen, Bo
    2015 FIRST INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE THEORY, SYSTEMS AND APPLICATIONS (CCITSA 2015), 2015, : 92 - 97
  • [26] Hybrid task scheduling: Integrating static and dynamic heuristics
    Boeres, C
    Lima, A
    Rebello, VEF
    15TH SYMPOSIUM ON COMPUTER ARCHITECTURE AND HIGH PERFORMANCE COMPUTING, PROCEEDINGS, 2003, : 199 - 206
  • [27] A Dynamic Task Scheduling Algorithm for Airborne Device Clouds
    Deng, Bao
    Zhai, Zhengjun
    International Journal of Aerospace Engineering, 2024, 2024
  • [28] Dynamic on-line task scheduling on parallel processors
    Xia, CH
    Michailidis, G
    Bambos, N
    PERFORMANCE EVALUATION, 2001, 46 (2-3) : 219 - 233
  • [29] Dynamic Task Scheduling Scheme for a GPGPU Programming Framework
    Ohno, Kazuhiko
    Yamamoto, Rei
    PROCEEDINGS OF 2015 THIRD INTERNATIONAL SYMPOSIUM ON COMPUTING AND NETWORKING (CANDAR), 2015, : 181 - 187
  • [30] Design task planning and dynamic scheduling to Concurrent Engineering
    Jiang, Gongliang
    DCABES 2006 PROCEEDINGS, VOLS 1 AND 2, 2006, : 1045 - 1047