Dtree: Dynamic Task Scheduling at Petascale

被引:2
|
作者
Pamnany, Kiran [1 ]
Misra, Sanchit [1 ]
Md, Vasimuddin [2 ]
Liu, Xing [3 ]
Chow, Edmond [4 ]
Aluru, Srinivas [4 ]
机构
[1] Intel Corp, Parallel Comp Lab, Bangalore, Karnataka, India
[2] Indian Inst Technol, Dept Comp Sci & Engn, Bombay, Maharashtra, India
[3] IBM TJ Watson Res Ctr, Yorktown Hts, NY USA
[4] Georgia Inst Technol, Sch Computat Sci & Engn, Atlanta, GA 30332 USA
关键词
Petascale; Dynamic scheduling; Load balance;
D O I
10.1007/978-3-319-20119-1_10
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Irregular applications are challenging to scale on supercomputers due to the difficulty of balancing load across large numbers of nodes. This challenge is exacerbated by the increasing heterogeneity of modern supercomputers in which nodes often contain multiple processors and coprocessors operating at different speeds, and with differing core and thread counts. We present Dtree, a dynamic task scheduler designed to address this challenge. Dtree shows close to optimal results for a class of HPC applications, improving time-to-solution by achieving near-perfect load balance while consuming negligible resources. We demonstrate Dtree's effectiveness on up to 77,824 heterogeneous cores of the TACC Stampede supercomputer with two different petascale HPC applications: ParaBLe, which performs large-scale Bayesian network structure learning, and GTFock, which implements Fock matrix construction, an essential and expensive step in quantum chemistry codes. For ParaBLe, we show improved performance while eliminating the complexity of managing heterogeneity. For GTFock, we match the most recently published performance without using any application-specific optimizations for data access patterns (such as the task distribution design for communication reduction) that enabled that performance. We also show that Dtree can distribute from tens of thousands to hundreds of millions of irregular tasks across up to 1024 nodes with minimal overhead, while balancing load to within 2% of optimal.
引用
收藏
页码:122 / 138
页数:17
相关论文
共 50 条
  • [1] Eager scheduling with lazy retry for dynamic task scheduling
    Chen, Huey-Ling
    King, Chung-Ta
    Lecture Notes in Computer Science, 1124
  • [2] A novel algorithm for dynamic task scheduling
    Nayak, Sasmita Kumari
    Padhy, Sasmita Kumari
    Panigrahi, Siba Prasada
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2012, 28 (05): : 709 - 717
  • [3] The dynamic task scheduling in the multiprocessor system
    Li, Y
    Chen, C
    PROCEEDINGS OF THE 3RD WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION, VOLS 1-5, 2000, : 3558 - 3563
  • [4] Dynamic task scheduling of the grid workflow
    Gao, LW
    Yao, YW
    ICESS 2005: SECOND INTERNATIONAL CONFERENCE ON EMBEDDED SOFTWARE AND SYSTEMS, 2005, : 507 - 512
  • [5] A Parallelized Dynamic Task Scheduling for Batch of Task in a computational grid
    Sheikh S.
    Nagaraju A.
    Shahid M.
    International Journal of Computers and Applications, 2019, 41 (01) : 38 - 52
  • [6] Bounded Memory Scheduling of Dynamic Task Graphs
    Sbirlea, Dragos
    Budimlic, Zoran
    Sarkar, Vivek
    PROCEEDINGS OF THE 23RD INTERNATIONAL CONFERENCE ON PARALLEL ARCHITECTURES AND COMPILATION TECHNIQUES (PACT'14), 2014, : 343 - 355
  • [7] Adaptive Scheduling of Task Graphs with Dynamic Resilience
    Hu, Menglan
    Luo, Jun
    Wang, Yang
    Veeravalli, Bharadwaj
    IEEE TRANSACTIONS ON COMPUTERS, 2017, 66 (01) : 17 - 23
  • [8] Dynamic task scheduling using online optimization
    Hamidzadeh, B
    Kit, LY
    Lilja, DJ
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2000, 11 (11) : 1151 - 1163
  • [9] A DISTRIBUTED ALGORITHM FOR DYNAMIC TASK-SCHEDULING
    ROST, J
    MAEHLE, E
    LECTURE NOTES IN COMPUTER SCIENCE, 1990, 457 : 628 - 639
  • [10] A Dynamic Priority Task Scheduling Algorithm For Mpsoc
    Shen, Yang
    Qi, Deyu
    2018 3RD INTERNATIONAL CONFERENCE ON SMART CITY AND SYSTEMS ENGINEERING (ICSCSE), 2018, : 813 - 818