Stream Iterative Distributed Coded Computing for Learning Applications in Heterogeneous Systems

Cited by: 5
Authors
Esfahanizadeh, Homa [1]
Cohen, Alejandro [2]
Medard, Muriel [1]
Affiliations
[1] MIT, 77 Massachusetts Ave, Cambridge, MA 02139 USA
[2] Technion Israel Inst Technol, Haifa, Israel
Keywords
distributed systems; coded computation; heterogeneous; straggler; scheduling
DOI
10.1109/INFOCOM48880.2022.9796977
Chinese Library Classification number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
To improve the utility of learning applications and render machine learning solutions feasible for complex applications, a substantial amount of heavy computation is needed. Thus, it is essential to delegate the computations among several workers, which brings up the major challenge of coping with delays and failures caused by the system's heterogeneity and uncertainties. In particular, minimizing the end-to-end in-order execution delay of a job, from arrival to delivery, is of great importance for real-world delay-sensitive applications. In this paper, for the computation of each job iteration in a stochastic heterogeneous distributed system where the workers vary in their computing and communication capabilities, we present a novel joint scheduling-coding framework that optimally splits the coded computational load among the workers. This closes the gap between the workers' response times and is critical to maximizing resource utilization. To further reduce the in-order execution delay, we also incorporate redundant computations in each iteration of a distributed computational job. Our simulation results demonstrate that the delay obtained using the proposed solution is dramatically lower than that of the uniform split, which is oblivious to the system's heterogeneity, and, in fact, is very close to an ideal lower bound just by introducing a small percentage of redundant computations.
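The contrast the abstract draws between a heterogeneity-aware split and a uniform split can be illustrated with a small sketch. This is not the paper's framework: it assumes a deterministic model in which each worker has a fixed processing rate, whereas the paper considers a stochastic system. The function names, the `redundancy` parameter, and the example rates are all hypothetical.

```python
from typing import List


def coded_load(total_load: float, redundancy: float) -> float:
    """Total load after adding a fraction of redundant computations (assumed model)."""
    return total_load * (1.0 + redundancy)


def proportional_split(total_load: float, rates: List[float],
                       redundancy: float = 0.0) -> List[float]:
    """Give each worker a share proportional to its (assumed fixed) rate."""
    if not rates or any(r <= 0 for r in rates):
        raise ValueError("rates must be non-empty and positive")
    load = coded_load(total_load, redundancy)
    total_rate = sum(rates)
    return [load * r / total_rate for r in rates]


def uniform_split(total_load: float, rates: List[float],
                  redundancy: float = 0.0) -> List[float]:
    """Give every worker the same share, ignoring heterogeneity."""
    if not rates:
        raise ValueError("rates must be non-empty")
    load = coded_load(total_load, redundancy)
    return [load / len(rates)] * len(rates)


def finish_times(loads: List[float], rates: List[float]) -> List[float]:
    """Deterministic finish time of each worker: assigned load divided by rate."""
    return [load / rate for load, rate in zip(loads, rates)]


# Hypothetical example: three workers with rates 1, 2, and 5 units per second.
rates = [1.0, 2.0, 5.0]
total = 80.0
prop_times = finish_times(proportional_split(total, rates), rates)
unif_times = finish_times(uniform_split(total, rates), rates)
print("proportional:", prop_times)
print("uniform:     ", unif_times)
```

Under this simplified model the proportional split makes all workers finish at the same time, while the uniform split leaves the slowest worker as the bottleneck. Setting `redundancy` above zero increases every worker's share, which in a stochastic setting is what would let the job complete without waiting for every worker.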
Pages: 230-239
Page count: 10