Stream Distributed Coded Computing

Cited by: 4
Authors
Cohen A. [1]
Thiran G. [2]
Esfahanizadeh H. [1]
Medard M. [1]
Affiliations
[1] Research Laboratory of Electronics, Massachusetts Institute of Technology, Cambridge, MA 02139
[2] ICTEAM, Université Catholique de Louvain, Louvain-la-Neuve
Keywords
Distributed coded computation; in-order execution delay; large matrix-matrix multiplication; large matrix-vector multiplication; queuing theory; stragglers; ultra-reliable low-latency
DOI
10.1109/JSAIT.2021.3102279
Abstract
Emerging large-scale, data-hungry algorithms require computations to be delegated from a central server to several worker nodes. A major challenge in distributed computation is tackling the delays and failures caused by stragglers. To address this challenge, introducing an efficient amount of redundant computation via distributed coded computation has received significant attention. Recent approaches in this area have mainly focused on introducing minimum computational redundancy to tolerate a certain number of stragglers. To the best of our knowledge, the current literature lacks a unified end-to-end design in a heterogeneous setting where the workers can vary in their computation and communication capabilities. The contribution of this paper is a novel framework for joint scheduling-coding, in a setting where the workers and the arrival of stream computational jobs are based on stochastic models. In our initial joint scheme, we propose a systematic framework that illustrates how to select a set of workers and how to split the computational load among the selected workers, based on their differences, in order to minimize the average in-order job execution delay. Through simulations, we demonstrate that the performance of our framework is dramatically better than that of the naive method that splits the computational load uniformly among the workers, and that it is close to the ideal performance. © 2020 IEEE.
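The contrast the abstract draws between a uniform split and a split that accounts for worker differences can be illustrated with a toy calculation. The sketch below is not the paper's scheme: it assumes deterministic worker rates, no coding, no communication cost, and no queueing, and every name and number in it (`rates`, `total_rows`, the helper functions) is hypothetical. It only shows why splitting in proportion to speed shortens the time at which the slowest worker finishes.

```python
# Toy illustration only: deterministic rates, no coding, no queueing.
# All names and numbers here are hypothetical, not taken from the paper.

def uniform_split(total_rows, rates):
    """Give every worker the same share of the load."""
    n = len(rates)
    return [total_rows / n] * n

def proportional_split(total_rows, rates):
    """Give each worker a share proportional to its processing rate."""
    total_rate = sum(rates)
    return [total_rows * r / total_rate for r in rates]

def completion_time(loads, rates):
    """Without redundancy, the job ends when the slowest worker ends."""
    return max(load / rate for load, rate in zip(loads, rates))

rates = [10.0, 5.0, 1.0]   # assumed rows per second for three workers
total_rows = 1600          # assumed size of one matrix-vector job

t_uniform = completion_time(uniform_split(total_rows, rates), rates)
t_prop = completion_time(proportional_split(total_rows, rates), rates)

print(f"uniform split:      {t_uniform:.2f} s")
print(f"proportional split: {t_prop:.2f} s")
```

With these assumed rates, the uniform split is bottlenecked by the slowest worker, while the proportional split lets all three workers finish at the same time.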
Pages: 1025-1040
Page count: 15
Related papers
50 in total
  • [31] Coded Distributed Computing: Fundamental Limits and Practical Challenges
    Li, Songze
    Yu, Qian
    Maddah-Ali, Mohammad Ali
    Avestimehr, A. Salman
    2016 50TH ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS AND COMPUTERS, 2016, : 509 - 513
  • [32] Opportunistic Coded Distributed Computing: An Evolutionary Game Approach
    Han, Yue
    Niyato, Dusit
    Leung, Cyril
    Kim, Dong In
    IWCMC 2021: 2021 17TH INTERNATIONAL WIRELESS COMMUNICATIONS & MOBILE COMPUTING CONFERENCE (IWCMC), 2021, : 1430 - 1435
  • [33] Coded convolution for parallel and distributed computing within a deadline
    Dutta, Sanghamitra
    Cadambe, Viveck
    Grover, Pulkit
    2017 IEEE INTERNATIONAL SYMPOSIUM ON INFORMATION THEORY (ISIT), 2017, : 2403 - 2407
  • [34] Coded Distributed Computing: Straggling Servers and Multistage Dataflows
    Li, Songze
    Maddah-Ali, Mohammad Ali
    Avestimehr, A. Salman
    2016 54TH ANNUAL ALLERTON CONFERENCE ON COMMUNICATION, CONTROL, AND COMPUTING (ALLERTON), 2016, : 164 - 171
  • [35] A New Combinatorial Coded Design for Heterogeneous Distributed Computing
    Woolsey, Nicholas
    Chen, Rong-Rong
    Ji, Mingyue
    IEEE TRANSACTIONS ON COMMUNICATIONS, 2021, 69 (09) : 5672 - 5685
  • [36] Coded Distributed Computing for Sparse Functions With Structured Support
    Brunero, Federico
    Wan, Kai
    Caire, Giuseppe
    Elia, Petros
    2023 IEEE INFORMATION THEORY WORKSHOP, ITW, 2023, : 474 - 479
  • [37] Coded Wireless Distributed Computing With Packet Losses and Retransmissions
    Han, Dong-Jun
    Sohn, Jy-Yong
    Moon, Jaekyun
    IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2021, 20 (12) : 8204 - 8217
  • [38] MESSAGE TRANSPORT SYSTEM OF DISTRIBUTED STREAM COMPUTING
    Liu, Di
    Pan, Dong
    2014 11TH INTERNATIONAL COMPUTER CONFERENCE ON WAVELET ACTIVE MEDIA TECHNOLOGY AND INFORMATION PROCESSING (ICCWAMTIP), 2014, : 483 - 486
  • [39] A Distributed Computing Platform for Task Stream Processing
    Xing Weiyan
    Huang Wenqing
    Liu Dong
    Deng Youyi
    INFORMATION COMPUTING AND APPLICATIONS, ICICA 2013, PT I, 2013, 391 : 110 - +
  • [40] Coded Distributed Computing For Vehicular Edge Computing With Dual-Function Radar Communication
    Hoai Linh Nguyen Thi
    Hoang Le Hung
    Nguyen Cong Luong
    Tien Hoa Nguyen
    Xiao, Sa
    Tan, Junjie
    Niyato, Dusit
    2023 IEEE 98TH VEHICULAR TECHNOLOGY CONFERENCE, VTC2023-FALL, 2023,