Stream Distributed Coded Computing

Cited by: 4
Authors
Cohen A. [1]
Thiran G. [2]
Esfahanizadeh H. [1]
Medard M. [1]
Affiliations
[1] Research Laboratory of Electronics, Massachusetts Institute of Technology, Cambridge, MA 02139
[2] ICTEAM, Université Catholique de Louvain, Louvain-la-Neuve
Keywords
Distributed coded computation; in-order execution delay; large matrix-matrix multiplication; large matrix-vector multiplication; queuing theory; stragglers; ultra-reliable low-latency
DOI
10.1109/JSAIT.2021.3102279
Abstract
Emerging large-scale, data-hungry algorithms require computations to be delegated from a central server to several worker nodes. One major challenge in distributed computation is to tackle delays and failures caused by stragglers. To address this challenge, introducing an efficient amount of redundant computation via distributed coded computation has received significant attention. Recent approaches in this area have mainly focused on introducing minimal computational redundancy to tolerate a certain number of stragglers. To the best of our knowledge, the current literature lacks a unified end-to-end design in a heterogeneous setting where the workers can vary in their computation and communication capabilities. The contribution of this paper is a novel framework for joint scheduling-coding, in a setting where the workers and the arrival of stream computational jobs are based on stochastic models. In our initial joint scheme, we propose a systematic framework that illustrates how to select a set of workers and how to split the computational load among the selected workers, based on their differences, in order to minimize the average in-order job execution delay. Through simulations, we demonstrate that our framework performs dramatically better than a naive method that splits the computational load uniformly among the workers, and that its performance is close to ideal. © 2020 IEEE.
Pages: 1025-1040
Number of pages: 15
Related papers
50 in total
  • [41] On Batch-Processing Based Coded Computing for Heterogeneous Distributed Computing Systems
    Wang, Baoqian
    Xie, Junfei
    Lu, Kejie
    Wan, Yan
    Fu, Shengli
    IEEE TRANSACTIONS ON NETWORK SCIENCE AND ENGINEERING, 2021, 8 (03) : 2438 - 2454
  • [42] A Combinatorial Design for Cascaded Coded Distributed Computing on General Networks
    Woolsey, Nicholas
    Chen, Rong-Rong
    Ji, Mingyue
    IEEE TRANSACTIONS ON COMMUNICATIONS, 2021, 69 (09) : 5686 - 5700
  • [43] Coded Distributed Computing for Hierarchical Multi-task Learning
    Hu, Haoyang
    Li, Songze
    Cheng, Minquan
    Wu, Youlong
    2023 IEEE INFORMATION THEORY WORKSHOP, ITW, 2023, : 480 - 485
  • [44] Coded Parallel Transmission for Half-Duplex Distributed Computing
    Zai, Qixuan
    Yuan, Kai
    Wu, Youlong
    INFORMATION, 2022, 13 (07)
  • [45] A Double Auction Mechanism for Coded Distributed Computing in Smart Vehicles
    Ng, Jer Shyuan
    Lim, Wei Yang Bryan
    Xiong, Zehui
    Garg, Sahil
    Zhang, Yang
    Niyato, Dusit
    Leung, Cyril
    PROCEEDINGS OF THE 18TH INTERNATIONAL CONFERENCE ON WIRELESS NETWORKS AND MOBILE SYSTEMS (WINSYS), 2021, : 107 - 114
  • [46] Efficient Construction of Encoding Polynomials in a Distributed Coded Computing Scheme
    Hibino, Daisuke
    Shibuya, Tomoharu
    IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 2024, E107A (03) : 476 - 485
  • [47] Coded Computing and Cooperative Transmission for Wireless Distributed Matrix Multiplication
    Li, Kuikui
    Tao, Meixia
    Zhang, Jingjing
    Simeone, Osvaldo
    IEEE TRANSACTIONS ON COMMUNICATIONS, 2021, 69 (04) : 2224 - 2239
  • [48] Learning Auction in Coded Distributed Computing with Heterogeneous User Demands
    Liang, Jiawei
    Li, Juan
    Zhu, Kun
    Yi, Changyan
    2022 27TH IEEE SYMPOSIUM ON COMPUTERS AND COMMUNICATIONS (IEEE ISCC 2022), 2022,
  • [49] Heterogeneous Coded Distributed Computing with Nonuniform Input File Popularity
    Deng, Yong
    Dong, Min
    IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC 2022), 2022, : 1936 - 1941
  • [50] Cascaded Coded Distributed Computing Schemes Based on Symmetric Designs
    Jiang, Jing
    Wang, Wenhan
    Zhou, Lingling
    IEEE TRANSACTIONS ON COMMUNICATIONS, 2022, 70 (11) : 7179 - 7190