Leveraging Coding Techniques for Speeding up Distributed Computing

被引:0
|
作者
Konstantinidis, Konstantinos [1 ]
Ramamoorthy, Aditya [1 ]
机构
[1] Iowa State Univ, Dept Elect & Comp Engn, Ames, IA 50010 USA
基金
美国国家科学基金会;
关键词
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Large scale clusters running MapReduce, Spark etc. routinely process data that are on the orders of petabytes or more. The philosophy in these methods is to split the overall job into smaller tasks that are executed on different servers; this is called the map phase. This is followed by a data shuffling phase where appropriate data is exchanged between the servers. The final reduce phase, completes the computation. Prior work has explored a mechanism for reducing the overall execution time by operating on a computation vs. communication tradeoff. Specifically, the idea is to run redundant copies of map tasks that are placed on judiciously chosen servers. The shuffle phase exploits the location of the nodes and utilizes coded transmission. The main drawback of this approach is that it requires the original job to be split into a number of map tasks that grows exponentially in the system parameters. This is problematic, as we demonstrate that splitting jobs too finely can in fact adversely affect the overall execution time. In this work we show that one can simultaneously obtain low communication loads while ensuring that jobs do not need to be split too finely. Our approach uncovers a deep relationship between this problem and a class of combinatorial structures called resolvable designs. We present experimental results obtained on Amazon EC2 clusters for a widely known distributed algorithm, namely TeraSort. We obtain over 4.69x improvement in speedup over the baseline approach and more than 2.6x over current state of the art.
引用
收藏
页数:6
相关论文
共 50 条
  • [1] Leveraging Coding Techniques and Redundancy for Speeding Up Distributed Computing and Robustifying Distributed Learning
    Konstantinidis, Konstantinos
    ProQuest Dissertations and Theses Global, 2022,
  • [2] Resolvable Designs for Speeding Up Distributed Computing
    Konstantinidis, Konstantinos
    Ramamoorthy, Aditya
    IEEE-ACM TRANSACTIONS ON NETWORKING, 2020, 28 (04) : 1657 - 1670
  • [3] REPLICATION TECHNIQUES FOR SPEEDING UP PARALLEL APPLICATIONS ON DISTRIBUTED SYSTEMS
    BAL, HE
    KAASHOEK, MF
    TANENBAUM, AS
    JANSEN, J
    CONCURRENCY-PRACTICE AND EXPERIENCE, 1992, 4 (05): : 337 - 355
  • [4] DISTRIBUTED COMPUTING AND MODELING & SIMULATION: SPEEDING UP SIMULATIONS AND CREATING LARGE MODELS
    Taylor, Simon J. E.
    Ghorbani, Mohammadmersad
    Mustafee, Navonil
    Turner, Stephen J.
    Kiss, Tamas
    Farkas, Daniel
    Kite, Shane
    Strassburger, Steffen
    PROCEEDINGS OF THE 2011 WINTER SIMULATION CONFERENCE (WSC), 2011, : 161 - 175
  • [5] Optimization Techniques for a Distributed In-Memory Computing Platform by Leveraging SSD
    Choi, June
    Lee, Jaehyun
    Kim, Jik-Soo
    Lee, Jaehwan
    APPLIED SCIENCES-BASEL, 2021, 11 (18):
  • [6] Speeding Up in Distributed SystemC Simulations
    Galiano, V.
    Migallon, H.
    Perez-Caparros, D.
    Palomino, J. A.
    Martinez, M.
    INTERNATIONAL SYMPOSIUM ON DISTRIBUTED COMPUTING AND ARTIFICIAL INTELLIGENCE 2008, 2009, 50 : 24 - +
  • [7] THE SCIENCE OF COMPUTING - SPEEDING UP PARALLEL PROCESSING
    DENNING, PJ
    AMERICAN SCIENTIST, 1988, 76 (04) : 347 - 349
  • [8] Speeding decompression of multimedia information through distributed computing
    Moonian, O
    Bhurtun, C
    INTERNATIONAL JOURNAL OF ELECTRICAL ENGINEERING EDUCATION, 1999, 36 (03) : 222 - 228
  • [9] Coding for Distributed Fog Computing
    Li, Songze
    Maddah-Ali, Mohammad Ali
    Avestimehr, A. Salman
    IEEE COMMUNICATIONS MAGAZINE, 2017, 55 (04) : 34 - 40
  • [10] Hierarchical Coding for Distributed Computing
    Park, Hyegyeong
    Lee, Kangwook
    Sohn, Jy-yong
    Suh, Changho
    Moon, Jaekyun
    2018 IEEE INTERNATIONAL SYMPOSIUM ON INFORMATION THEORY (ISIT), 2018, : 1630 - 1634