Leveraging Coding Techniques for Speeding up Distributed Computing

被引:0
|
作者
Konstantinidis, Konstantinos [1 ]
Ramamoorthy, Aditya [1 ]
机构
[1] Iowa State Univ, Dept Elect & Comp Engn, Ames, IA 50010 USA
基金
美国国家科学基金会;
关键词
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Large scale clusters running MapReduce, Spark etc. routinely process data that are on the orders of petabytes or more. The philosophy in these methods is to split the overall job into smaller tasks that are executed on different servers; this is called the map phase. This is followed by a data shuffling phase where appropriate data is exchanged between the servers. The final reduce phase, completes the computation. Prior work has explored a mechanism for reducing the overall execution time by operating on a computation vs. communication tradeoff. Specifically, the idea is to run redundant copies of map tasks that are placed on judiciously chosen servers. The shuffle phase exploits the location of the nodes and utilizes coded transmission. The main drawback of this approach is that it requires the original job to be split into a number of map tasks that grows exponentially in the system parameters. This is problematic, as we demonstrate that splitting jobs too finely can in fact adversely affect the overall execution time. In this work we show that one can simultaneously obtain low communication loads while ensuring that jobs do not need to be split too finely. Our approach uncovers a deep relationship between this problem and a class of combinatorial structures called resolvable designs. We present experimental results obtained on Amazon EC2 clusters for a widely known distributed algorithm, namely TeraSort. We obtain over 4.69x improvement in speedup over the baseline approach and more than 2.6x over current state of the art.
引用
收藏
页数:6
相关论文
共 50 条
  • [31] Experiences in Speeding Up Computer Vision Applications on Mobile Computing Platforms
    Backes, Luna
    Rico, Alejandro
    Franke, Bjorn
    PROCEEDINGS INTERNATIONAL CONFERENCE ON EMBEDDED COMPUTER SYSTEMS - ARCHITECTURES, MODELING AND SIMULATION (SAMOS XV), 2015, : 1 - 8
  • [32] Leveraging software reengineering systems for heterogeneous distributed computing environments
    Chiang, CC
    INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING, PROCEEDINGS, 2000, : 254 - 261
  • [33] Environmental Sustainability Coding Techniques for Cloud Computing
    Ahmed, Shakeel
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2020, 11 (10) : 231 - 237
  • [34] Development and performance evaluation of a methodology, based on distributed computing, for speeding EnergyPlus simulation
    Garg, Vishal
    Chandrasen, Kshitij
    Mathur, Jyotirmay
    Tetali, Surekha
    Jawa, Akshey
    JOURNAL OF BUILDING PERFORMANCE SIMULATION, 2011, 4 (03) : 257 - 270
  • [35] Prophet: Speeding up Distributed DNN Training with Predictable Communication Scheduling
    Zhang, Zhenwei
    Qi, Qiang
    Shang, Ruitao
    Chen, Li
    Xu, Fei
    50TH INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING, 2021,
  • [36] Speeding up
    Thomas, S.
    Chemical Engineer, 2001, (722):
  • [37] Distributed Lossless Coding Techniques for Hyperspectral Images
    Zhang, Jinlei
    Li, Houqiang
    Chen, Chang Wen
    IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2015, 9 (06) : 977 - 989
  • [38] Coding for Distributed Fog Computing in Internet of Mobile Things
    Yue, Jing
    Xiao, Ming
    IEEE TRANSACTIONS ON MOBILE COMPUTING, 2021, 20 (04) : 1337 - 1350
  • [39] A Unified Coding Framework for Distributed Computing with Straggling Servers
    Li, Songze
    Maddah-Ali, Mohammad Ali
    Avestimehr, A. Salman
    2016 IEEE GLOBECOM WORKSHOPS (GC WKSHPS), 2016,
  • [40] Speeding up
    Thomas, S
    TCE, 2001, (722): : 18 - 18