Leveraging Coding Techniques for Speeding up Distributed Computing

被引:0
|
作者
Konstantinidis, Konstantinos [1 ]
Ramamoorthy, Aditya [1 ]
机构
[1] Iowa State Univ, Dept Elect & Comp Engn, Ames, IA 50010 USA
基金
美国国家科学基金会;
关键词
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Large scale clusters running MapReduce, Spark etc. routinely process data that are on the orders of petabytes or more. The philosophy in these methods is to split the overall job into smaller tasks that are executed on different servers; this is called the map phase. This is followed by a data shuffling phase where appropriate data is exchanged between the servers. The final reduce phase, completes the computation. Prior work has explored a mechanism for reducing the overall execution time by operating on a computation vs. communication tradeoff. Specifically, the idea is to run redundant copies of map tasks that are placed on judiciously chosen servers. The shuffle phase exploits the location of the nodes and utilizes coded transmission. The main drawback of this approach is that it requires the original job to be split into a number of map tasks that grows exponentially in the system parameters. This is problematic, as we demonstrate that splitting jobs too finely can in fact adversely affect the overall execution time. In this work we show that one can simultaneously obtain low communication loads while ensuring that jobs do not need to be split too finely. Our approach uncovers a deep relationship between this problem and a class of combinatorial structures called resolvable designs. We present experimental results obtained on Amazon EC2 clusters for a widely known distributed algorithm, namely TeraSort. We obtain over 4.69x improvement in speedup over the baseline approach and more than 2.6x over current state of the art.
引用
收藏
页数:6
相关论文
共 50 条
  • [41] Speeding Up HOG and LBP Features for Pedestrian Detection by Multiresolution Techniques
    Geismann, Philip
    Knoll, Alois
    ADVANCES IN VISUAL COMPUTING, PT I, 2010, 6453 : 243 - 252
  • [42] Speeding Up Emulation-Based Diagnosis Techniques for Logic Cores
    Lu, Shyue-Kung
    Chen, Yin-Mou
    Huang, Shi-Yu
    Wu, Cheng-Wen
    IEEE DESIGN & TEST OF COMPUTERS, 2011, 28 (04): : 88 - 97
  • [43] New techniques for speeding-up fault-injection campaigns
    Berrojo, L
    González, I
    Corno, F
    Reorda, MS
    Squillero, G
    Entrena, L
    Lopez, C
    DESIGN, AUTOMATION AND TEST IN EUROPE CONFERENCE AND EXHIBITION, 2002 PROCEEDINGS, 2002, : 847 - 852
  • [44] Novel distributed computing techniques for mobile telecommunications
    Khaddaj, Souheil
    Makoond, Bippin
    Ong, David C. C.
    Oudrhiri, Radouane
    DCABES 2007 Proceedings, Vols I and II, 2007, : 202 - 206
  • [45] Combined Distributed Turbo Coding and Space Frequency Block Coding Techniques
    V Bota
    Zs A Polgar
    A Silva
    S Teodoro
    MP Stef
    A Moço
    A Botos
    A Gameiro
    EURASIP Journal on Wireless Communications and Networking, 2010
  • [46] Distributed computing techniques for wireless messaging systems
    Makoond, B.
    Khaddaj, S.
    Ong, D. C. C.
    Oudrhiri, R.
    Tunnicliffe, M.
    JOURNAL OF ALGORITHMS & COMPUTATIONAL TECHNOLOGY, 2008, 2 (03) : 429 - 446
  • [47] Speeding-up the hybrid video watermarking techniques in the DWT domain
    Chammem, A.
    Mitrea, M.
    Preteux, F.
    WAVELET APPLICATIONS IN INDUSTRIAL PROCESSING VII, 2010, 7535
  • [48] Combined Distributed Turbo Coding and Space Frequency Block Coding Techniques
    Bota, V.
    Polgar, Zs. A.
    Silva, A.
    Teodoro, S.
    Stef, M. P.
    Moco, A.
    Botos, A.
    Gameiro, A.
    EURASIP JOURNAL ON WIRELESS COMMUNICATIONS AND NETWORKING, 2010,
  • [49] Asynchronous Optimization Techniques for Distributed Computing Applications
    Reisch, Raphael-Elias
    Weber, Jens
    Laroque, Christoph
    Schroeder, Christian
    48TH ANNUAL SIMULATION SYMPOSIUM (ANSS 2015), 2015, : 49 - 56
  • [50] SPEEDING UP KRYLOV SUBSPACE METHODS FOR COMPUTING f(A)b VIA RANDOMIZATION
    Cortinovis, Alice
    Kressner, Daniel
    Nakatsukasa, Yuji
    SIAM JOURNAL ON MATRIX ANALYSIS AND APPLICATIONS, 2024, 45 (01) : 619 - 633