Speeding up Distributed Request-Response Workflows

被引:74
|
作者
Jalaparti, Virajith
Bodik, Peter [1 ]
Kandula, Srikanth [1 ]
Menache, Ishai [1 ]
Rybalkin, Mikhail
Yan, Chenyu [1 ]
机构
[1] Microsoft Corp, Redmond, WA 98052 USA
关键词
Interactive services; Tail latency; Optimization; Reissues; Partial results;
D O I
10.1145/2534169.2486028
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We found that interactive services at Bing have highly variable datacenter-side processing latencies because their processing consists of many sequential stages, parallelization across 10s-1000s of servers and aggregation of responses across the network. To improve the tail latency of such services, we use a few building blocks: reissuing laggards elsewhere in the cluster, new policies to return incomplete results and speeding up laggards by giving them more resources. Combining these building blocks to reduce the overall latency is non-trivial because for the same amount of resource (e.g., number of reissues), different stages improve their latency by different amounts. We present Kwiken, a framework that takes an end-to-end view of latency improvements and costs. It decomposes the problem of minimizing latency over a general processing DAG into a manageable optimization over individual stages. Through simulations with production traces, we show sizable gains; the 99th percentile of latency improves by over 50% when just 0.1% of the responses are allowed to have partial results and by over 40% for 25% of the services when just 5% extra resources are used for reissues.
引用
收藏
页码:219 / 230
页数:12
相关论文
共 50 条
  • [41] Request-Response and Censoring-Based Energy-Efficient Decentralized Change-Point Detection With IoT Applications
    Gu, Yuantao
    Jiao, Yuchen
    Xu, Xingyu
    Yu, Quan
    IEEE INTERNET OF THINGS JOURNAL, 2021, 8 (08) : 6771 - 6788
  • [42] Speeding up
    Thomas, S.
    Chemical Engineer, 2001, (722):
  • [43] Speeding up
    Thomas, S
    TCE, 2001, (722): : 18 - 18
  • [44] DistrEdge: Speeding up Convolutional Neural Network Inference on Distributed Edge Devices
    Hou, Xueyu
    Guan, Yongjie
    Han, Tao
    Zhang, Ning
    2022 IEEE 36TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM (IPDPS 2022), 2022, : 1097 - 1107
  • [45] Speeding-up dynamic response of active power conditioners
    da Silva, LEB
    de Oliveira, LED
    da Silva, VF
    Torres, GL
    Bonaldi, EL
    Rossi, R
    CCECE 2003: CANADIAN CONFERENCE ON ELECTRICAL AND COMPUTER ENGINEERING, VOLS 1-3, PROCEEDINGS: TOWARD A CARING AND HUMANE TECHNOLOGY, 2003, : 347 - 350
  • [46] Speeding Up Distributed Gradient Descent by Utilizing Non-persistent Stragglers
    Ozfatura, Emre
    Gunduz, Deniz
    Ulukus, Sennur
    2019 IEEE INTERNATIONAL SYMPOSIUM ON INFORMATION THEORY (ISIT), 2019, : 2729 - 2733
  • [47] SPEEDING UP LATENT SEMANTIC ANALYSIS A Streamed Distributed Algorithm for SVD Updates
    Rehurek, Radim
    ICAART 2011: PROCEEDINGS OF THE 3RD INTERNATIONAL CONFERENCE ON AGENTS AND ARTIFICIAL INTELLIGENCE, VOL 1, 2011, : 446 - 451
  • [48] Distributed execution of workflows
    Navas-Delgado, Ismael
    Aldana-Montes, Jose F.
    Trelles, Oswaldo
    COMPUTATIONAL SCIENCE - ICCS 2006, PT 3, PROCEEDINGS, 2006, 3993 : 936 - 939
  • [49] Distributed workflows with Jupyter
    Colonnelli, Iacopo
    Aldinucci, Marco
    Cantalupo, Barbara
    Padovani, Luca
    Rabellino, Sergio
    Spampinato, Concetto
    Morelli, Roberto
    Di Carlo, Rosario
    Magini, Nicolo
    Cavazzoni, Carlo
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2022, 128 : 282 - 298
  • [50] Speeding Up Private Distributed Matrix Multiplication via Bivariate Polynomial Codes
    Hasircioglu, Burak
    Gomez-Vilardebo, Jesus
    Gunduz, Deniz
    2021 IEEE INTERNATIONAL SYMPOSIUM ON INFORMATION THEORY (ISIT), 2021, : 1853 - 1858