Speeding up Distributed Request-Response Workflows

被引：74

作者：

Jalaparti, Virajith

Bodik, Peter ^{[1
]}

Kandula, Srikanth ^{[1
]}

Menache, Ishai ^{[1
]}

Rybalkin, Mikhail

Yan, Chenyu ^{[1
]}

机构：

[1] Microsoft Corp, Redmond, WA 98052 USA

来源：

ACM SIGCOMM COMPUTER COMMUNICATION REVIEW | 2013年 / 43卷 / 04期

关键词：

Interactive services; Tail latency; Optimization; Reissues; Partial results;

D O I：

10.1145/2534169.2486028

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

We found that interactive services at Bing have highly variable datacenter-side processing latencies because their processing consists of many sequential stages, parallelization across 10s-1000s of servers and aggregation of responses across the network. To improve the tail latency of such services, we use a few building blocks: reissuing laggards elsewhere in the cluster, new policies to return incomplete results and speeding up laggards by giving them more resources. Combining these building blocks to reduce the overall latency is non-trivial because for the same amount of resource (e.g., number of reissues), different stages improve their latency by different amounts. We present Kwiken, a framework that takes an end-to-end view of latency improvements and costs. It decomposes the problem of minimizing latency over a general processing DAG into a manageable optimization over individual stages. Through simulations with production traces, we show sizable gains; the 99th percentile of latency improves by over 50% when just 0.1% of the responses are allowed to have partial results and by over 40% for 25% of the services when just 5% extra resources are used for reissues.

引用

页码：219 / 230

页数：12

共 50 条

[41] Request-Response and Censoring-Based Energy-Efficient Decentralized Change-Point Detection With IoT Applications
Gu, Yuantao
Jiao, Yuchen
Xu, Xingyu
Yu, Quan
IEEE INTERNET OF THINGS JOURNAL, 2021, 8 (08) : 6771 - 6788
[42] Speeding up
Thomas, S.
Chemical Engineer, 2001, (722):
[43] Speeding up
Thomas, S
TCE, 2001, (722): : 18 - 18
[44] DistrEdge: Speeding up Convolutional Neural Network Inference on Distributed Edge Devices
Hou, Xueyu
Guan, Yongjie
Han, Tao
Zhang, Ning
2022 IEEE 36TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM (IPDPS 2022), 2022, : 1097 - 1107
[45] Speeding-up dynamic response of active power conditioners
da Silva, LEB
de Oliveira, LED
da Silva, VF
Torres, GL
Bonaldi, EL
Rossi, R
CCECE 2003: CANADIAN CONFERENCE ON ELECTRICAL AND COMPUTER ENGINEERING, VOLS 1-3, PROCEEDINGS: TOWARD A CARING AND HUMANE TECHNOLOGY, 2003, : 347 - 350
[46] Speeding Up Distributed Gradient Descent by Utilizing Non-persistent Stragglers
Ozfatura, Emre
Gunduz, Deniz
Ulukus, Sennur
2019 IEEE INTERNATIONAL SYMPOSIUM ON INFORMATION THEORY (ISIT), 2019, : 2729 - 2733
[47] SPEEDING UP LATENT SEMANTIC ANALYSIS A Streamed Distributed Algorithm for SVD Updates
Rehurek, Radim
ICAART 2011: PROCEEDINGS OF THE 3RD INTERNATIONAL CONFERENCE ON AGENTS AND ARTIFICIAL INTELLIGENCE, VOL 1, 2011, : 446 - 451
[48] Distributed execution of workflows
Navas-Delgado, Ismael
Aldana-Montes, Jose F.
Trelles, Oswaldo
COMPUTATIONAL SCIENCE - ICCS 2006, PT 3, PROCEEDINGS, 2006, 3993 : 936 - 939
[49] Distributed workflows with Jupyter
Colonnelli, Iacopo
Aldinucci, Marco
Cantalupo, Barbara
Padovani, Luca
Rabellino, Sergio
Spampinato, Concetto
Morelli, Roberto
Di Carlo, Rosario
Magini, Nicolo
Cavazzoni, Carlo
FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2022, 128 : 282 - 298
[50] Speeding Up Private Distributed Matrix Multiplication via Bivariate Polynomial Codes
Hasircioglu, Burak
Gomez-Vilardebo, Jesus
Gunduz, Deniz
2021 IEEE INTERNATIONAL SYMPOSIUM ON INFORMATION THEORY (ISIT), 2021, : 1853 - 1858

← 1 2 3 4 5 →