Speeding up Distributed Request-Response Workflows

被引:74
|
作者
Jalaparti, Virajith
Bodik, Peter [1 ]
Kandula, Srikanth [1 ]
Menache, Ishai [1 ]
Rybalkin, Mikhail
Yan, Chenyu [1 ]
机构
[1] Microsoft Corp, Redmond, WA 98052 USA
关键词
Interactive services; Tail latency; Optimization; Reissues; Partial results;
D O I
10.1145/2534169.2486028
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We found that interactive services at Bing have highly variable datacenter-side processing latencies because their processing consists of many sequential stages, parallelization across 10s-1000s of servers and aggregation of responses across the network. To improve the tail latency of such services, we use a few building blocks: reissuing laggards elsewhere in the cluster, new policies to return incomplete results and speeding up laggards by giving them more resources. Combining these building blocks to reduce the overall latency is non-trivial because for the same amount of resource (e.g., number of reissues), different stages improve their latency by different amounts. We present Kwiken, a framework that takes an end-to-end view of latency improvements and costs. It decomposes the problem of minimizing latency over a general processing DAG into a manageable optimization over individual stages. Through simulations with production traces, we show sizable gains; the 99th percentile of latency improves by over 50% when just 0.1% of the responses are allowed to have partial results and by over 40% for 25% of the services when just 5% extra resources are used for reissues.
引用
收藏
页码:219 / 230
页数:12
相关论文
共 50 条
  • [1] Request-Response Distributed Power Management in Cloud Data Centers
    Li, Jianxiang
    Zhang, Youchun
    JOURNAL OF INTELLIGENT SYSTEMS, 2013, 22 (04) : 437 - 451
  • [2] The Complexity of Request-Response Games
    Chatterjee, Krishnendu
    Henzinger, Thomas A.
    Horn, Florian
    LANGUAGE AND AUTOMATA THEORY AND APPLICATIONS, 2011, 6638 : 227 - 237
  • [3] Behavioural Contracts with Request-Response Operations
    Acciai, Lucia
    Boreale, Michele
    Zavattaro, Gianluigi
    COORDINATION MODELS AND LANGUAGES, PROCEEDINGS, 2010, 6116 : 16 - +
  • [4] A Taxonomy of Reliable Request-Response Protocols
    Ivaki, Naghmeh
    Laranjeiro, Nuno
    Araujo, Filipe
    30TH ANNUAL ACM SYMPOSIUM ON APPLIED COMPUTING, VOLS I AND II, 2015, : 456 - 463
  • [5] NETWORK MEASUREMENT OF THE VMTP REQUEST-RESPONSE PROTOCOL IN THE V DISTRIBUTED SYSTEM.
    Cheriton, David R.
    Williamson, Carey L.
    Performance Evaluation Review, 1987, 15 (01): : 216 - 225
  • [6] Behavioural contracts with request-response operations
    Acciai, Lucia
    Boreale, Michele
    Zavattaro, Gianluigi
    SCIENCE OF COMPUTER PROGRAMMING, 2013, 78 (02) : 248 - 267
  • [7] The performance of a reliable, request-response transport protocol
    Salehi, N
    Obraczka, K
    Neuman, C
    IEEE INTERNATIONAL SYMPOSIUM ON COMPUTERS AND COMMUNICATIONS, PROCEEDINGS, 1999, : 102 - 108
  • [8] OPTIMAL STRATEGY SYNTHESIS FOR REQUEST-RESPONSE GAMES
    Horn, Florian
    Thomas, Wolfgang
    Wallmeier, Nico
    Zimmermann, Martin
    RAIRO-THEORETICAL INFORMATICS AND APPLICATIONS, 2015, 49 (03): : 179 - 203
  • [9] Optimal Strategy Synthesis in Request-Response Games
    Horn, Florian
    Thomas, Wolfgang
    Wallmeier, Nico
    AUTOMATED TECHNOLOGY FOR VERIFICATION AND ANALYSIS, PROCEEDINGS, 2008, 5311 : 361 - +
  • [10] Graceful Interruption of Request-Response Service Interactions
    Preda, Mila Dalla
    Gabbrielli, Maurizio
    Lanese, Ivan
    Mauro, Jacopo
    Zavattaro, Gianluigi
    SERVICE-ORIENTED COMPUTING, 2011, 7084 : 590 - 600