Auto-scaling of Web Applications in Clouds: A Tail Latency Evaluation

被引:6
|
作者
Aslanpour, Mohammad S. [1 ,2 ]
Toosi, Adel N. [1 ]
Gaire, Raj [2 ]
Cheema, Muhammad Aamir [1 ]
机构
[1] Monash Univ, Fac Informat Technol, Clayton, Vic, Australia
[2] CSIROs Data61, Canberra, ACT, Australia
关键词
cloud computing; auto-scaling; tail latency; resource provisioning; performance evaluation;
D O I
10.1109/UCC48980.2020.00037
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Mechanisms for dynamically adding and removing Virtual Machines (VMs) to reduce cost while minimizing the latency are called auto-scaling. Latency improvements are mainly fulfilled through minimizing the "average" response times while unpredictabilities and fluctuations of the Web applications, aka flash crowds, can result in very high latencies for users' requests. Requests influenced by flash crowd suffer from long latencies, known as outliers. Such outliers are inevitable to a large extent as auto-scaling solutions continue to improve the average, not the "tail" of latencies. In this paper, we study possible sources of tail latency in auto-scaling mechanisms for Web applications. Based on our extensive evaluations in a real cloud platform, we discovered sources of a tail latency as 1) large requests, i.e. those data-intensive; 2) long-term scaling intervals; 3) instant analysis of scaling parameters; 4) conservative, i.e. tight, threshold tuning; 5) load-unaware surplus VM selection policies used for executing a scale-down decision; 6) cooldown feature, although cost-effective; and 7) VM start-up delay. We also discovered that after improving the average latency by auto-scaling mechanisms, the tail may behave differently, demanding dedicated tail-aware solutions for auto-scaling mechanisms.
引用
收藏
页码:186 / 195
页数:10
相关论文
共 50 条
  • [31] Coordination Pattern-Based Approach for Auto-Scaling in Multi-Clouds
    Kuehn, Eva
    Crass, Stefan
    2018 32ND INTERNATIONAL CONFERENCE ON ADVANCED INFORMATION NETWORKING AND APPLICATIONS WORKSHOPS (WAINA), 2018, : 368 - 373
  • [32] A reliable and cost-efficient auto-scaling system for web applications using heterogeneous spot instances
    Qu, Chenhao
    Calheiros, Rodrigo N.
    Buyya, Rajkumar
    JOURNAL OF NETWORK AND COMPUTER APPLICATIONS, 2016, 65 : 167 - 180
  • [33] Machine learning-based auto-scaling for containerized applications
    Mahmoud Imdoukh
    Imtiaz Ahmad
    Mohammad Gh. Alfailakawi
    Neural Computing and Applications, 2020, 32 : 9745 - 9760
  • [34] Faa$T: A Transparent Auto-Scaling Cache for Serverless Applications
    Romero, Francisco
    Chaudhry, Gohar Irfan
    Goiri, Inigo
    Gopa, Pragna
    Batum, Paul
    Yadwadkar, Neeraja J.
    Fonseca, Rodrigo
    Kozyrakis, Christos
    Bianchini, Ricardo
    PROCEEDINGS OF THE 2021 ACM SYMPOSIUM ON CLOUD COMPUTING (SOCC '21), 2021, : 122 - 137
  • [35] A Review of Auto-scaling Techniques for Elastic Applications in Cloud Environments
    Tania Lorido-Botran
    Jose Miguel-Alonso
    Jose A. Lozano
    Journal of Grid Computing, 2014, 12 : 559 - 592
  • [36] Predictive Container Auto-Scaling for Cloud-Native Applications
    Zhao, Hanqing
    Lim, Hyunwoo
    Hanif, Muhammad
    Lee, Choonhwa
    2019 10TH INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATION TECHNOLOGY CONVERGENCE (ICTC): ICT CONVERGENCE LEADING THE AUTONOMOUS FUTURE, 2019, : 1280 - 1282
  • [37] Optimal cloud resource provisioning for auto-scaling enterprise applications
    Srirama S.N.
    Ostovar A.
    Srirama, Satish Narayana (srirama@ut.ee), 2018, Inderscience Publishers (07) : 129 - 162
  • [38] A Review of Auto-scaling Techniques for Elastic Applications in Cloud Environments
    Lorido-Botran, Tania
    Miguel-Alonso, Jose
    Lozano, Jose A.
    JOURNAL OF GRID COMPUTING, 2014, 12 (04) : 559 - 592
  • [39] A cost-aware auto-scaling approach using the workload prediction in service clouds
    Jingqi Yang
    Chuanchang Liu
    Yanlei Shang
    Bo Cheng
    Zexiang Mao
    Chunhong Liu
    Lisha Niu
    Junliang Chen
    Information Systems Frontiers, 2014, 16 : 7 - 18
  • [40] A proactive auto-scaling scheme with latency guarantees for multi-tenant NFV cloud
    Hu, Guangwu
    Li, Qing
    Ai, Shuo
    Chen, Tan
    Duan, Jingpu
    Wu, Yu
    COMPUTER NETWORKS, 2020, 181