Auto-scaling of Web Applications in Clouds: A Tail Latency Evaluation

被引:6
|
作者
Aslanpour, Mohammad S. [1 ,2 ]
Toosi, Adel N. [1 ]
Gaire, Raj [2 ]
Cheema, Muhammad Aamir [1 ]
机构
[1] Monash Univ, Fac Informat Technol, Clayton, Vic, Australia
[2] CSIROs Data61, Canberra, ACT, Australia
关键词
cloud computing; auto-scaling; tail latency; resource provisioning; performance evaluation;
D O I
10.1109/UCC48980.2020.00037
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Mechanisms for dynamically adding and removing Virtual Machines (VMs) to reduce cost while minimizing the latency are called auto-scaling. Latency improvements are mainly fulfilled through minimizing the "average" response times while unpredictabilities and fluctuations of the Web applications, aka flash crowds, can result in very high latencies for users' requests. Requests influenced by flash crowd suffer from long latencies, known as outliers. Such outliers are inevitable to a large extent as auto-scaling solutions continue to improve the average, not the "tail" of latencies. In this paper, we study possible sources of tail latency in auto-scaling mechanisms for Web applications. Based on our extensive evaluations in a real cloud platform, we discovered sources of a tail latency as 1) large requests, i.e. those data-intensive; 2) long-term scaling intervals; 3) instant analysis of scaling parameters; 4) conservative, i.e. tight, threshold tuning; 5) load-unaware surplus VM selection policies used for executing a scale-down decision; 6) cooldown feature, although cost-effective; and 7) VM start-up delay. We also discovered that after improving the average latency by auto-scaling mechanisms, the tail may behave differently, demanding dedicated tail-aware solutions for auto-scaling mechanisms.
引用
收藏
页码:186 / 195
页数:10
相关论文
共 50 条
  • [41] Auto-scaling techniques for IoT-based cloud applications: a review
    Verma, Shveta
    Bala, Anju
    CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2021, 24 (03): : 2425 - 2459
  • [42] Performance and Energy-based Cost Prediction of Virtual Machines Auto-Scaling in Clouds
    Aldossary, Mohammad
    Djemame, Karim
    44TH EUROMICRO CONFERENCE ON SOFTWARE ENGINEERING AND ADVANCED APPLICATIONS (SEAA 2018), 2018, : 502 - 509
  • [43] Proactive Auto-Scaling Approach of Production Applications Using an Ensemble Model
    Samir, Mohamed
    Wassif, Khaled T. T.
    Makady, Soha H. H.
    IEEE ACCESS, 2023, 11 : 25008 - 25019
  • [44] Auto-Scaling Cloud-Based Memory-Intensive Applications
    Novak, Joe
    Kasera, Sneha Kumar
    Stutsman, Ryan
    2020 IEEE 13TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING (CLOUD 2020), 2020, : 229 - 237
  • [45] A cost-aware auto-scaling approach using the workload prediction in service clouds
    Yang, Jingqi
    Liu, Chuanchang
    Shang, Yanlei
    Cheng, Bo
    Mao, Zexiang
    Liu, Chunhong
    Niu, Lisha
    Chen, Junliang
    INFORMATION SYSTEMS FRONTIERS, 2014, 16 (01) : 7 - 18
  • [46] Project Hoover: Auto-Scaling Streaming Map-Reduce Applications
    Ramesh, Rajalakshmi
    Hu, Liting
    Schwan, Karsten
    MBDS '12: PROCEEDINGS OF THE 2012 WORKSHOP ON MANAGEMENT OF BIG DATA SYSTEMS, 2012, : 7 - 12
  • [47] AMAS: Adaptive Auto-Scaling on the Edge
    Mukherjee, Saptarshi
    Sidhanta, Subhajit
    21ST IEEE/ACM INTERNATIONAL SYMPOSIUM ON CLUSTER, CLOUD AND INTERNET COMPUTING (CCGRID 2021), 2021, : 618 - 621
  • [48] Auto-scaling containerized cloud applications: A workload-driven approach
    Chouliaras, Spyridon
    Sotiriadis, Stelios
    SIMULATION MODELLING PRACTICE AND THEORY, 2022, 121
  • [49] Dynamic Multi-level Auto-scaling Rules for Containerized Applications
    Taherizadeh, Salman
    Stankovski, Vlado
    COMPUTER JOURNAL, 2019, 62 (02): : 174 - 197
  • [50] Auto-scaling techniques for IoT-based cloud applications: a review
    Shveta Verma
    Anju Bala
    Cluster Computing, 2021, 24 : 2425 - 2459