Auto-scaling of Web Applications in Clouds: A Tail Latency Evaluation

被引:6
|
作者
Aslanpour, Mohammad S. [1 ,2 ]
Toosi, Adel N. [1 ]
Gaire, Raj [2 ]
Cheema, Muhammad Aamir [1 ]
机构
[1] Monash Univ, Fac Informat Technol, Clayton, Vic, Australia
[2] CSIROs Data61, Canberra, ACT, Australia
关键词
cloud computing; auto-scaling; tail latency; resource provisioning; performance evaluation;
D O I
10.1109/UCC48980.2020.00037
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Mechanisms for dynamically adding and removing Virtual Machines (VMs) to reduce cost while minimizing the latency are called auto-scaling. Latency improvements are mainly fulfilled through minimizing the "average" response times while unpredictabilities and fluctuations of the Web applications, aka flash crowds, can result in very high latencies for users' requests. Requests influenced by flash crowd suffer from long latencies, known as outliers. Such outliers are inevitable to a large extent as auto-scaling solutions continue to improve the average, not the "tail" of latencies. In this paper, we study possible sources of tail latency in auto-scaling mechanisms for Web applications. Based on our extensive evaluations in a real cloud platform, we discovered sources of a tail latency as 1) large requests, i.e. those data-intensive; 2) long-term scaling intervals; 3) instant analysis of scaling parameters; 4) conservative, i.e. tight, threshold tuning; 5) load-unaware surplus VM selection policies used for executing a scale-down decision; 6) cooldown feature, although cost-effective; and 7) VM start-up delay. We also discovered that after improving the average latency by auto-scaling mechanisms, the tail may behave differently, demanding dedicated tail-aware solutions for auto-scaling mechanisms.
引用
收藏
页码:186 / 195
页数:10
相关论文
共 50 条
  • [1] Auto-Scaling Web Applications in Clouds: A Taxonomy and Survey
    Qu, Chenhao
    Calheiros, Rodrigo N.
    Buyya, Rajkumar
    ACM COMPUTING SURVEYS, 2018, 51 (04)
  • [2] AutoScaleSim: A simulation toolkit for auto-scaling Web applications in clouds
    Aslanpour, Mohammad S.
    Toosi, Adel N.
    Taheri, Javid
    Gaire, Raj
    SIMULATION MODELLING PRACTICE AND THEORY, 2021, 108 (108)
  • [3] Auto-scaling web applications in clouds: A cost-aware approach
    Aslanpour, Mohammad Sadegh
    Ghobaei-Arani, Mostafa
    Toosi, Adel Nadjaran
    JOURNAL OF NETWORK AND COMPUTER APPLICATIONS, 2017, 95 : 26 - 41
  • [4] Auto-Scaling Containerized Applications in Geo-Distributed Clouds
    Shi, Tao
    Ma, Hui
    Chen, Gang
    Hartmann, Sven
    IEEE TRANSACTIONS ON SERVICES COMPUTING, 2023, 16 (06) : 4261 - 4274
  • [5] Optimal Cloud Resource Auto-Scaling for Web Applications
    Jiang, Jing
    Lu, Jie
    Zhang, Guangquan
    Long, Guodong
    PROCEEDINGS OF THE 2013 13TH IEEE/ACM INTERNATIONAL SYMPOSIUM ON CLUSTER, CLOUD AND GRID COMPUTING (CCGRID 2013), 2013, : 58 - 65
  • [6] Auto-Scaling Web Applications in Hybrid Cloud Based on Docker
    Li, Yunchun
    Xia, Yumeng
    PROCEEDINGS OF 2016 5TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND NETWORK TECHNOLOGY (ICCSNT), 2016, : 75 - 79
  • [7] A hybrid auto-scaling technique for clouds processing applications with service level agreements
    Anshuman Biswas
    Shikharesh Majumdar
    Biswajit Nandy
    Ali El-Haraki
    Journal of Cloud Computing, 6
  • [8] A hybrid auto-scaling technique for clouds processing applications with service level agreements
    Biswas, Anshuman
    Majumdar, Shikharesh
    Nandy, Biswajit
    El-Haraki, Ali
    JOURNAL OF CLOUD COMPUTING-ADVANCES SYSTEMS AND APPLICATIONS, 2017, 6
  • [9] RHAS: robust hybrid auto-scaling for web applications in cloud computing
    Parminder Singh
    Avinash Kaur
    Pooja Gupta
    Sukhpal Singh Gill
    Kiran Jyoti
    Cluster Computing, 2021, 24 : 717 - 737
  • [10] Dynamic workload patterns prediction for proactive auto-scaling of web applications
    Iqbal, Waheed
    Erradi, Abdelkarim
    Mahmood, Arif
    JOURNAL OF NETWORK AND COMPUTER APPLICATIONS, 2018, 124 : 94 - 107