Performance Interference-Aware Vertical Elasticity for Cloud-hosted Latency-Sensitive Applications

被引:30
|
作者
Shekhar, Shashank [1 ]
Abdel-Aziz, Hamzah [1 ]
Bhattacharjee, Anirban [1 ]
Gokhale, Aniruddha [1 ]
Koutsoukos, Xenofon [1 ]
机构
[1] Vanderbilt Univ, Dept EECS, 221 Kirkland Hall, Nashville, TN 37235 USA
基金
美国国家科学基金会;
关键词
Cloud computing; Data center; Multi-tenancy; Workload variability; Latency sensitive; Performance interference; Vertical elasticity; Virtualization; Linux containers; Docker; Online predictive models; Gaussian processes;
D O I
10.1109/CLOUD.2018.00018
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Elastic auto-scaling in cloud platforms has primarily used horizontal scaling by assigning application instances to distributed resources. Owing to rapid advances in hardware, cloud providers are now seeking vertical elasticity before attempting horizontal scaling to provide elastic auto-scaling for applications. Vertical elasticity solutions must, however, be cognizant of performance interference that stems from multi-tenant collocated applications since interference significantly impacts application quality-of-service (QoS) properties, such as latency. The problem becomes more pronounced for latency-sensitive applications that demand strict QoS properties. Further exacerbating the problem are variations in workloads, which make it hard to determine the right kinds of timely resource adaptations for latency-sensitive applications. To address these challenges and overcome limitations in existing offline approaches, we present an online, data-driven approach which utilizes Gaussian Processes-based machine learning techniques to build runtime predictive models of the performance of the system under different levels of interference. The predictive online models are then used in dynamically adapting to the workload variability by vertically auto-scaling co-located applications such that performance interference is minimized and QoS properties of latency-sensitive applications are met.
引用
收藏
页码:82 / 89
页数:8
相关论文
共 29 条
  • [21] User-Centric Interference-Aware Load Balancing for Cloud-Deployed Applications
    Javadi, Seyyed Ahmad
    Gandhi, Anshul
    IEEE TRANSACTIONS ON CLOUD COMPUTING, 2022, 10 (01) : 736 - 748
  • [22] FAS-DQN: Freshness-Aware Scheduling via Reinforcement Learning for Latency-Sensitive Applications
    Zhou, Chunyang
    Li, Guohui
    Li, Jianjun
    Zhou, Quan
    Guo, Bing
    IEEE TRANSACTIONS ON COMPUTERS, 2022, 71 (10) : 2381 - 2394
  • [23] DIAL: Reducing Tail Latencies for Cloud Applications via Dynamic Interference-aware Load Balancing
    Javadi, Seyyed Ahmad
    Gandhi, Anshul
    2017 IEEE INTERNATIONAL CONFERENCE ON AUTOMATIC COMPUTING (ICAC), 2017, : 135 - 144
  • [24] ORIENT: A Priority-Aware Energy-Efficient Approach for Latency-Sensitive Applications in 6G
    Shokrnezhad, Masoud
    Taleb, Tarik
    ICC 2024 - IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS, 2024, : 2089 - 2094
  • [25] An Interference-aware Virtual Machine Placement Strategy for High Performance Computing Applications in Clouds
    Alves, Maicon Melo
    Teylo, Luan
    Frota, Yuri
    Drummond, Lucia M. A.
    2018 SYMPOSIUM ON HIGH PERFORMANCE COMPUTING SYSTEMS (WSCAD 2018), 2018, : 94 - 100
  • [26] An Interference-Aware Strategy for Co-locating High Performance Computing Applications in Clouds
    Alves, Maicon Melo
    Teylo, Luan
    Frota, Yuri
    Drummond, Lucia Maria de A.
    HIGH PERFORMANCE COMPUTING SYSTEMS, WSCAD 2018, 2020, 1171 : 3 - 20
  • [27] Towards Latency Sensitive Cloud Native Applications: A Performance Study on AWS
    Pelle, Istvan
    Czentye, Janos
    Doka, Janos
    Sonkoly, Balazs
    2019 IEEE 12TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING (IEEE CLOUD 2019), 2019, : 272 - 280
  • [28] Efficient Edge-Cloud Publish/Subscribe Broker Overlay Networks to Support Latency-Sensitive Wide-Scale IoT Applications
    Van-Nam Pham
    VanDung Nguyen
    Nguyen, Tri D. T.
    Huh, Eui-Nam
    SYMMETRY-BASEL, 2020, 12 (01):
  • [29] Alioth: A Machine Learning Based Interference -Aware Performance Monitor for Multi -Tenancy Applications in Public Cloud
    Shi, Tianyao
    Yang, Yingxuan
    Cheng, Yunlong
    Gao, Xiaofeng
    Fang, Zhen
    Yang, Yongqiang
    2023 IEEE INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM, IPDPS, 2023, : 908 - 917