Amoeba: QoS-Awareness and Reduced Resource Usage of Microservices with Serverless Computing

被引:29
|
作者
Li, Zijun [1 ]
Chen, Quan [1 ,2 ]
Xue, Shuai [1 ]
Ma, Tao [3 ]
Yang, Yong [3 ]
Song, Zhuo [3 ]
Guo, Minyi [1 ,2 ]
机构
[1] Shanghai Jiao Tong Univ, Dept Comp Sci & Engn, Shanghai, Peoples R China
[2] Shanghai Jiao Tong Univ, Shanghai Inst Adv Commun & Data Sci, Shanghai, Peoples R China
[3] Alibaba Cloud, Beijing, Peoples R China
来源
2020 IEEE 34TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM IPDPS 2020 | 2020年
基金
中国国家自然科学基金;
关键词
Serverless computing; Microservices; QoS;
D O I
10.1109/IPDPS47924.2020.00049
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
While microservices that have stringent Quality-of-Service constraints are deployed in the Clouds, the long-term rented infrastructures that host the microservices are underutilized except peak hours due to the diurnal load pattern. It is resource efficient for Cloud vendors and cost efficient for service maintainers to deploy the microservices in the long-term infrastructure at high load and in the serverless computing platform at low load. However, prior work fails to take advantage of the opportunity, because the contention between microservices on the serverless platform seriously affects their response latencies. Our investigation shows that the load of a microservice, the shared resource contentions on the serverless platform, and its sensitivities to the contention together affect the response latency of the microservice on the platform. To this end, we propose Amoeba, a runtime system that dynamically switches the deployment of a microservice. Amoeba is comprised of a contention-aware deployment controller, a hybrid execution engine, and a multi-resource contention monitor. The deployment controller predicts the tail latency of a microservice based on its load and the contention on the serverless platform, and determines the appropriate deployment of the microservice. The hybrid execution engine enables the quick switch of the two deploy modes. The contention monitor periodically quantifies the contention on multiple types of shared resources. Experimental results show that Amoeba is able to significantly reduce up to 72.9% of CPU usage and up to 84.9% of memory usage compared with the traditional pure IaaS-based deployment, while ensuring the required latency target.
引用
收藏
页码:399 / 408
页数:10
相关论文
共 5 条
  • [1] Astra: Autonomous Serverless Analytics with Cost-Efficiency and QoS-Awareness
    Jarachanthan, Jananie
    Chen, Li
    Xu, Fei
    Li, Bo
    2021 IEEE 35TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM (IPDPS), 2021, : 756 - 765
  • [2] QoS-awareness of Microservices with Excessive Loads via Inter-Datacenter Scheduling
    Shi, Jiuchen
    Wang, Jiawen
    Fu, Kaihua
    Chen, Quan
    Zeng, Deze
    Guo, Minyi
    2022 IEEE 36TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM (IPDPS 2022), 2022, : 324 - 334
  • [3] Astrea: Auto-Serverless Analytics Towards Cost-Efficiency and QoS-Awareness
    Jarachanthan, Jananie
    Chen, Li
    Xu, Fei
    Li, Bo
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2022, 33 (12) : 3833 - 3849
  • [4] Publisher-side event filtering for QoS-awareness in ubiquitous computing
    Cho, Kyu Bong
    Song, Sung Keun
    Youn, Hee Yong
    ICCSA 2007: PROCEEDINGS OF THE FIFTH INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE AND APPLICATIONS, 2007, : 361 - +
  • [5] Optimized resource usage with hybrid auto-scaling system for knative serverless edge computing
    Tran, Minh-Ngoc
    Kim, Younghan
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2024, 152 : 304 - 316