Amoeba: QoS-Awareness and Reduced Resource Usage of Microservices with Serverless Computing

被引：29

作者：

Li, Zijun ^{[1
]}

Chen, Quan ^{[1
,2
]}

Xue, Shuai ^{[1
]}

Ma, Tao ^{[3
]}

Yang, Yong ^{[3
]}

Song, Zhuo ^{[3
]}

Guo, Minyi ^{[1
,2
]}

机构：

[1] Shanghai Jiao Tong Univ, Dept Comp Sci & Engn, Shanghai, Peoples R China

[2] Shanghai Jiao Tong Univ, Shanghai Inst Adv Commun & Data Sci, Shanghai, Peoples R China

[3] Alibaba Cloud, Beijing, Peoples R China

来源：

2020 IEEE 34TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM IPDPS 2020 | 2020年

基金：

中国国家自然科学基金;

关键词：

Serverless computing; Microservices; QoS;

D O I：

10.1109/IPDPS47924.2020.00049

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

While microservices that have stringent Quality-of-Service constraints are deployed in the Clouds, the long-term rented infrastructures that host the microservices are underutilized except peak hours due to the diurnal load pattern. It is resource efficient for Cloud vendors and cost efficient for service maintainers to deploy the microservices in the long-term infrastructure at high load and in the serverless computing platform at low load. However, prior work fails to take advantage of the opportunity, because the contention between microservices on the serverless platform seriously affects their response latencies. Our investigation shows that the load of a microservice, the shared resource contentions on the serverless platform, and its sensitivities to the contention together affect the response latency of the microservice on the platform. To this end, we propose Amoeba, a runtime system that dynamically switches the deployment of a microservice. Amoeba is comprised of a contention-aware deployment controller, a hybrid execution engine, and a multi-resource contention monitor. The deployment controller predicts the tail latency of a microservice based on its load and the contention on the serverless platform, and determines the appropriate deployment of the microservice. The hybrid execution engine enables the quick switch of the two deploy modes. The contention monitor periodically quantifies the contention on multiple types of shared resources. Experimental results show that Amoeba is able to significantly reduce up to 72.9% of CPU usage and up to 84.9% of memory usage compared with the traditional pure IaaS-based deployment, while ensuring the required latency target.

引用

页码：399 / 408

页数：10

共 5 条

[1] Astra: Autonomous Serverless Analytics with Cost-Efficiency and QoS-Awareness
Jarachanthan, Jananie
Chen, Li
Xu, Fei
Li, Bo
2021 IEEE 35TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM (IPDPS), 2021, : 756 - 765
[2] QoS-awareness of Microservices with Excessive Loads via Inter-Datacenter Scheduling
Shi, Jiuchen
Wang, Jiawen
Fu, Kaihua
Chen, Quan
Zeng, Deze
Guo, Minyi
2022 IEEE 36TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM (IPDPS 2022), 2022, : 324 - 334
[3] Astrea: Auto-Serverless Analytics Towards Cost-Efficiency and QoS-Awareness
Jarachanthan, Jananie
Chen, Li
Xu, Fei
Li, Bo
IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2022, 33 (12) : 3833 - 3849
[4] Publisher-side event filtering for QoS-awareness in ubiquitous computing
Cho, Kyu Bong
Song, Sung Keun
Youn, Hee Yong
ICCSA 2007: PROCEEDINGS OF THE FIFTH INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE AND APPLICATIONS, 2007, : 361 - +
[5] Optimized resource usage with hybrid auto-scaling system for knative serverless edge computing
Tran, Minh-Ngoc
Kim, Younghan
FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2024, 152 : 304 - 316

← 1 →