Microservice Auto-Scaling Algorithm Based on Workload Prediction in Cloud-Edge Collaboration Environment

被引：0

作者：

Peng, Zijun ^{[1
,2
]}

Tang, Bing ^{[1
,2
]}

Xu, Wei ^{[1
,2
]}

Yang, Qing ^{[3
]}

Hussaini, Ehsanullah ^{[1
,2
]}

Xiao, Yuqiang ^{[1
,2
]}

Li, Haiyan ^{[1
,2
]}

机构：

[1] Hunan Univ Sci & Technol, Sch Comp Sci & Engn, Xiangtan 411201, Peoples R China

[2] Hunan Key Lab Serv Comp & Novel Software Technol, Xiangtan 411201, Peoples R China

[3] Guangzhou Maritime Univ, Ctr Network & Educ Technol, Guangzhou 510725, Peoples R China

来源：

2023 IEEE INTERNATIONAL CONFERENCES ON INTERNET OF THINGS, ITHINGS IEEE GREEN COMPUTING AND COMMUNICATIONS, GREENCOM IEEE CYBER, PHYSICAL AND SOCIAL COMPUTING, CPSCOM IEEE SMART DATA, SMARTDATA AND IEEE CONGRESS ON CYBERMATICS,CYBERMATICS | 2024年

基金：

中国国家自然科学基金; 国家重点研发计划;

关键词：

Auto-Scaling; Microservice; Workload Prediction; Cloud-Edge Collaboration;

D O I：

10.1109/iThings-GreenCom-CPSCom-SmartData-Cybermatics60724.2023.00112

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Cloud computing centrally consolidates hardware and computing resources, offering efficient and cost-effective services. However, as cloud computing centers are predominantly built and operated in a fully centralized fashion, the increased distance between these centers and users can lead to a decline in service quality. Real-time interaction and high business continuity are crucial in scenarios like traffic monitoring, AR/VR applications, and the Internet of Things (IoT). Edge computing is better suited to meet the demands of such latency-sensitive business needs. By analyzing and processing massive data directly at edge computing nodes, which focus on network edge devices, reliance on transmission resources is reduced, consequently improving the overall quality and performance of services. Nevertheless, resource-constrained edge nodes require efficient utilization of available infrastructure capacity to ensure specific service level objectives (SLO) for applications. Therefore, this paper introduces XScale, a cloud-edge collaborative system that enables microservices to adaptively scale elastically. XScale applies a Bi-LSTM with an attention mechanism to forecast the workload of microservices. When combined with mechanisms designed to handle burst traffic and a cloud-edge collaborative load forwarding strategy, it achieves both adaptive elastic scaling and proactive load forwarding. Experimental results, obtained using real-world microservice workloads, indicate that the XScale system can significantly reduce SLO violations by 88%, increase resource utilization by 15%, and decrease average response time by 21% when compared to existing advanced reactive scaling methods.

引用

页码：608 / 615

页数：8

共 50 条

[31] Auto-Scaling Approach for Cloud based Mobile Learning Applications
Almutlaq, Amani Nasser
Daadaa, Yassine
INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2019, 10 (01) : 472 - 479
[32] ASTAW: Auto-scaling threshold-based approach for web application in cloud computing environment
Fallah, Monireh
Arani, Mostafa Ghobaei
International Journal of u- and e- Service, Science and Technology, 2015, 8 (03) : 221 - 230
[33] A Data Analytics Based Approach to Cloud Resource Auto-Scaling
Hao, Fang
Kodialam, Murali
Mukherjee, Sarit
Lakshman, T., V
2022 IEEE 23RD INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE SWITCHING AND ROUTING (IEEE HPSR), 2022, : 224 - 231
[34] A cost-aware auto-scaling approach using the workload prediction in service clouds
Jingqi Yang
Chuanchang Liu
Yanlei Shang
Bo Cheng
Zexiang Mao
Chunhong Liu
Lisha Niu
Junliang Chen
Information Systems Frontiers, 2014, 16 : 7 - 18
[35] A Dynamic Scalable Auto-Scaling Model as a Load Balancer in the Cloud Computing Environment
Rout, Saroja Kumar
Ravindra, J. V. R.
Meda, Anudeep
Mohanty, Sachi Nandan
Kavididevi, Venkatesh
EAI ENDORSED TRANSACTIONS ON SCALABLE INFORMATION SYSTEMS, 2023, 10 (05) : 1 - 7
[36] The Survival Analysis of Big Data Application Over Auto-scaling Cloud Environment
Rajput, R. S.
Goyal, Dinesh
Pant, Anjali
EMERGING TECHNOLOGIES IN COMPUTER ENGINEERING: MICROSERVICES IN BIG DATA ANALYTICS, 2019, 985 : 155 - 166
[37] A cost-aware auto-scaling approach using the workload prediction in service clouds
Yang, Jingqi
Liu, Chuanchang
Shang, Yanlei
Cheng, Bo
Mao, Zexiang
Liu, Chunhong
Niu, Lisha
Chen, Junliang
INFORMATION SYSTEMS FRONTIERS, 2014, 16 (01) : 7 - 18
[38] Auto-scaling and computation offloading in edge/cloud computing: a fuzzy Q-learning-based approach
Ma, Xiang
Zong, Kexuan
Rezaeipanah, Amin
WIRELESS NETWORKS, 2024, 30 (02) : 637 - 648
[39] Horizontal Auto-Scaling in Edge Computing Environment using Online Machine Learning
da Silva, Thiago Pereira
Rocha Neto, Aluizio F.
Batista, Thais Vasconcelos
Lopes, Frederico A. S.
Delicato, Flavia C.
Pires, Paulo F.
2021 IEEE INTL CONF ON DEPENDABLE, AUTONOMIC AND SECURE COMPUTING, INTL CONF ON PERVASIVE INTELLIGENCE AND COMPUTING, INTL CONF ON CLOUD AND BIG DATA COMPUTING, INTL CONF ON CYBER SCIENCE AND TECHNOLOGY CONGRESS DASC/PICOM/CBDCOM/CYBERSCITECH 2021, 2021, : 161 - 168
[40] Auto-scaling and computation offloading in edge/cloud computing: a fuzzy Q-learning-based approach
Xiang Ma
Kexuan Zong
Amin Rezaeipanah
Wireless Networks, 2024, 30 : 637 - 648

← 1 2 3 4 5 →