Microservice Auto-Scaling Algorithm Based on Workload Prediction in Cloud-Edge Collaboration Environment

Authors
Peng, Zijun [1, 2]
Tang, Bing [1, 2]
Xu, Wei [1, 2]
Yang, Qing [3]
Hussaini, Ehsanullah [1, 2]
Xiao, Yuqiang [1, 2]
Li, Haiyan [1, 2]
Affiliations
[1] Hunan Univ Sci & Technol, Sch Comp Sci & Engn, Xiangtan 411201, Peoples R China
[2] Hunan Key Lab Serv Comp & Novel Software Technol, Xiangtan 411201, Peoples R China
[3] Guangzhou Maritime Univ, Ctr Network & Educ Technol, Guangzhou 510725, Peoples R China
Funding
National Natural Science Foundation of China; National Key Research and Development Program of China
Keywords
Auto-Scaling; Microservice; Workload Prediction; Cloud-Edge Collaboration
DOI
10.1109/iThings-GreenCom-CPSCom-SmartData-Cybermatics60724.2023.00112
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Cloud computing centrally consolidates hardware and computing resources, offering efficient and cost-effective services. However, as cloud computing centers are predominantly built and operated in a fully centralized fashion, the increased distance between these centers and users can lead to a decline in service quality. Real-time interaction and high business continuity are crucial in scenarios like traffic monitoring, AR/VR applications, and the Internet of Things (IoT). Edge computing is better suited to meet the demands of such latency-sensitive business needs. By analyzing and processing massive data directly at edge computing nodes, which focus on network edge devices, reliance on transmission resources is reduced, consequently improving the overall quality and performance of services. Nevertheless, resource-constrained edge nodes require efficient utilization of available infrastructure capacity to ensure specific service level objectives (SLO) for applications. Therefore, this paper introduces XScale, a cloud-edge collaborative system that enables microservices to adaptively scale elastically. XScale applies a Bi-LSTM with an attention mechanism to forecast the workload of microservices. When combined with mechanisms designed to handle burst traffic and a cloud-edge collaborative load forwarding strategy, it achieves both adaptive elastic scaling and proactive load forwarding. Experimental results, obtained using real-world microservice workloads, indicate that the XScale system can significantly reduce SLO violations by 88%, increase resource utilization by 15%, and decrease average response time by 21% when compared to existing advanced reactive scaling methods.
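The abstract names three cooperating pieces: a workload forecast, a burst-traffic mechanism, and cloud-edge load forwarding. The record gives no algorithmic detail, so the following is only a minimal sketch of how such pieces could fit together, not the XScale method. The function `plan_scaling`, the `ScalingDecision` type, the `burst_factor` threshold, and the per-replica capacity model are all hypothetical, and the forecast is taken as a plain number rather than the output of the Bi-LSTM model.

```python
import math
from dataclasses import dataclass


@dataclass
class ScalingDecision:
    edge_replicas: int     # replicas to run on the edge node
    forwarded_rps: float   # load (requests/s) forwarded to the cloud


def plan_scaling(predicted_rps: float,
                 observed_rps: float,
                 rps_per_replica: float,
                 max_edge_replicas: int,
                 burst_factor: float = 1.5) -> ScalingDecision:
    """Hypothetical proactive scaling rule (an assumption, not XScale itself)."""
    if rps_per_replica <= 0:
        raise ValueError("rps_per_replica must be positive")

    # Burst handling (assumed rule): if the observed load exceeds the
    # forecast by more than burst_factor, plan for the observed load.
    target_rps = predicted_rps
    if observed_rps > burst_factor * predicted_rps:
        target_rps = observed_rps

    # Proactive scaling: enough replicas for the target load, capped by
    # the capacity of the resource-constrained edge node.
    wanted = max(1, math.ceil(target_rps / rps_per_replica))
    edge_replicas = min(wanted, max_edge_replicas)

    # Load forwarding: whatever the edge replicas cannot absorb goes to the cloud.
    forwarded = max(0.0, target_rps - edge_replicas * rps_per_replica)
    return ScalingDecision(edge_replicas, forwarded)


if __name__ == "__main__":
    print(plan_scaling(450, 400, 100, 8))
    print(plan_scaling(200, 1000, 100, 6))
```

In the first call the forecast drives the decision and the edge node absorbs all load; in the second the observed burst overrides the forecast, the edge node is capped at its maximum replica count, and the remainder is forwarded to the cloud.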
Pages: 608-615
Page count: 8
Related papers
50 in total
  • [31] Auto-Scaling Approach for Cloud based Mobile Learning Applications
    Almutlaq, Amani Nasser
    Daadaa, Yassine
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2019, 10 (01) : 472 - 479
  • [32] ASTAW: Auto-scaling threshold-based approach for web application in cloud computing environment
    Fallah, Monireh
    Arani, Mostafa Ghobaei
    International Journal of u- and e- Service, Science and Technology, 2015, 8 (03) : 221 - 230
  • [33] A Data Analytics Based Approach to Cloud Resource Auto-Scaling
    Hao, Fang
    Kodialam, Murali
    Mukherjee, Sarit
    Lakshman, T. V.
    2022 IEEE 23RD INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE SWITCHING AND ROUTING (IEEE HPSR), 2022, : 224 - 231
  • [34] A cost-aware auto-scaling approach using the workload prediction in service clouds
    Jingqi Yang
    Chuanchang Liu
    Yanlei Shang
    Bo Cheng
    Zexiang Mao
    Chunhong Liu
    Lisha Niu
    Junliang Chen
    Information Systems Frontiers, 2014, 16 : 7 - 18
  • [35] A Dynamic Scalable Auto-Scaling Model as a Load Balancer in the Cloud Computing Environment
    Rout, Saroja Kumar
    Ravindra, J. V. R.
    Meda, Anudeep
    Mohanty, Sachi Nandan
    Kavididevi, Venkatesh
    EAI ENDORSED TRANSACTIONS ON SCALABLE INFORMATION SYSTEMS, 2023, 10 (05) : 1 - 7
  • [36] The Survival Analysis of Big Data Application Over Auto-scaling Cloud Environment
    Rajput, R. S.
    Goyal, Dinesh
    Pant, Anjali
    EMERGING TECHNOLOGIES IN COMPUTER ENGINEERING: MICROSERVICES IN BIG DATA ANALYTICS, 2019, 985 : 155 - 166
  • [37] A cost-aware auto-scaling approach using the workload prediction in service clouds
    Yang, Jingqi
    Liu, Chuanchang
    Shang, Yanlei
    Cheng, Bo
    Mao, Zexiang
    Liu, Chunhong
    Niu, Lisha
    Chen, Junliang
    INFORMATION SYSTEMS FRONTIERS, 2014, 16 (01) : 7 - 18
  • [38] Auto-scaling and computation offloading in edge/cloud computing: a fuzzy Q-learning-based approach
    Ma, Xiang
    Zong, Kexuan
    Rezaeipanah, Amin
    WIRELESS NETWORKS, 2024, 30 (02) : 637 - 648
  • [39] Horizontal Auto-Scaling in Edge Computing Environment using Online Machine Learning
    da Silva, Thiago Pereira
    Rocha Neto, Aluizio F.
    Batista, Thais Vasconcelos
    Lopes, Frederico A. S.
    Delicato, Flavia C.
    Pires, Paulo F.
    2021 IEEE INTL CONF ON DEPENDABLE, AUTONOMIC AND SECURE COMPUTING, INTL CONF ON PERVASIVE INTELLIGENCE AND COMPUTING, INTL CONF ON CLOUD AND BIG DATA COMPUTING, INTL CONF ON CYBER SCIENCE AND TECHNOLOGY CONGRESS DASC/PICOM/CBDCOM/CYBERSCITECH 2021, 2021, : 161 - 168
  • [40] Auto-scaling and computation offloading in edge/cloud computing: a fuzzy Q-learning-based approach
    Xiang Ma
    Kexuan Zong
    Amin Rezaeipanah
    Wireless Networks, 2024, 30 : 637 - 648