Microservice Auto-Scaling Algorithm Based on Workload Prediction in Cloud-Edge Collaboration Environment

被引:0
|
作者
Peng, Zijun [1 ,2 ]
Tang, Bing [1 ,2 ]
Xu, Wei [1 ,2 ]
Yang, Qing [3 ]
Hussaini, Ehsanullah [1 ,2 ]
Xiao, Yuqiang [1 ,2 ]
Li, Haiyan [1 ,2 ]
机构
[1] Hunan Univ Sci & Technol, Sch Comp Sci & Engn, Xiangtan 411201, Peoples R China
[2] Hunan Key Lab Serv Comp & Novel Software Technol, Xiangtan 411201, Peoples R China
[3] Guangzhou Maritime Univ, Ctr Network & Educ Technol, Guangzhou 510725, Peoples R China
基金
中国国家自然科学基金; 国家重点研发计划;
关键词
Auto-Scaling; Microservice; Workload Prediction; Cloud-Edge Collaboration;
D O I
10.1109/iThings-GreenCom-CPSCom-SmartData-Cybermatics60724.2023.00112
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Cloud computing centrally consolidates hardware and computing resources, offering efficient and cost-effective services. However, as cloud computing centers are predominantly built and operated in a fully centralized fashion, the increased distance between these centers and users can lead to a decline in service quality. Real-time interaction and high business continuity are crucial in scenarios like traffic monitoring, AR/VR applications, and the Internet of Things (IoT). Edge computing is better suited to meet the demands of such latency-sensitive business needs. By analyzing and processing massive data directly at edge computing nodes, which focus on network edge devices, reliance on transmission resources is reduced, consequently improving the overall quality and performance of services. Nevertheless, resource-constrained edge nodes require efficient utilization of available infrastructure capacity to ensure specific service level objectives (SLO) for applications. Therefore, this paper introduces XScale, a cloud-edge collaborative system that enables microservices to adaptively scale elastically. XScale applies a Bi-LSTM with an attention mechanism to forecast the workload of microservices. When combined with mechanisms designed to handle burst traffic and a cloud-edge collaborative load forwarding strategy, it achieves both adaptive elastic scaling and proactive load forwarding. Experimental results, obtained using real-world microservice workloads, indicate that the XScale system can significantly reduce SLO violations by 88%, increase resource utilization by 15%, and decrease average response time by 21% when compared to existing advanced reactive scaling methods.
引用
收藏
页码:608 / 615
页数:8
相关论文
共 50 条
  • [1] An Efficient Algorithm for Microservice Placement in Cloud-Edge Collaborative Computing Environment
    He, Xiang
    Xu, Hanchuan
    Xu, Xiaofei
    Chen, Yin
    Wang, Zhongjie
    IEEE TRANSACTIONS ON SERVICES COMPUTING, 2024, 17 (05) : 1983 - 1997
  • [2] Predictive Auto-scaling: LSTM-Based Multi-step Cloud Workload Prediction
    Suleiman, Basem
    Alibasa, Muhammad Johan
    Chang, Ya-Yuan
    Anaissi, Ali
    SERVICE-ORIENTED COMPUTING - ICSOC 2023 WORKSHOPS, 2024, 14518 : 5 - 16
  • [3] Microservice Replacement Algorithm in Cloud-Edge System for Edge Intelligence
    Miao, Weiwei
    Zeng, Zeng
    Li, Shihao
    Wei, Lei
    Jiang, Chengling
    Quan, Siping
    Li, Yong
    19TH IEEE INTERNATIONAL SYMPOSIUM ON PARALLEL AND DISTRIBUTED PROCESSING WITH APPLICATIONS (ISPA/BDCLOUD/SOCIALCOM/SUSTAINCOM 2021), 2021, : 1737 - 1744
  • [4] HCA Operator: A Hybrid Cloud Auto-scaling Tooling for Microservice Workloads
    Wang, Yuyang
    Zhang, Fan
    Khan, Samee U.
    2022 18TH INTERNATIONAL CONFERENCE ON MOBILITY, SENSING AND NETWORKING, MSN, 2022, : 885 - 890
  • [5] Introducing an adaptive model for auto-scaling cloud computing based on workload classification
    Alanagh, Yoosef Alidoost
    Firouzi, Mojtaba
    Kenari, Abdolreza Rasouli
    Shamsi, Mahboubeh
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2023, 35 (22):
  • [6] The Power of Prediction: Microservice Auto Scaling via Workload Learning
    Luo, Shutian
    Xu, Huanle
    Ye, Kejiang
    Xu, Guoyao
    Zhang, Liping
    Yang, Guodong
    Xu, Chengzhong
    PROCEEDINGS OF THE 13TH SYMPOSIUM ON CLOUD COMPUTING, SOCC 2022, 2022, : 355 - 369
  • [7] Auto-scaling containerized cloud applications: A workload-driven approach
    Chouliaras, Spyridon
    Sotiriadis, Stelios
    SIMULATION MODELLING PRACTICE AND THEORY, 2022, 121
  • [8] Workload Characterization in HPC Environment for Auto-scaling of Resources - Preliminary Study
    Barve, Mahesh
    Sinha, Sharad
    Hardikar, Rahul Padmakar
    Gunturu, Ashok
    Mallik, Writtam
    2022 IEEE 19TH INDIA COUNCIL INTERNATIONAL CONFERENCE, INDICON, 2022,
  • [9] Evaluating Sensitivity of Auto-scaling Decisions in an Environment with Different Workload Patterns
    Nikravesh, Ali Yadavar
    Ajila, Samuel A.
    Lung, Chung-Horng
    39TH ANNUAL IEEE COMPUTERS, SOFTWARE AND APPLICATIONS CONFERENCE (COMPSAC 2015), VOL 2, 2015, : 415 - 420
  • [10] ServiceSim: A Modelling and Simulation Toolkit of Microservice Systems in Cloud-Edge Environment
    Shi, Haomai
    He, Xiang
    Wang, Teng
    Wang, Zhongjie
    SERVICE-ORIENTED COMPUTING, ICSOC 2023, PT I, 2023, 14419 : 258 - 272