Microservice Auto-Scaling Algorithm Based on Workload Prediction in Cloud-Edge Collaboration Environment

被引:0
|
作者
Peng, Zijun [1 ,2 ]
Tang, Bing [1 ,2 ]
Xu, Wei [1 ,2 ]
Yang, Qing [3 ]
Hussaini, Ehsanullah [1 ,2 ]
Xiao, Yuqiang [1 ,2 ]
Li, Haiyan [1 ,2 ]
机构
[1] Hunan Univ Sci & Technol, Sch Comp Sci & Engn, Xiangtan 411201, Peoples R China
[2] Hunan Key Lab Serv Comp & Novel Software Technol, Xiangtan 411201, Peoples R China
[3] Guangzhou Maritime Univ, Ctr Network & Educ Technol, Guangzhou 510725, Peoples R China
基金
中国国家自然科学基金; 国家重点研发计划;
关键词
Auto-Scaling; Microservice; Workload Prediction; Cloud-Edge Collaboration;
D O I
10.1109/iThings-GreenCom-CPSCom-SmartData-Cybermatics60724.2023.00112
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Cloud computing centrally consolidates hardware and computing resources, offering efficient and cost-effective services. However, as cloud computing centers are predominantly built and operated in a fully centralized fashion, the increased distance between these centers and users can lead to a decline in service quality. Real-time interaction and high business continuity are crucial in scenarios like traffic monitoring, AR/VR applications, and the Internet of Things (IoT). Edge computing is better suited to meet the demands of such latency-sensitive business needs. By analyzing and processing massive data directly at edge computing nodes, which focus on network edge devices, reliance on transmission resources is reduced, consequently improving the overall quality and performance of services. Nevertheless, resource-constrained edge nodes require efficient utilization of available infrastructure capacity to ensure specific service level objectives (SLO) for applications. Therefore, this paper introduces XScale, a cloud-edge collaborative system that enables microservices to adaptively scale elastically. XScale applies a Bi-LSTM with an attention mechanism to forecast the workload of microservices. When combined with mechanisms designed to handle burst traffic and a cloud-edge collaborative load forwarding strategy, it achieves both adaptive elastic scaling and proactive load forwarding. Experimental results, obtained using real-world microservice workloads, indicate that the XScale system can significantly reduce SLO violations by 88%, increase resource utilization by 15%, and decrease average response time by 21% when compared to existing advanced reactive scaling methods.
引用
收藏
页码:608 / 615
页数:8
相关论文
共 50 条
  • [41] Horizontal Auto-Scaling and Process Migration Mechanism for Cloud Services with Skewness Algorithm
    Chaloemwat, Wathit
    Kitisin, Sukumal
    2016 13TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER SCIENCE AND SOFTWARE ENGINEERING (JCSSE), 2016, : 556 - 561
  • [42] Optimizing the performance of optimization in the cloud environment-An intelligent auto-scaling approach
    Simic, Visnja
    Stojanovic, Boban
    Ivanovic, Milos
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2019, 101 : 909 - 920
  • [43] Microservice deployment in cloud-edge environment using enhanced global search grey wolf optimizer-greedy algorithm
    Wang, Shudong
    Zhang, Yanxiang
    He, Xiao
    Wang, Nuanlai
    Lu, Zhi
    Chen, Baoyun
    Pang, Shanchen
    CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2025, 28 (02):
  • [44] A FPGA-BASED CLOUD-EDGE COLLABORATION PLATFORM IN CLOUD MANUFACTURING
    Xiao, Chuan
    Zhao, Chun
    Liu, Yue
    Zhang, Lin
    PROCEEDINGS OF THE ASME 2021 16TH INTERNATIONAL MANUFACTURING SCIENCE AND ENGINEERING CONFERENCE (MSEC2021), VOL 2, 2021,
  • [45] Efficient Auto-scaling for Host Load Prediction through VM migration in Cloud
    Verma, Shveta
    Bala, Anju
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2024, 36 (04):
  • [46] Auto-scaling techniques for IoT-based cloud applications: a review
    Verma, Shveta
    Bala, Anju
    CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2021, 24 (03): : 2425 - 2459
  • [47] Metaheuristic based auto-scaling for microservices in cloud environment: a new container-aware application scheduling
    Sarma, Subramonian Krishna
    INTERNATIONAL JOURNAL OF PERVASIVE COMPUTING AND COMMUNICATIONS, 2023, 19 (01) : 74 - 96
  • [48] Auto-Scaling Cloud-Based Memory-Intensive Applications
    Novak, Joe
    Kasera, Sneha Kumar
    Stutsman, Ryan
    2020 IEEE 13TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING (CLOUD 2020), 2020, : 229 - 237
  • [49] Modified firefly algorithm for workflow scheduling in cloud-edge environment
    Nebojsa Bacanin
    Miodrag Zivkovic
    Timea Bezdan
    K. Venkatachalam
    Mohamed Abouhawwash
    Neural Computing and Applications, 2022, 34 : 9043 - 9068
  • [50] Performance Modelling and Verification of Cloud-based Auto-Scaling Policies
    Evangelidis, Alexandros
    Parker, David
    Bahsoon, Rami
    2017 17TH IEEE/ACM INTERNATIONAL SYMPOSIUM ON CLUSTER, CLOUD AND GRID COMPUTING (CCGRID), 2017, : 355 - 364