Proactive Resource Autoscaling Scheme Based on SCINet for High-Performance Cloud Computing

被引:10
|
作者
Jeong, Byeonghui [1 ]
Jeon, Jueun [1 ]
Jeong, Young-Sik [2 ]
机构
[1] Dongguk Univ, Dept Multimedia Engn, Seoul 04620, South Korea
[2] Dongguk Univ, Dept AI SW, Seoul 04620, South Korea
关键词
Cloud computing; container resource autoscaling; resource management; time-series forecasting; MANAGEMENT;
D O I
10.1109/TCC.2023.3292378
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The container resource autoscaling technique provides scalability to cloud services composed of microservice architecture in a cloud-native computing environment. However, the service efficiency is reduced as the scaling is delayed because dynamic loads occur with various workload patterns. Furthermore, estimating the efficient resource size for the workload is difficult, resulting in resource waste and overload. Therefore, this study proposes high-performance resource management (HiPerRM), which stably and elastically manages container resources to ensure service scalability and efficiency even under rapidly changing dynamic loads. HiPerRM forecasts future workloads using a sample convolutional and interaction network (SCINet) model applied with the reversible instance normalization (RevIN) method. HiPerRM generates a resource request with an elastic size based on the forecasted CPU and memory usage, and then efficiently adjusts the pod's resource request and the number of replicas via HiPerRM's VPA (Hi-VPA) and HiPerRM's HPA (Hi-HPA). As a result of evaluating the performance of HiPerRM, the average resource utilization was improved by approximately 3.96-34.06% compared to conventional autoscaling techniques, even when the resource size was incorrectly estimated for various workloads, and there were relatively fewer overloads.
引用
收藏
页码:3497 / 3509
页数:13
相关论文
共 50 条
  • [1] SciNet: Codesign of Resource Management in Cloud Computing Environments
    Tuli, Shreshth
    Casale, Giuliano
    Jennings, Nicholas R.
    IEEE TRANSACTIONS ON COMPUTERS, 2023, 72 (12) : 3590 - 3602
  • [2] ARAScaler: Adaptive Resource Autoscaling Scheme Using ETimeMixer for Efficient Cloud-Native Computing
    Jeong, Byeonghui
    Jeong, Young-Sik
    IEEE TRANSACTIONS ON SERVICES COMPUTING, 2025, 18 (01) : 72 - 84
  • [3] Zero-Carbon Cloud: A Volatile Resource for High-Performance Computing
    Chien, Andrew A.
    Wolski, Rich
    Yang, Fan
    CIT/IUCC/DASC/PICOM 2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION TECHNOLOGY - UBIQUITOUS COMPUTING AND COMMUNICATIONS - DEPENDABLE, AUTONOMIC AND SECURE COMPUTING - PERVASIVE INTELLIGENCE AND COMPUTING, 2015, : 1998 - 2002
  • [4] A Distributed Cloud Resource Management Framework for High-Performance Computing (HPC) Applications
    Govindarajan, Kannan
    Kumar, Vivekanandan Suresh
    Somasundaram, Thamarai Selvi
    2016 EIGHTH INTERNATIONAL CONFERENCE ON ADVANCED COMPUTING (ICOAC), 2017, : 1 - 6
  • [5] SCinet: Testbed for high-performance networked applications
    Kramer, WTC
    COMPUTER, 2002, 35 (06) : 47 - +
  • [6] A proactive resource allocation method based on adaptive prediction of resource requests in cloud computing
    Chen, Jing
    Wang, Yinglong
    Liu, Tao
    EURASIP JOURNAL ON WIRELESS COMMUNICATIONS AND NETWORKING, 2021, 2021 (01)
  • [7] A proactive resource allocation method based on adaptive prediction of resource requests in cloud computing
    Jing Chen
    Yinglong Wang
    Tao Liu
    EURASIP Journal on Wireless Communications and Networking, 2021
  • [8] AI model auditing scheme towards cloud-edge high-performance computing
    Li, Yi
    Zheng, Wenying
    Ji, Sai
    JOURNAL OF SUPERCOMPUTING, 2025, 81 (04):
  • [9] Serverless High-Performance Computing over Cloud
    Petrosyan, Davit
    Astsatryan, Hrachya
    CYBERNETICS AND INFORMATION TECHNOLOGIES, 2022, 22 (03) : 82 - 92
  • [10] Confidential High-Performance Computing in the Public Cloud
    Chen, Keke
    IEEE INTERNET COMPUTING, 2023, 27 (01) : 24 - 32