Stateless Q-learning algorithm for service caching in resource constrained edge environment

被引:2
|
作者
Huang, Binbin [1 ]
Ran, Ziqi [1 ]
Yu, Dongjin [1 ]
Xiang, Yuanyuan [1 ]
Shi, Xiaoying [1 ]
Li, Zhongjin [1 ]
Xu, Zhengqian [1 ]
机构
[1] Hangzhou Dianzi Univ, Sch Comp, Hangzhou 310018, Peoples R China
基金
中国国家自然科学基金; 美国国家科学基金会;
关键词
Edge environment; service caching; Stateless Q-learning; Collaboration cost; Service latency;
D O I
10.1186/s13677-023-00506-7
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In resource constrained edge environment, multiple service providers can compete to rent the limited resources to cache their service instances on edge servers close to end users, thereby significantly reducing the service delay and improving quality of service (QoS). However, service providers renting the resources of different edge servers to deploy their service instances can incur different resource usage costs and service delay. To make full use of the limited resources of the edge servers to further reduce resource usage costs, multiple service providers on an edge server can form a coalition and share the limited resource of an edge server. In this paper, we investigate the service caching problem of multiple service providers in resource constrained edge environment, and propose an independent learners-based services caching scheme (ILSCS) which adopts a stateless Q-learning to learn an optimal service caching scheme. To verify the effectiveness of ILSCS scheme, we implement COALITION, RANDOM, MDU, and MCS four baseline algorithms, and compare the total collaboration cost and service latency of ILSCS scheme with these of these four baseline algorithms under different experimental parameter settings. The extensive experimental results show that the ILSCS scheme can achieve lower total collaboration cost and service latency.
引用
收藏
页数:13
相关论文
共 50 条
  • [31] A novel Q-Learning Algorithm Based on the Stochastic Environment Path Planning Problem
    Jian, Li
    Rong, Fei
    Yu, Tang
    2020 IEEE 19TH INTERNATIONAL CONFERENCE ON TRUST, SECURITY AND PRIVACY IN COMPUTING AND COMMUNICATIONS (TRUSTCOM 2020), 2020, : 1977 - 1982
  • [32] Fuzzy Q-learning obstacle avoidance algorithm of humanoid robot in unknown environment
    Wen, Shuhuan
    Chen, Jianhua
    Li, Zhen
    Rad, Ahmad B.
    Othman, Kamal Mohammed
    2018 37TH CHINESE CONTROL CONFERENCE (CCC), 2018, : 5186 - 5190
  • [33] Q-Learning based Edge Caching Optimization for D2D Enabled Hierarchical Wireless Networks
    Wang, Chenyang
    Wang, Shanjia
    Li, Ding
    Wang, Xiaofei
    Li, Xiuhua
    Leung, Victor C. M.
    2018 IEEE 15TH INTERNATIONAL CONFERENCE ON MOBILE AD HOC AND SENSOR SYSTEMS (MASS), 2018, : 55 - 63
  • [34] Q-learning Approach in the Context of Virtual Learning Environment
    Liviu, Ionita
    Irina, Tudor
    PROCEEDINGS OF THE 3RD INTERNATIONAL CONFERENCE ON VIRTUAL LEARNING, 2008, : 209 - 214
  • [35] Investigation of Q-Learning in the Context of a Virtual Learning Environment
    Baziukaite, Dalia
    INFORMATICS IN EDUCATION, 2007, 6 (02): : 255 - 268
  • [36] Cold-start aware cloud-native service function chain caching in resource-constrained edge: A reinforcement learning approach
    Zhang, Jiayin
    Yu, Huiqun
    Fan, Guisheng
    Li, Zengpeng
    COMPUTER COMMUNICATIONS, 2022, 195 : 334 - 345
  • [37] Q-learning based hyper-heuristic algorithm for solving multi-mode resource-constrained project scheduling problem
    Cui J.
    Lyu Y.
    Xu Z.
    Jisuanji Jicheng Zhizao Xitong/Computer Integrated Manufacturing Systems, CIMS, 2022, 28 (05): : 1472 - 1481
  • [38] Q-Learning-based Edge Node Resource Allocation Algorithm in the Environment of Power Distribution Internet of Things
    Chen, Xi
    Xin, Rui
    He, Yue
    Zhang, Bo
    Lin, Peng
    IWCMC 2021: 2021 17TH INTERNATIONAL WIRELESS COMMUNICATIONS & MOBILE COMPUTING CONFERENCE (IWCMC), 2021, : 446 - 450
  • [39] Modification of Q-learning to Adapt to the Randomness of Environment
    Luo, Xiulian
    Gao, Youbing
    Huang, Shao
    Zhao, Yaodong
    Zhang, Shengmiao
    ICCAIS 2019: THE 8TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND INFORMATION SCIENCES, 2019,
  • [40] Q-learning with Experience Replay in a Dynamic Environment
    Pieters, Mathijs
    Wiering, Marco A.
    PROCEEDINGS OF 2016 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (SSCI), 2016,