Stateless Q-learning algorithm for service caching in resource constrained edge environment

Cited by: 2

Authors:

Huang, Binbin [1]

Ran, Ziqi [1]

Yu, Dongjin [1]

Xiang, Yuanyuan [1]

Shi, Xiaoying [1]

Li, Zhongjin [1]

Xu, Zhengqian [1]

Affiliations:

[1] Hangzhou Dianzi Univ, Sch Comp, Hangzhou 310018, Peoples R China

Source:

JOURNAL OF CLOUD COMPUTING-ADVANCES SYSTEMS AND APPLICATIONS | 2023 / Vol. 12 / Issue 01

Funding:

National Natural Science Foundation of China; National Science Foundation (USA);

Keywords:

Edge environment; service caching; Stateless Q-learning; Collaboration cost; Service latency;

DOI:

10.1186/s13677-023-00506-7

Chinese Library Classification (CLC) number:

TP [Automation technology, computer technology];

Discipline classification code:

0812;

Abstract:

In a resource-constrained edge environment, multiple service providers can compete to rent the limited resources of edge servers close to end users and cache their service instances there, thereby significantly reducing service delay and improving quality of service (QoS). However, service providers that rent the resources of different edge servers to deploy their service instances can incur different resource usage costs and service delays. To make full use of the limited resources of the edge servers and further reduce resource usage costs, multiple service providers on an edge server can form a coalition and share that server's limited resources. In this paper, we investigate the service caching problem of multiple service providers in a resource-constrained edge environment and propose an independent learners-based service caching scheme (ILSCS), which adopts stateless Q-learning to learn an optimal service caching scheme. To verify the effectiveness of ILSCS, we implement four baseline algorithms (COALITION, RANDOM, MDU, and MCS) and compare the total collaboration cost and service latency of ILSCS with those of the four baselines under different experimental parameter settings. The extensive experimental results show that ILSCS can achieve lower total collaboration cost and service latency.
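
The abstract names the technique but not its details, so the following is only a minimal sketch of what stateless Q-learning with independent learners could look like. It assumes each provider keeps its own Q-vector over edge servers and applies the standard stateless update Q(a) <- Q(a) + alpha * (r - Q(a)). The reward (negative shared rental cost plus latency), the names `rental_cost`, `latency`, `train`, and all parameter values are hypothetical illustrations, not the paper's actual cost model or algorithm.

```python
import random


def stateless_q_update(q, action, reward, alpha=0.1):
    """Stateless Q-learning update: Q(a) <- Q(a) + alpha * (r - Q(a))."""
    q[action] += alpha * (reward - q[action])


def choose_action(q, epsilon, rng):
    """Epsilon-greedy choice over a single Q-vector (there is no state)."""
    if rng.random() < epsilon:
        return rng.randrange(len(q))
    return max(range(len(q)), key=lambda a: q[a])


def train(rental_cost, latency, episodes=2000, alpha=0.1, epsilon=0.1, seed=0):
    """Each provider independently learns which edge server to cache on.

    rental_cost[s]: hypothetical cost of server s, split evenly among the
                    providers that picked s in the same episode (the coalition)
    latency[p][s]:  hypothetical latency of provider p when cached on server s
    """
    rng = random.Random(seed)
    n_providers, n_servers = len(latency), len(rental_cost)
    # One independent Q-vector per provider; no provider sees another's table.
    q_tables = [[0.0] * n_servers for _ in range(n_providers)]
    for _ in range(episodes):
        actions = [choose_action(q, epsilon, rng) for q in q_tables]
        for p, s in enumerate(actions):
            sharers = actions.count(s)  # size of the coalition on server s
            # Assumed reward: lower shared cost and lower latency are better.
            reward = -(rental_cost[s] / sharers + latency[p][s])
            stateless_q_update(q_tables[p], s, reward, alpha)
    return q_tables
```

Calling `train([1.0, 10.0], [[1.0, 1.0]])` runs a single provider against two servers and returns its learned Q-vector; the greedy caching decision is the index of the largest entry. Server capacity limits, which the abstract implies, are deliberately left out of this sketch.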

Pages: 13

