Multi-Agent Reinforcement Learning Enabled Spectrum Sharing for Vehicular Networks

被引:0
|
作者
Wang W.-N. [1 ]
Su J. [1 ]
Chen Y. [2 ]
Zhang J.-Z. [2 ]
Tang Z. [1 ]
机构
[1] School of Computer and Software, Nanjing University of Information Science and Technology, Jiangsu, Nanjing
[2] The 63rd Research Institute, National University of Defense Technology, Jiangsu, Nanjing
来源
关键词
Deep reinforcement learning; Distributed spectrum sharing; Multi agent; Vehicular network;
D O I
10.12263/DZXB.20220320
中图分类号
学科分类号
摘要
Aiming at the problem that it is difficult for base stations to collect and manage instantaneous channel state information in high dynamic vehicle networking environment, a spectrum allocation algorithm for vehicle networking based on multi-agent deep reinforcement learning is proposed. The algorithm aims to maximize the network throughput under the constraints of vehicle communication delay and reliability, and uses the learning algorithm to improve the spectrum and power allocation strategy. Firstly, the implicit cooperative agent is trained by improving DQN model and EXP3 strategy. Secondly, the nonstationary problem caused by multi-agent concurrent learning is solved by using hysteretic Q-learning and concurrent experience replay trajectory. The simulation results show that the average successful delivery rate of the payload of the proposed algorithm can reach 95.89%, which is 16.48% higher than the random baseline algorithm. It can quickly obtain the approximate optimal solution, and has significant advantages in reducing the signaling overhead of the Internet of vehicles communication system. © 2024 Chinese Institute of Electronics. All rights reserved.
引用
收藏
页码:1690 / 1699
页数:9
相关论文
共 24 条
  • [1] GYAWALI S, XU S, QIAN Y, Et al., Challenges and solutions for cellular based V2X communications, IEEE Communications Surveys & Tutorials, 23, 1, pp. 222-255, (2020)
  • [2] MNIH V, KAVUKCUOGLU K, SILVER D, Et al., Human-level control through deep reinforcement learning, Nature, 518, 7540, pp. 529-533, (2015)
  • [3] WANG S, LIU H, GOMES P H, Et al., Deep reinforcement learning for dynamic multichannel access in wireless networks, IEEE Transactions on Cognitive Communications and Networking, 4, 2, pp. 257-265, (2018)
  • [4] NAPARSTEK O, COHEN K., Deep multi-user reinforcement learning for distributed dynamic spectrum access, IEEE Transactions on Wireless Communications, 18, 1, pp. 310-323, (2018)
  • [5] YU Y, WANG T, LIEW S C., Deep-reinforcement learning multiple access for heterogeneous wireless networks, IEEE Journal on Selected Areas in Communications, 37, 6, pp. 1277-1290, (2019)
  • [6] YE H, LI G Y, JUANG B H F., Deep reinforcement learning based resource allocation for V2V communications, IEEE Transactions on Vehicular Technology, 68, 4, pp. 3163-3173, (2019)
  • [7] ZHANG X, PENG M, YAN S, Et al., Deep-reinforcement-learning-based mode selection and resource allocation for cellular V2X communications, IEEE Internet of Things Journal, 7, 7, pp. 6380-6391, (2019)
  • [8] HUANG Y F, PENG N H, LIN Y, Et al., Dynamic al location of spectrum resources of Internet of vehicles based on sac reinforcement learning, Computer Engineering, 47, 9, pp. 34-43, (2021)
  • [9] WANG L, YE H, LIANG L, Et al., Learn to compress CSI and allocate resources in vehicular networks, IEEE Transactions on Communications, 68, 6, pp. 3640-3653, (2020)
  • [10] XU X C, LIU K, LIU C H, Et al., Channel allocation method for vehicle edge computing based on potential game, Acta Electronica Sinica, 49, 5, pp. 851-860, (2021)