A Deep Reinforcement Learning Scheme for Spectrum Sensing and Resource Allocation in ITS

被引:1
|
作者
Wei, Huang [1 ]
Peng, Yuyang [1 ]
Yue, Ming [1 ]
Long, Jiale [2 ]
AL-Hazemi, Fawaz [3 ]
Mirza, Mohammad Meraj [4 ]
机构
[1] Macau Univ Sci & Technol, Sch Comp Sci & Engn, Macau 999078, Peoples R China
[2] Wuyi Univ, Fac Intelligent Mfg, Jiangmen 529020, Peoples R China
[3] Univ Jeddah, Dept Comp & Network Engn, Jeddah 21959, Saudi Arabia
[4] Taif Univ, Coll Comp & Informat Technol, Dept Comp Sci, POB 11099, Taif 21944, Saudi Arabia
关键词
deep reinforcement learning; vehicle to vehicle; vehicle to infrastructure; spectrum resource allocation;
D O I
10.3390/math11163437
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
In recent years, the Internet of Vehicles (IoV) has been found to be of huge potential value in the promotion of the development of intelligent transportation systems (ITSs) and smart cities. However, the traditional scheme in IoV has difficulty in dealing with an uncertain environment, while reinforcement learning has the advantage of being able to deal with an uncertain environment. Spectrum resource allocation in IoV faces the uncertain environment in most cases. Therefore, this paper investigates the spectrum resource allocation problem by deep reinforcement learning after using spectrum sensing technology in the ITS, including the vehicle-to-infrastructure (V2I) link and the vehicle-to-vehicle (V2V) link. The spectrum resource allocation is modeled as a reinforcement learning-based multi-agent problem which is solved by using the soft actor critic (SAC) algorithm. Considered an agent, each V2V link interacts with the vehicle environment and makes a joint action. After that, each agent receives different observations as well as the same reward, and updates networks through the experiences from the memory. Therefore, during a certain time, each V2V link can optimize its spectrum allocation scheme to maximize the V2I capacity as well as increase the V2V payload delivery transmission rate. However, the number of SAC networks increases linearly as the number of V2V links increases, which means that the networks may have a problem in terms of convergence when there are an excessive number of V2V links. Consequently, a new algorithm, namely parameter sharing soft actor critic (PSSAC), is proposed to reduce the complexity for which the model is easier to converge. The simulation results show that both SAC and PSSAC can improve the V2I capacity and increase the V2V payload transmission success probability within a certain time. Specifically, these novel schemes have a 10 percent performance improvement compared with the existing scheme in the vehicular environment. Additionally, PSSAC has a lower complexity.
引用
收藏
页数:15
相关论文
共 50 条
  • [1] Intelligent Spectrum Sensing and Resource Allocation in Cognitive Networks via Deep Reinforcement Learning
    Nguyen, Dinh C.
    Love, David J.
    Brinton, Christopher G.
    ICC 2023-IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS, 2023, : 4603 - 4608
  • [2] A deep reinforcement learning resource allocation strategy for integrated sensing, communication and computing
    Cai, Lili
    He, Jincan
    PHYSICAL COMMUNICATION, 2024, 64
  • [3] Resource Allocation Scheme Based on Deep Reinforcement Learning for Device-to-Device Communications
    Yu, Seoyoung
    Jeong, Yun Jae
    Lee, Jeong Woo
    35TH INTERNATIONAL CONFERENCE ON INFORMATION NETWORKING (ICOIN 2021), 2021, : 712 - 714
  • [4] Deep Reinforcement Learning for Resource Allocation in Business Processes
    Zbikowski, Kamil
    Ostapowicz, Michal
    Gawrysiak, Piotr
    PROCESS MINING WORKSHOPS, ICPM 2022, 2023, 468 : 177 - 189
  • [5] Deep Reinforcement Learning for Resource Allocation in Massive MIMO
    Chen, Liang
    Sun, Fanglei
    Li, Kai
    Chen, Ruiqing
    Yang, Yang
    Wang, Jun
    29TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO 2021), 2021, : 1611 - 1615
  • [6] Deep Reinforcement Learning Based Resource Allocation for LoRaWAN
    Li, Aohan
    2022 IEEE 96TH VEHICULAR TECHNOLOGY CONFERENCE (VTC2022-FALL), 2022,
  • [7] Computing resource allocation scheme of IOV using deep reinforcement learning in edge computing environment
    Yiwei Zhang
    Min Zhang
    Caixia Fan
    Fuqiang Li
    Baofang Li
    EURASIP Journal on Advances in Signal Processing, 2021
  • [8] Computing resource allocation scheme of IOV using deep reinforcement learning in edge computing environment
    Zhang, Yiwei
    Zhang, Min
    Fan, Caixia
    Li, Fuqiang
    Li, Baofang
    EURASIP JOURNAL ON ADVANCES IN SIGNAL PROCESSING, 2021, 2021 (01)
  • [9] Optimal Resource Allocation for Integrated Sensing and Communications in Internet of Vehicles: A Deep Reinforcement Learning Approach
    Liu, Congcong
    Xia, Minghua
    Zhao, Junhui
    Li, Huaicheng
    Gong, Yi
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2025, 74 (02) : 3028 - 3038
  • [10] Reinforcement learning-based hybrid spectrum resource allocation scheme for the high load of URLLC services
    Huang, Qian
    Xie, Xianzhong
    Cheriet, Mohamed
    EURASIP JOURNAL ON WIRELESS COMMUNICATIONS AND NETWORKING, 2020, 2020 (01)