DRL-Based Joint Resource Allocation and Platoon Control Optimization for UAV-Hosted Platoon Digital Twin

被引:0
|
作者
Wang, Lei [1 ,2 ]
Liang, Hongbin [1 ,2 ]
Tang, Yanmei [3 ]
Mao, Guotao [1 ,2 ]
Zhang, Han [1 ,2 ]
Zhao, Dongmei [4 ]
机构
[1] Southwest Jiaotong Univ, Sch Transportat & Logist, Natl United Engn Lab Integrated & Intelligent Tran, Chengdu 611756, Peoples R China
[2] Southwest Jiaotong Univ, Natl Engn Lab Integrated Transportat Big Data Appl, Chengdu 611756, Peoples R China
[3] Southwest Jiaotong Univ, Informatizat & Network Management Off, Chengdu 611756, Peoples R China
[4] McMaster Univ, Dept Elect & Comp Engn, Hamilton, ON L8S 4K1, Canada
来源
IEEE INTERNET OF THINGS JOURNAL | 2024年 / 11卷 / 22期
基金
中国国家自然科学基金;
关键词
Resource management; Vehicle dynamics; Autonomous aerial vehicles; Optimization; Numerical stability; Synchronization; Perturbation methods; Age of Information (AoI); deep reinforcement learning (DRL); digital twin (DT); platoon control; platoon; resource allocation; DELAY;
D O I
10.1109/JIOT.2024.3439576
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Digital twin (DT)-empowered platoon can improve platoon management efficiency and driving safety. However, the resource allocation scheme of low-latency platoon DT (PDT) and the interactions with platoon control strategy are important issues in the study of PDTs. In this article, we study the resource allocation in the PDT network and the interaction mechanism between PDT and platoon control for an unmanned aerial vehicle (UAV)-hosted PDT. We introduce the Age of Information (AoI) metrics to characterize the freshness of the DTs. To explore the impact of the PDT resource allocation scheme on the platoon control strategy, we propose a joint optimization model for power resource allocation and platoon control. Specifically, the allocation of power resources affects the PDT's AoI, and the high-latency PDT in turn affects the platoon control strategy. Our objective is minimize the weighted sum of the system's average energy consumption and the PDT's average peak AoI. To solve the problem, we first reformulate the power resource allocation problem over a period of time as a Markov decision process (MDP) model, and then propose the Dirichlet deep deterministic policy gradient (DDPG)-based power allocation (D3PGPA) method based on Dirichlet distribution and DDPG algorithm. The method can not only effectively explores the state space while satisfying the constraints of limited resources but also improve the stability of the algorithm. Numerical results show that the D3PGPA method can host a PDT with low AoI and improve the stability of the platoon. Besides, our proposed method performs stably and outperforms other benchmark methods.
引用
收藏
页码:37114 / 37126
页数:13
相关论文
共 35 条
  • [1] A Dynamic Resource Allocation Model Based on SMDP and DRL Algorithm for Truck Platoon in Vehicle Network
    Liang, Hongbin
    Zhou, Shuya
    Liu, Xiaobo
    Zheng, Fangfang
    Hong, Xintao
    Zhou, Xuemei
    Zhao, Lian
    IEEE INTERNET OF THINGS JOURNAL, 2022, 9 (12) : 10295 - 10305
  • [2] DRL-based Resource Allocation Optimization for Computation Offloading in Mobile Edge Computing
    Wu, Guowen
    Zhao, Yuhan
    Shen, Yizhou
    Zhang, Hong
    Shen, Shigen
    Yu, Shui
    IEEE INFOCOM 2022 - IEEE CONFERENCE ON COMPUTER COMMUNICATIONS WORKSHOPS (INFOCOM WKSHPS), 2022,
  • [3] Joint Optimization of Platoon Control and Resource Scheduling in Cooperative Vehicle-Infrastructure System
    Zhang, Peiyu
    Tian, Daxin
    Zhou, Jianshan
    Duan, Xuting
    Sheng, Zhengguo
    Zhao, Dezong
    Cao, Dongpu
    IEEE TRANSACTIONS ON INTELLIGENT VEHICLES, 2023, 8 (06): : 3629 - 3646
  • [4] Joint Optimization of Trajectory Control, Resource Allocation, and User Association Based on DRL for Multi-Fixed-Wing UAV Networks
    Yin, Baolin
    Fang, Xuming
    Wang, Xianbin
    IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2024, 23 (10) : 13330 - 13343
  • [5] DRL-based admission control and resource allocation for 5G network slicing
    Chakraborty, Saurav
    Sivalingam, Krishna M.
    SADHANA-ACADEMY PROCEEDINGS IN ENGINEERING SCIENCES, 2023, 48 (03):
  • [6] DRL-based admission control and resource allocation for 5G network slicing
    Saurav Chakraborty
    Krishna M Sivalingam
    Sādhanā, 48
  • [7] Resource Allocation for Dynamic Platoon Digital Twin Networks: A Multi-Agent Deep Reinforcement Learning Method
    Wang, Lei
    Liang, Hongbin
    Mao, Guotao
    Zhao, Dongmei
    Liu, Qian
    Yao, Yiting
    Zhang, Han
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2024, 73 (10) : 15609 - 15620
  • [8] Multi-UAV Assisted Air-Ground Collaborative MEC System: DRL-Based Joint Task Offloading and Resource Allocation and 3D UAV Trajectory Optimization
    Wang, Mingjun
    Li, Ruishan
    Jing, Feng
    Gao, Mei
    DRONES, 2024, 8 (09)
  • [9] Joint UAV Placement Optimization, Resource Allocation, and Computation Offloading for THz Band: A DRL Approach
    Wang, Heng
    Zhang, Haijun
    Liu, Xiangnan
    Long, Keping
    Nallanathan, Arumugam
    IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2023, 22 (07) : 4890 - 4900
  • [10] A Digital Twin and Consensus Empowered Cooperative Control Framework for Platoon-Based Autonomous Driving
    Cao, Jiayu
    Leng, Supeng
    Xiong, Kai
    Chen, Xiaosha
    TSINGHUA SCIENCE AND TECHNOLOGY, 2025, 30 (03): : 1096 - 1111