A Multi-Task Fusion Strategy-Based Decision-Making and Planning Method for Autonomous Driving Vehicles

被引:3
|
作者
Liu, Weiguo [1 ,2 ]
Xiang, Zhiyu [1 ]
Fang, Han [3 ]
Huo, Ke [2 ]
Wang, Zixu [2 ]
机构
[1] Zhejiang Univ, Informat Sci & Elect Engn, Hangzhou 310027, Peoples R China
[2] Natl Innovat Ctr Intelligent & Connected Vehicles, Beijing 100176, Peoples R China
[3] Wuhan Hudiandian Technol Co Ltd, Wuhan 430000, Peoples R China
关键词
deep reinforcement learning; decision-making planning; multi-task fusion; DDPG; simulation platform; end-to-end; VTD;
D O I
10.3390/s23167021
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
The autonomous driving technology based on deep reinforcement learning (DRL) has been confirmed as one of the most cutting-edge research fields worldwide. The agent is enabled to achieve the goal of making independent decisions by interacting with the environment and learning driving strategies based on the feedback from the environment. This technology has been widely used in end-to-end driving tasks. However, this field faces several challenges. First, developing real vehicles is expensive, time-consuming, and risky. To further expedite the testing, verification, and iteration of end-to-end deep reinforcement learning algorithms, a joint simulation development and validation platform was designed and implemented in this study based on VTD-CarSim and the Tensorflow deep learning framework, and research work was conducted based on this platform. Second, sparse reward signals can cause problems (e.g., a low-sample learning rate). It is imperative for the agent to be capable of navigating in an unfamiliar environment and driving safely under a wide variety of weather or lighting conditions. To address the problem of poor generalization ability of the agent to unknown scenarios, a deep deterministic policy gradient (DDPG) decision-making and planning method was proposed in this study in accordance with a multi-task fusion strategy. The main task based on DRL decision-making planning and the auxiliary task based on image semantic segmentation were cross-fused, and part of the network was shared with the main task to reduce the possibility of model overfitting and improve the generalization ability. As indicated by the experimental results, first, the joint simulation development and validation platform built in this study exhibited prominent versatility. Users were enabled to easily substitute any default module with customized algorithms and verify the effectiveness of new functions in enhancing overall performance using other default modules of the platform. Second, the deep reinforcement learning strategy based on multi-task fusion proposed in this study was competitive. Its performance was better than other DRL algorithms in certain tasks, which improved the generalization ability of the vehicle decision-making planning algorithm.
引用
收藏
页数:22
相关论文
共 50 条
  • [1] A Decision Control Method for Autonomous Driving Based on Multi-Task Reinforcement Learning
    Cai, Yingfeng
    Yang, Shaoqing
    Wang, Hai
    Teng, Chenglong
    Chen, Long
    IEEE ACCESS, 2021, 9 (09): : 154553 - 154562
  • [2] Planning and Decision-Making for Autonomous Vehicles
    Schwarting, Wilko
    Alonso-Mora, Javier
    Rus, Daniela
    ANNUAL REVIEW OF CONTROL, ROBOTICS, AND AUTONOMOUS SYSTEMS, VOL 1, 2018, 1 : 187 - 210
  • [3] Interactive Decision-making and Planning for Autonomous Driving vehicles in Unsignalized Intersection
    Xu C.
    Zhao W.
    Li L.
    Zhang R.
    Wang C.
    Chen F.
    Jixie Gongcheng Xuebao/Journal of Mechanical Engineering, 2023, 59 (14): : 202 - 212
  • [4] MTD-GPT: A Multi-Task Decision-Making GPT Model for Autonomous Driving at Unsignalized Intersections
    Liu, Jiaqi
    Hang, Peng
    Qi, Xiao
    Wang, Jianqiang
    Sun, Jian
    2023 IEEE 26TH INTERNATIONAL CONFERENCE ON INTELLIGENT TRANSPORTATION SYSTEMS, ITSC, 2023, : 5154 - 5161
  • [5] Decision-Making and Planning Method for Autonomous Vehicles Based on Motivation and Risk Assessment
    Wang, Yisong
    Wang, Chunyan
    Zhao, Wanzhong
    Xu, Can
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2021, 70 (01) : 107 - 120
  • [6] Multi-task perception algorithm of autonomous driving based on temporal fusion
    Liu Z.-W.
    Fan S.-H.
    Qi M.-Y.
    Dong M.
    Wang P.
    Zhao X.-M.
    Jiaotong Yunshu Gongcheng Xuebao/Journal of Traffic and Transportation Engineering, 2021, 21 (04): : 223 - 234
  • [7] An actor-critic based learning method for decision-making and planning of autonomous vehicles
    XU Can
    ZHAO WanZhong
    CHEN QingYun
    WANG ChunYan
    Science China(Technological Sciences), 2021, 64 (05) : 984 - 994
  • [8] An actor-critic based learning method for decision-making and planning of autonomous vehicles
    Xu Can
    Zhao WanZhong
    Chen QingYun
    Wang ChunYan
    SCIENCE CHINA-TECHNOLOGICAL SCIENCES, 2021, 64 (05) : 984 - 994
  • [9] An actor-critic based learning method for decision-making and planning of autonomous vehicles
    XU Can
    ZHAO WanZhong
    CHEN QingYun
    WANG ChunYan
    Science China(Technological Sciences), 2021, (05) : 984 - 994
  • [10] An actor-critic based learning method for decision-making and planning of autonomous vehicles
    Can Xu
    WanZhong Zhao
    QingYun Chen
    ChunYan Wang
    Science China Technological Sciences, 2021, 64 : 984 - 994