An Efficient Self-Evolution Method of Autonomous Driving for Any Given Algorithm

被引:11
|
作者
Huang, Yanjun [1 ,2 ]
Yang, Shuo [1 ,3 ]
Wang, Liwen [1 ]
Yuan, Kang [3 ,4 ]
Zheng, Hongyu [5 ]
Chen, Hong [4 ]
机构
[1] Tongji Univ, Sch Automot Studies, Shanghai 201804, Peoples R China
[2] Frontiers Sci Ctr Intelligent Autonomous Syst, Shanghai 200120, Peoples R China
[3] Tongji Univ, Shanghai Inst Intelligent Sci & Technol, Shanghai 201804, Peoples R China
[4] Tongji Univ, Coll Elect & Informat Engn, Shanghai 201804, Peoples R China
[5] Jilin Univ, State Key Lab Automot Simulat & Control, Changchun 130022, Peoples R China
关键词
Autonomous driving; reinforcement learning; policy improvement;
D O I
10.1109/TITS.2023.3307873
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
Autonomous vehicles are expected to achieve self-evolution in the real-world environment to gradually cover more complex and changing scenarios. Reinforcement learning focuses on how agents act in the environment to maximize the cumulative reward, with a great potential to achieve self-evolution ability. However, most of reinforcement learning algorithms suffer from a low sample efficiency, which greatly limits their application in autonomous driving. This paper presents an efficient self-evolution method for any given algorithm based on the combination of Soft Actor Critic (SAC) and Behavioral Cloning(BC). First, the states of the sample trajectory in the replay buffer are separated and input into the given algorithm (algorithm with fundamental performance) to get the output label of actions such that the SAC algorithm can be guided using BC to achieve fast iteration in the direction of optimization with existing basic performance. Then, the value iteration algorithm is combined to achieve the proportion allocation of mixed gradient feedback, in order to trade off exploitation and exploration. In addition, the proposed methodology is evaluated in simulation environment taking automated speed control as an example. Experiment results show that compared with SAC algorithm, the proposed method can realize more than three times of convergence efficiency improvement, while without destroying the exploration enhancement advantage of reinforcement learning algorithm, that is, the performance is improved by 20% compared with the given algorithm (Intelligent Driver Model, IDM). The proposed method can easily extended to improve any given model no matter it is model-based or learning-based algorithm.
引用
收藏
页码:602 / 612
页数:11
相关论文
共 50 条
  • [21] Autonomous Traffic Sign Detection for Self-Driving Car System Using Convolutional Neural Network Algorithm
    Yu, Zhao
    Ye, Ting
    JOURNAL OF OPTICS-INDIA, 2024, 53 (04): : 3359 - 3369
  • [22] Autonomous traffic sign detection for self-driving car system using convolutional neural network algorithm
    Yu, Zhao
    Ye, Ting
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2024, 46 (03) : 5975 - 5984
  • [23] A novel differential evolution algorithm with a self-adaptation parameter control method by differential evolution
    Laizhong Cui
    Genghui Li
    Zexuan Zhu
    Zhenkun Wen
    Nan Lu
    Jian Lu
    Soft Computing, 2018, 22 : 6171 - 6190
  • [24] A novel differential evolution algorithm with a self-adaptation parameter control method by differential evolution
    Cui, Laizhong
    Li, Genghui
    Zhu, Zexuan
    Wen, Zhenkun
    Lu, Nan
    Lu, Jian
    SOFT COMPUTING, 2018, 22 (18) : 6171 - 6190
  • [25] SA-YOLOv3: An Efficient and Accurate Object Detector Using Self-Attention Mechanism for Autonomous Driving
    Tian, Daxin
    Lin, Chunmian
    Zhou, Jianshan
    Duan, Xuting
    Cao, Yue
    Zhao, Dezong
    Cao, Dongpu
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (05) : 4099 - 4110
  • [26] Efficient Self-Driving Photoelectrocatalytic Reactor for Synergistic Water Purification and H2 Evolution
    Lian, Zichao
    Tao, Ying
    Liu, Yunni
    Zhang, Yang
    Zhu, Qiong
    Li, Guisheng
    Li, Hexing
    ACS APPLIED MATERIALS & INTERFACES, 2020, 12 (40) : 44731 - 44742
  • [27] Efficient Automated Data Augmentation Algorithm Based on Self-guided Evolution Strategy
    Zhu G.-H.
    Chen W.-Z.
    Zhu Z.-N.
    Yuan C.-F.
    Huang Y.-H.
    Ruan Jian Xue Bao/Journal of Software, 2024, 35 (06): : 3013 - 3035
  • [28] An efficient LiDAR-based localization method for self-driving cars in dynamic environments
    Zhang, Yihuan
    Wang, Liang
    Jiang, Xuhui
    Zeng, Yong
    Dai, Yifan
    ROBOTICA, 2022, 40 (01) : 38 - 55
  • [29] Autonomous Driving Decision Algorithm for Complex Multi-Vehicle Interactions: An Efficient Approach Based on Global Sorting and Local Gaming
    Li, Daofei
    Zhang, Jiajie
    Liu, Guanming
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2024, 25 (07) : 6580 - 6593
  • [30] End-to-End Autonomous Driving Decision Method Based on Improved TD3 Algorithm in Complex Scenarios
    Xu, Tao
    Meng, Zhiwei
    Lu, Weike
    Tong, Zhongwen
    SENSORS, 2024, 24 (15)