An Efficient Self-Evolution Method of Autonomous Driving for Any Given Algorithm

被引：11

作者：

Huang, Yanjun ^{[1
,2
]}

Yang, Shuo ^{[1
,3
]}

Wang, Liwen ^{[1
]}

Yuan, Kang ^{[3
,4
]}

Zheng, Hongyu ^{[5
]}

Chen, Hong ^{[4
]}

机构：

[1] Tongji Univ, Sch Automot Studies, Shanghai 201804, Peoples R China

[2] Frontiers Sci Ctr Intelligent Autonomous Syst, Shanghai 200120, Peoples R China

[3] Tongji Univ, Shanghai Inst Intelligent Sci & Technol, Shanghai 201804, Peoples R China

[4] Tongji Univ, Coll Elect & Informat Engn, Shanghai 201804, Peoples R China

[5] Jilin Univ, State Key Lab Automot Simulat & Control, Changchun 130022, Peoples R China

来源：

IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS | 2024年 / 25卷 / 01期

关键词：

Autonomous driving; reinforcement learning; policy improvement;

D O I：

10.1109/TITS.2023.3307873

中图分类号：

TU [建筑科学];

学科分类号：

0813 ;

摘要：

Autonomous vehicles are expected to achieve self-evolution in the real-world environment to gradually cover more complex and changing scenarios. Reinforcement learning focuses on how agents act in the environment to maximize the cumulative reward, with a great potential to achieve self-evolution ability. However, most of reinforcement learning algorithms suffer from a low sample efficiency, which greatly limits their application in autonomous driving. This paper presents an efficient self-evolution method for any given algorithm based on the combination of Soft Actor Critic (SAC) and Behavioral Cloning(BC). First, the states of the sample trajectory in the replay buffer are separated and input into the given algorithm (algorithm with fundamental performance) to get the output label of actions such that the SAC algorithm can be guided using BC to achieve fast iteration in the direction of optimization with existing basic performance. Then, the value iteration algorithm is combined to achieve the proportion allocation of mixed gradient feedback, in order to trade off exploitation and exploration. In addition, the proposed methodology is evaluated in simulation environment taking automated speed control as an example. Experiment results show that compared with SAC algorithm, the proposed method can realize more than three times of convergence efficiency improvement, while without destroying the exploration enhancement advantage of reinforcement learning algorithm, that is, the performance is improved by 20% compared with the given algorithm (Intelligent Driver Model, IDM). The proposed method can easily extended to improve any given model no matter it is model-based or learning-based algorithm.

引用

页码：602 / 612

页数：11

共 50 条

[1] A safe self-evolution algorithm for autonomous driving based on data-driven risk quantification model
Yang, Shuo
Li, Shizhen
Huang, Yanjun
Chen, Hong
ACCIDENT ANALYSIS AND PREVENTION, 2025, 214
[2] Self-evolution Scenarios for Simulation Tests of Autonomous Vehicles Based on Different Models of Driving Styles
Ma Y.-N.
Jiang W.
Wu J.-Y.
Chen J.-Y.
Li N.
Xu Z.-G.
Xiong L.
Zhongguo Gonglu Xuebao/China Journal of Highway and Transport, 2023, 36 (02): : 216 - 228
[3] How to Guarantee Driving Safety for Autonomous Vehicles in a Real-World Environment: A Perspective on Self-Evolution Mechanisms
Yang, Shuo
Huang, Yanjun
Li, Li
Feng, Shuo
Na, Xiaoxiang
Chen, Hong
Khajepour, Amir
IEEE INTELLIGENT TRANSPORTATION SYSTEMS MAGAZINE, 2024, 16 (02) : 41 - 54
[4] Efficient Local Coherent Structure Learning via Self-Evolution Bipartite Graph
Wang, Zheng
Li, Qi
Nie, Feiping
Wang, Rong
Wang, Fei
Li, Xuelong
IEEE TRANSACTIONS ON CYBERNETICS, 2024, 54 (08) : 4527 - 4538
[5] A Hybrid Knowledge-Data Model to Driving the Self-Evolution of Building Digital Twins
Hu Chenxi
Yang Qiliang
Xing Jianchun
Qin Xia
Li Suliang
Jia Haining
2022 34TH CHINESE CONTROL AND DECISION CONFERENCE, CCDC, 2022, : 1975 - 1980
[6] Efficient classification method for autonomous driving application
Jeong, P
Nedevschi, S
IMAGE ANALYSIS AND RECOGNITION, PT 1, PROCEEDINGS, 2004, 3211 : 228 - 235
[7] A Fair and Efficient Federated Learning Algorithm for Autonomous Driving
Tang, Xinlong
Zhang, Jiayi
Fu, Yuchuan
Li, Changle
Cheng, Nan
Yuan, Xiaoming
2023 IEEE 98TH VEHICULAR TECHNOLOGY CONFERENCE, VTC2023-FALL, 2023,
[8] A Multivariate Time Series Forecasting Algorithm Based on Self-Evolution and Pre-training
Wan C.
Li W.-Z.
Ding W.-X.
Zhang Z.-J.
Ye B.-L.
Lu S.-L.
Jisuanji Xuebao/Chinese Journal of Computers, 2022, 45 (03): : 513 - 525
[9] Self-evolution algorithm for Job-Shop knowledgeable manufacturing cell based on link constraint
Li, W.-C. (zeropoint@ujs.edu.cn), 1911, CIMS (18):
[10] Stable autonomous driving method using modified Otsu algorithm
Lee, DE
Yoo, SH
Kim, YB
INTERNATIONAL JOURNAL OF AUTOMOTIVE TECHNOLOGY, 2006, 7 (02) : 227 - 235

← 1 2 3 4 5 →