Complex Network Cognition-Based Federated Reinforcement Learning for End-to-End Urban Autonomous Driving

被引：0

作者：

Cai, Yingfeng ^{[1
]}

Lu, Sikai ^{[1
]}

Wang, Hai ^{[2
,3
]}

Lian, Yubo ^{[4
]}

Chen, Long ^{[1
]}

Liu, Qingchao ^{[1
]}

机构：

[1] Jiangsu Univ, Automot Engn Res Inst, Zhenjiang 212013, Peoples R China

[2] Jiangsu Univ, Sch Automot & Traff Engn, Zhenjiang 212013, Peoples R China

[3] Jiangsu Univ, Zhenjiang City Jiangsu Univ Engn Technol, Res Inst, Zhenjiang 212013, Peoples R China

[4] BYD Auto Ind Co Ltd, Shenzhen 518116, Peoples R China

来源：

IEEE TRANSACTIONS ON TRANSPORTATION ELECTRIFICATION | 2024年 / 10卷 / 03期

基金：

中国国家自然科学基金;

关键词：

Cognition; Vehicle dynamics; Training; Safety; Heuristic algorithms; Complex networks; Transportation; Autonomous driving (AD); complex network; deep reinforcement learning (DRL); end-to-end; federated learning (FL);

D O I：

10.1109/TTE.2023.3332345

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

Compared to the modularized rule-based framework, end-to-end deep reinforcement learning (DRL) algorithms have demonstrated greater adaptability in autonomous driving (AD) scenarios. However, DRL algorithms often face challenges related to model convergence and sample dependence, which limit their applicability to complex driving tasks and lack interpretability. To address these limitations, we present a novel hybrid algorithm framework called federated learning (FL)-based distributed proximal policy optimization (FLDPPO). This framework combines modularized rule-based complex network cognition and end-to-end DRL to realize the fusion driving of the mechanism model and data. Our algorithm generates dynamic driving recommendations that guide agent learning rules, enabling the model to handle complex driving environments. In addition, FLDPPO addresses model robustness and sample dependence issues through a model confidence-based distributed multiagent aggregation architecture. By measuring model confidence, the architecture learns to effectively aggregate knowledge from each unique experience distribution. Simulation results show that the proposed FLDPPO algorithm achieves competitive performance on various benchmarks.

引用

页码：7513 / 7525

页数：13

共 50 条

[41] End-to-End Learning of Behavioural Inputs for Autonomous Driving in Dense Traffic
Shrestha, Jatan
Idoko, Simon
Sharma, Basant
Singh, Arun Kumar
2023 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2023, : 10020 - 10027
[42] Multi-task Learning with Attention for End-to-end Autonomous Driving
Ishihara, Keishi
Kanervisto, Anssi
Miura, Jun
Hautamaki, Ville
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2021, 2021, : 2896 - 2905
[43] End-to-End Autonomous Exploration with Deep Reinforcement Learning and Intrinsic Motivation
Ruan, Xiaogang
Li, Peng
Zhu, Xiaoqing
Yu, Hejie
Yu, Naigong
Computational Intelligence and Neuroscience, 2021, 2021
[44] End-to-End Autonomous Exploration with Deep Reinforcement Learning and Intrinsic Motivation
Ruan, Xiaogang
Li, Peng
Zhu, Xiaoqing
Yu, Hejie
Yu, Naigong
COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2021, 2021
[45] End-to-End Driving in a Realistic Racing Game with Deep Reinforcement Learning
Perot, Etienne
Jaritz, Maximilian
Toromanoff, Marin
de Charette, Raoul
2017 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2017, : 474 - 475
[46] End-to-end Autonomous Driving Perception with Sequential Latent Representation Learning
Chen, Jianyu
Xu, Zhuo
Tomizuka, Masayoshi
2020 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2020, : 1999 - 2006
[47] Learning End-to-end Autonomous Driving using Guided Auxiliary Supervision
Mehta, Ashish
Subramanian, Adithya
Subramanian, Anbumani
ELEVENTH INDIAN CONFERENCE ON COMPUTER VISION, GRAPHICS AND IMAGE PROCESSING (ICVGIP 2018), 2018,
[48] Agile Autonomous Driving using End-to-End Deep Imitation Learning
Pan, Yunpeng
Cheng, Ching-An
Saigol, Kamil
Lee, Keuntaek
Yan, Xinyan
Theodorou, Evangelos A.
Boots, Byron
ROBOTICS: SCIENCE AND SYSTEMS XIV, 2018,
[49] End-to-end deep learning for reverse driving trajectory of autonomous bulldozer
You, Ke
Ding, Lieyun
Jiang, Yutian
Wu, Zhangang
Zhou, Cheng
KNOWLEDGE-BASED SYSTEMS, 2022, 252
[50] Latency Equalization Policy of End-to-End Network Slicing Based on Reinforcement Learning
Bai, Haonan
Zhang, Yong
Zhang, Zhenyu
Yuan, Siyu
IEEE TRANSACTIONS ON NETWORK AND SERVICE MANAGEMENT, 2023, 20 (01): : 88 - 103

← 1 2 3 4 5 →