Complex Network Cognition-Based Federated Reinforcement Learning for End-to-End Urban Autonomous Driving

被引:0
|
作者
Cai, Yingfeng [1 ]
Lu, Sikai [1 ]
Wang, Hai [2 ,3 ]
Lian, Yubo [4 ]
Chen, Long [1 ]
Liu, Qingchao [1 ]
机构
[1] Jiangsu Univ, Automot Engn Res Inst, Zhenjiang 212013, Peoples R China
[2] Jiangsu Univ, Sch Automot & Traff Engn, Zhenjiang 212013, Peoples R China
[3] Jiangsu Univ, Zhenjiang City Jiangsu Univ Engn Technol, Res Inst, Zhenjiang 212013, Peoples R China
[4] BYD Auto Ind Co Ltd, Shenzhen 518116, Peoples R China
基金
中国国家自然科学基金;
关键词
Cognition; Vehicle dynamics; Training; Safety; Heuristic algorithms; Complex networks; Transportation; Autonomous driving (AD); complex network; deep reinforcement learning (DRL); end-to-end; federated learning (FL);
D O I
10.1109/TTE.2023.3332345
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Compared to the modularized rule-based framework, end-to-end deep reinforcement learning (DRL) algorithms have demonstrated greater adaptability in autonomous driving (AD) scenarios. However, DRL algorithms often face challenges related to model convergence and sample dependence, which limit their applicability to complex driving tasks and lack interpretability. To address these limitations, we present a novel hybrid algorithm framework called federated learning (FL)-based distributed proximal policy optimization (FLDPPO). This framework combines modularized rule-based complex network cognition and end-to-end DRL to realize the fusion driving of the mechanism model and data. Our algorithm generates dynamic driving recommendations that guide agent learning rules, enabling the model to handle complex driving environments. In addition, FLDPPO addresses model robustness and sample dependence issues through a model confidence-based distributed multiagent aggregation architecture. By measuring model confidence, the architecture learns to effectively aggregate knowledge from each unique experience distribution. Simulation results show that the proposed FLDPPO algorithm achieves competitive performance on various benchmarks.
引用
收藏
页码:7513 / 7525
页数:13
相关论文
共 50 条
  • [41] End-to-End Learning of Behavioural Inputs for Autonomous Driving in Dense Traffic
    Shrestha, Jatan
    Idoko, Simon
    Sharma, Basant
    Singh, Arun Kumar
    2023 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2023, : 10020 - 10027
  • [42] Multi-task Learning with Attention for End-to-end Autonomous Driving
    Ishihara, Keishi
    Kanervisto, Anssi
    Miura, Jun
    Hautamaki, Ville
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2021, 2021, : 2896 - 2905
  • [43] End-to-End Autonomous Exploration with Deep Reinforcement Learning and Intrinsic Motivation
    Ruan, Xiaogang
    Li, Peng
    Zhu, Xiaoqing
    Yu, Hejie
    Yu, Naigong
    Computational Intelligence and Neuroscience, 2021, 2021
  • [44] End-to-End Autonomous Exploration with Deep Reinforcement Learning and Intrinsic Motivation
    Ruan, Xiaogang
    Li, Peng
    Zhu, Xiaoqing
    Yu, Hejie
    Yu, Naigong
    COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2021, 2021
  • [45] End-to-End Driving in a Realistic Racing Game with Deep Reinforcement Learning
    Perot, Etienne
    Jaritz, Maximilian
    Toromanoff, Marin
    de Charette, Raoul
    2017 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2017, : 474 - 475
  • [46] End-to-end Autonomous Driving Perception with Sequential Latent Representation Learning
    Chen, Jianyu
    Xu, Zhuo
    Tomizuka, Masayoshi
    2020 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2020, : 1999 - 2006
  • [47] Learning End-to-end Autonomous Driving using Guided Auxiliary Supervision
    Mehta, Ashish
    Subramanian, Adithya
    Subramanian, Anbumani
    ELEVENTH INDIAN CONFERENCE ON COMPUTER VISION, GRAPHICS AND IMAGE PROCESSING (ICVGIP 2018), 2018,
  • [48] Agile Autonomous Driving using End-to-End Deep Imitation Learning
    Pan, Yunpeng
    Cheng, Ching-An
    Saigol, Kamil
    Lee, Keuntaek
    Yan, Xinyan
    Theodorou, Evangelos A.
    Boots, Byron
    ROBOTICS: SCIENCE AND SYSTEMS XIV, 2018,
  • [49] End-to-end deep learning for reverse driving trajectory of autonomous bulldozer
    You, Ke
    Ding, Lieyun
    Jiang, Yutian
    Wu, Zhangang
    Zhou, Cheng
    KNOWLEDGE-BASED SYSTEMS, 2022, 252
  • [50] Latency Equalization Policy of End-to-End Network Slicing Based on Reinforcement Learning
    Bai, Haonan
    Zhang, Yong
    Zhang, Zhenyu
    Yuan, Siyu
    IEEE TRANSACTIONS ON NETWORK AND SERVICE MANAGEMENT, 2023, 20 (01): : 88 - 103