Belief state separated reinforcement learning for autonomous vehicle decision making under uncertainty

被引:2
|
作者
Gu, Ziqing [1 ]
Yang, Yujie [1 ]
Duan, Jingliang [1 ]
Li, Shengbo Eben [1 ]
Chen, Jianyu [2 ]
Cao, Wenhan [1 ]
Zheng, Sifa [1 ]
机构
[1] Tsinghua Univ, Sch Vehicle & Mobil, State Key Lab Automot Safety & Energy, Beijing 100084, Peoples R China
[2] Tsinghua Univ, Inst Interdiscriplinary Informat Sci, Beijing, Peoples R China
基金
国家重点研发计划;
关键词
autonomous vehicle; Markov decision process; uncertain environment; partially observable;
D O I
10.1109/ITSC48978.2021.9564576
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In autonomous driving, the ego vehicle and its surrounding traffic environments always have uncertainties like parameter and structural errors, behavior randomness of road users, etc. Furthermore, environmental sensors are noisy or even biased. This problem can be formulated as a partially observable Markov decision process. Existing methods lack a good representation of historical information, making it very challenging to find an optimal policy. This paper proposes a belief state separated reinforcement learning (RL) algorithm for decision-making of autonomous driving in uncertain environments. We extend the separation principle from linear Gaussian systems to general nonlinear stochastic environments, where the belief state, defined as the posterior distribution of the true state, is found to be a sufficient statistic of historical information. This belief state is estimated by action-enhanced variational inference from historical information and is proved to satisfy the Markovian property, thus allowing us to obtain the optimal policy using traditional RL algorithms for Markov decision processes. The policy gradient of a task-specific prior model is mixed with that of the interaction data to improve learning performance. The proposed algorithm is evaluated in a multi-lane autonomous driving task, where the surrounding vehicles are subject to behavior uncertainty and observation noise. The simulation results show that compared with existing RL algorithms, the proposed method can achieve a higher average return with better driving performance.
引用
收藏
页码:586 / 592
页数:7
相关论文
共 50 条
  • [21] Uncertainty-based Decision Making Using Deep Reinforcement Learning
    Zhao, Xujiang
    Hu, Shu
    Cho, Jin-Hee
    Chen, Feng
    2019 22ND INTERNATIONAL CONFERENCE ON INFORMATION FUSION (FUSION 2019), 2019,
  • [22] Decision-making models on perceptual uncertainty with distributional reinforcement learning
    Xu, Shuyuan
    Liu, Qiao
    Hu, Yuhui
    Xu, Mengtian
    Hao, Jiachen
    GREEN ENERGY AND INTELLIGENT TRANSPORTATION, 2023, 2 (02):
  • [23] Reinforcement Learning with Uncertainty Estimation for Tactical Decision-Making in Intersections
    Hoel, Carl-Johan
    Tram, Tommy
    Sjoberg, Jonas
    2020 IEEE 23RD INTERNATIONAL CONFERENCE ON INTELLIGENT TRANSPORTATION SYSTEMS (ITSC), 2020,
  • [24] Decision-making under severe uncertainty for autonomous mobile robots
    Berleant, Daniel
    Anderson, Gary T.
    2007 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS, VOLS 1-8, 2007, : 2657 - +
  • [25] Autonomous Exploration Under Uncertainty via Deep Reinforcement Learning on Graphs
    Chen, Fanfei
    Martin, John D.
    Huang, Yewei
    Wang, Jinkun
    Englot, Brendan
    2020 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2020, : 6140 - 6147
  • [26] An Autonomous Vehicle Behavior Decision Method Based on Deep Reinforcement Learning with Hybrid State Space and Driving Risk
    Wang, Xu
    Qian, Bo
    Zhuo, Junchao
    Liu, Weiqun
    SENSORS, 2025, 25 (03)
  • [27] Autonomous Vehicle Decision and Control through Reinforcement Learning with Traffic Flow Randomization
    Lin, Yuan
    Xie, Antai
    Liu, Xiao
    MACHINES, 2024, 12 (04)
  • [28] Driver-like decision-making method for vehicle longitudinal autonomous driving based on deep reinforcement learning
    Gao, Zhenhai
    Yan, Xiangtong
    Gao, Fei
    He, Lei
    PROCEEDINGS OF THE INSTITUTION OF MECHANICAL ENGINEERS PART D-JOURNAL OF AUTOMOBILE ENGINEERING, 2022, 236 (13) : 3060 - 3070
  • [29] Ethical Alignment Decision Making for Connected Autonomous Vehicle in Traffic Dilemmas via Reinforcement Learning From Human Feedback
    Gao, Xin
    Luan, Tian
    Li, Xueyuan
    Liu, Qi
    Ma, Zhaoyang
    Meng, Xiaoqiang
    Li, Zirui
    IEEE INTERNET OF THINGS JOURNAL, 2024, 11 (23): : 38585 - 38600
  • [30] PNNUAD: Perception Neural Networks Uncertainty Aware Decision-Making for Autonomous Vehicle
    Liu, Jiaxin
    Wang, Hong
    Peng, Liang
    Cao, Zhong
    Yang, Diange
    Li, Jun
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (12) : 24355 - 24368