Belief state separated reinforcement learning for autonomous vehicle decision making under uncertainty

Cited by: 2
Authors
Gu, Ziqing [1]
Yang, Yujie [1]
Duan, Jingliang [1]
Li, Shengbo Eben [1]
Chen, Jianyu [2]
Cao, Wenhan [1]
Zheng, Sifa [1]
Affiliations
[1] Tsinghua Univ, Sch Vehicle & Mobil, State Key Lab Automot Safety & Energy, Beijing 100084, Peoples R China
[2] Tsinghua Univ, Inst Interdisciplinary Informat Sci, Beijing, Peoples R China
Funding
National Key Research and Development Program
Keywords
autonomous vehicle; Markov decision process; uncertain environment; partially observable
DOI
10.1109/ITSC48978.2021.9564576
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
In autonomous driving, the ego vehicle and its surrounding traffic environments always have uncertainties like parameter and structural errors, behavior randomness of road users, etc. Furthermore, environmental sensors are noisy or even biased. This problem can be formulated as a partially observable Markov decision process. Existing methods lack a good representation of historical information, making it very challenging to find an optimal policy. This paper proposes a belief state separated reinforcement learning (RL) algorithm for decision-making of autonomous driving in uncertain environments. We extend the separation principle from linear Gaussian systems to general nonlinear stochastic environments, where the belief state, defined as the posterior distribution of the true state, is found to be a sufficient statistic of historical information. This belief state is estimated by action-enhanced variational inference from historical information and is proved to satisfy the Markovian property, thus allowing us to obtain the optimal policy using traditional RL algorithms for Markov decision processes. The policy gradient of a task-specific prior model is mixed with that of the interaction data to improve learning performance. The proposed algorithm is evaluated in a multi-lane autonomous driving task, where the surrounding vehicles are subject to behavior uncertainty and observation noise. The simulation results show that compared with existing RL algorithms, the proposed method can achieve a higher average return with better driving performance.
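The belief state mentioned in the abstract, the posterior distribution of the true state given the history, can be illustrated with a minimal sketch. The example below is an exact discrete Bayes filter, not the action-enhanced variational inference the paper proposes; the two-state scenario, the function name `belief_update`, and all probabilities are hypothetical values chosen only for illustration.

```python
def belief_update(belief, action, observation, transition, likelihood):
    """One step of an exact discrete Bayes filter (illustrative only).

    belief[s]                   : current posterior P(s_t = s | history)
    transition[action][s][s2]   : P(s_{t+1} = s2 | s_t = s, action)
    likelihood[s2][observation] : P(o_{t+1} = observation | s_{t+1} = s2)
    """
    n = len(belief)
    # Prediction: push the belief through the action-conditioned dynamics.
    predicted = [
        sum(belief[s] * transition[action][s][s2] for s in range(n))
        for s2 in range(n)
    ]
    # Correction: weight each state by how well it explains the observation.
    unnormalized = [predicted[s2] * likelihood[s2][observation] for s2 in range(n)]
    total = sum(unnormalized)
    if total == 0.0:
        raise ValueError("observation has zero probability under the model")
    return [w / total for w in unnormalized]


# Hypothetical scenario: a neighbouring vehicle keeps its lane (state 0)
# or cuts in (state 1); the sensor reports a noisy reading of that state.
transition = {"keep_speed": [[0.9, 0.1], [0.2, 0.8]]}
likelihood = [[0.8, 0.2], [0.3, 0.7]]  # rows: true state, columns: reading
belief = [0.5, 0.5]
belief = belief_update(belief, "keep_speed", 1, transition, likelihood)
```

Because the updated belief depends only on the previous belief, the action, and the new observation, a policy can take the belief as its input in place of the full history, which is the property the abstract relies on.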
Pages: 586-592
Number of pages: 7