Efficient Sampling-Based Maximum Entropy Inverse Reinforcement Learning With Application to Autonomous Driving

被引:72
|
作者
Wu, Zheng [1 ]
Sun, Liting [1 ]
Zhan, Wei [1 ]
Yang, Chenyu [2 ]
Tomizuka, Masayoshi [1 ]
机构
[1] Univ Calif Berkeley, Dept Mech Engn, Berkeley, CA 94709 USA
[2] Shanghai Jiao Tong Univ, Dept Comp Sci, Shanghai 200240, Peoples R China
关键词
Learning from demonstration; intelligent transportation systems; inverse reinforcement learning; autonomous driving; social human-robot interaction; ALGORITHMS;
D O I
10.1109/LRA.2020.3005126
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
In the past decades, we have witnessed significant progress in the domain of autonomous driving. Advanced techniques based on optimization and reinforcement learning become increasingly powerful when solving the forward problem: given designed reward/cost functions, how we should optimize them and obtain driving policies that interact with the environment safely and efficiently. Such progress has raised another equally important question: what should we optimize? Instead of manually specifying the reward functions, it is desired that we can extract what human drivers try to optimize from real traffic data and assign that to autonomous vehicles to enable more naturalistic and transparent interaction between humans and intelligent agents. To address this issue, we present an efficient sampling-based maximum-entropy inverse reinforcement learning (IRL) algorithm in this letter. Different from existing IRL algorithms, by introducing an efficient continuous-domain trajectory sampler, the proposed algorithm can directly learn the reward functions in the continuous domain while considering the uncertainties in demonstrated trajectories from human drivers. We evaluate the proposed algorithm via real-world driving data, including both non-interactive and interactive scenarios. The experimental results show that the proposed algorithm achieves more accurate prediction performance with faster convergence speed and better generalization compared to other baseline IRL algorithms.
引用
收藏
页码:5355 / 5362
页数:8
相关论文
共 50 条
  • [11] A Distributional Analysis of Sampling-Based Reinforcement Learning Algorithms
    Amortila, Philip
    Precup, Doina
    Panangaden, Prakash
    Bellemare, Marc G.
    INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 108, 2020, 108 : 4357 - 4365
  • [12] Personalized Car Following for Autonomous Driving with Inverse Reinforcement Learning
    Zhao, Zhouqiao
    Wang, Ziran
    Han, Kyungtae
    Gupta, Rohit
    Tiwari, Prashant
    Wu, Guoyuan
    Barth, Matthew J.
    2022 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2022), 2022, : 2891 - 2897
  • [13] Maximum Entropy Semi-Supervised Inverse Reinforcement Learning
    Audiffren, Julien
    Valko, Michal
    Lazaric, Alessandro
    Ghavamzadeh, Mohammad
    PROCEEDINGS OF THE TWENTY-FOURTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE (IJCAI), 2015, : 3315 - 3321
  • [14] Comparison and Deduction of Maximum Entropy Deep Inverse Reinforcement Learning
    Chen, Guannan
    Fu, Yanfang
    Liu, Yu
    Dang, Xiangbin
    Hao, Jiajun
    Liu, Xinchen
    2023 IEEE 2ND INDUSTRIAL ELECTRONICS SOCIETY ANNUAL ON-LINE CONFERENCE, ONCON, 2023,
  • [15] Adaptive generative adversarial maximum entropy inverse reinforcement learning
    Song, Li
    Li, Dazi
    Xu, Xin
    INFORMATION SCIENCES, 2025, 695
  • [16] A Study of Continuous Maximum Entropy Deep Inverse Reinforcement Learning
    Chen, Xi-liang
    Cao, Lei
    Xu, Zhi-xiong
    Lai, Jun
    Li, Chen-xi
    MATHEMATICAL PROBLEMS IN ENGINEERING, 2019, 2019
  • [17] A Decision-making Method for Longitudinal Autonomous Driving Based on Inverse Reinforcement Learning
    Gao Z.
    Yan X.
    Gao F.
    Qiche Gongcheng/Automotive Engineering, 2022, 44 (07): : 969 - 975
  • [18] Safe, efficient, and comfortable velocity control based on reinforcement learning for autonomous driving
    Zhu, Meixin
    Wang, Yinhai
    Pu, Ziyuan
    Hu, Jingyun
    Wang, Xuesong
    Ke, Ruimin
    TRANSPORTATION RESEARCH PART C-EMERGING TECHNOLOGIES, 2020, 117
  • [19] Attention-Based Distributional Reinforcement Learning for Safe and Efficient Autonomous Driving
    Liu, Jia
    Yin, Jianwen
    Jiang, Zhengmin
    Liang, Qingyi
    Li, Huiyun
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2024, 9 (09): : 7477 - 7484
  • [20] Application of Reinforcement Learning in the Autonomous Driving Platform of the DeepRacer
    Zhu, Wenjie
    Du, Haikuo
    Zhu, Moyan
    Liu, Yanbo
    Lin, Chaoting
    Wang, Shaobo
    Sun, Weiqi
    Yan, Huaming
    2022 41ST CHINESE CONTROL CONFERENCE (CCC), 2022, : 5345 - 5352