Inverse Risk-Sensitive Reinforcement Learning

被引:16
|
作者
Ratliff, Lillian J. [1 ]
Mazumdar, Eric [2 ]
机构
[1] Univ Washington, Dept Elect Engn, Seattle, WA 98195 USA
[2] Univ Calif Berkeley, Dept Elect Engn & Comp Sci, Berkeley, CA 94720 USA
基金
美国国家科学基金会;
关键词
Autonomous systems; Markov processes; optimization; reinforcement learning; CHOICE;
D O I
10.1109/TAC.2019.2926674
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This work addresses the problem of inverse reinforcement learning in Markov decision processes where the decision-making agent is risk-sensitive. In particular, a risk-sensitive reinforcement learning algorithm with convergence guarantees that makes use of coherent risk metrics and models of human decision-making which have their origins in behavioral psychology and economics is presented. The risk-sensitive reinforcement learning algorithm provides the theoretical underpinning for a gradient-based inverse reinforcement learning algorithm that seeks to minimize a loss function defined on the observed behavior. It is shown that the gradient of the loss function with respect to the model parameters is well defined and computable via a contraction map argument. Evaluation of the proposed technique is performed on a Grid World example, a canonical benchmark problem.
引用
收藏
页码:1256 / 1263
页数:8
相关论文
共 50 条
  • [41] Risk-Sensitive Autonomous Exploration of Unknown Environments: A Deep Reinforcement Learning Perspective
    Sarfi, Mohammad Hossein
    Bisheban, Mahdis
    JOURNAL OF INTELLIGENT & ROBOTIC SYSTEMS, 2025, 111 (01)
  • [42] A Tighter Problem-Dependent Regret Bound for Risk-Sensitive Reinforcement Learning
    Hu, Xiaoyan
    Leung, Ho-Fung
    INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 206, 2023, 206
  • [43] Sample-Efficient Multimodal Dynamics Modeling for Risk-Sensitive Reinforcement Learning
    Yashima, Ryota
    Yamaguchi, Akihiko
    Hashimoto, Koichi
    2022 8th International Conference on Mechatronics and Robotics Engineering, ICMRE 2022, 2022, : 21 - 27
  • [44] Embracing Risk in Reinforcement Learning: The Connection between Risk-Sensitive Exponential and Distributionally Robust Criteria
    Noorani, Erfaun
    Baras, John S.
    2022 AMERICAN CONTROL CONFERENCE, ACC, 2022, : 2703 - 2708
  • [45] Risk-Sensitive Reinforcement Learning: Near-Optimal Risk-Sample Tradeoff in Regret
    Fei, Yingjie
    Yang, Zhuoran
    Chen, Yudong
    Wang, Zhaoran
    Xie, Qiaomin
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
  • [46] A Learning Algorithm for Risk-Sensitive Cost
    Basu, Arnab
    Bhattacharyya, Tirthankar
    Borkar, Vivek S.
    MATHEMATICS OF OPERATIONS RESEARCH, 2008, 33 (04) : 880 - 898
  • [47] Policy Gradient Based Entropic-VaR Optimization in Risk-Sensitive Reinforcement Learning
    Ni, Xinyi
    Lai, Lifeng
    2022 58TH ANNUAL ALLERTON CONFERENCE ON COMMUNICATION, CONTROL, AND COMPUTING (ALLERTON), 2022,
  • [48] Influence of budget and reinforcement location on risk-sensitive preference
    O'Daly, Matthew
    Case, David A.
    Fantino, Edmund
    BEHAVIOURAL PROCESSES, 2006, 73 (02) : 125 - 135
  • [49] Uncertainty-Aware Reinforcement Learning for Risk-Sensitive Player Evaluation in Sports Game
    Liu, Guiliang
    Luo, Yudong
    Schulte, Oliver
    Poupart, Pascal
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [50] Risk-Sensitive Mobile Robot Navigation in Crowded Environment via Offline Reinforcement Learning
    Wu, Jiaxu
    Wang, Yusheng
    Asama, Hajime
    An, Qi
    Yamashita, Atsushi
    2023 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2023, : 7456 - 7462