Inverse Risk-Sensitive Reinforcement Learning

Cited by: 16

Authors:

Ratliff, Lillian J. [1]

Mazumdar, Eric [2]

Affiliations:

[1] Univ Washington, Dept Elect Engn, Seattle, WA 98195 USA

[2] Univ Calif Berkeley, Dept Elect Engn & Comp Sci, Berkeley, CA 94720 USA

Source:

IEEE TRANSACTIONS ON AUTOMATIC CONTROL | 2020 / Vol. 65 / No. 03

Funding:

U.S. National Science Foundation;

Keywords:

Autonomous systems; Markov processes; optimization; reinforcement learning; CHOICE;

DOI:

10.1109/TAC.2019.2926674

Chinese Library Classification (CLC) number:

TP [Automation technology, computer technology];

Discipline classification code:

0812 ;

Abstract:

This work addresses the problem of inverse reinforcement learning in Markov decision processes where the decision-making agent is risk-sensitive. In particular, a risk-sensitive reinforcement learning algorithm with convergence guarantees that makes use of coherent risk metrics and models of human decision-making which have their origins in behavioral psychology and economics is presented. The risk-sensitive reinforcement learning algorithm provides the theoretical underpinning for a gradient-based inverse reinforcement learning algorithm that seeks to minimize a loss function defined on the observed behavior. It is shown that the gradient of the loss function with respect to the model parameters is well defined and computable via a contraction map argument. Evaluation of the proposed technique is performed on a Grid World example, a canonical benchmark problem.

Pages: 1256 - 1263

Page count: 8

50 records in total

[41] Risk-Sensitive Autonomous Exploration of Unknown Environments: A Deep Reinforcement Learning Perspective
Sarfi, Mohammad Hossein
Bisheban, Mahdis
JOURNAL OF INTELLIGENT & ROBOTIC SYSTEMS, 2025, 111 (01)
[42] A Tighter Problem-Dependent Regret Bound for Risk-Sensitive Reinforcement Learning
Hu, Xiaoyan
Leung, Ho-Fung
INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 206, 2023, 206
[43] Sample-Efficient Multimodal Dynamics Modeling for Risk-Sensitive Reinforcement Learning
Yashima, Ryota
Yamaguchi, Akihiko
Hashimoto, Koichi
2022 8th International Conference on Mechatronics and Robotics Engineering, ICMRE 2022, 2022, : 21 - 27
[44] Embracing Risk in Reinforcement Learning: The Connection between Risk-Sensitive Exponential and Distributionally Robust Criteria
Noorani, Erfaun
Baras, John S.
2022 AMERICAN CONTROL CONFERENCE, ACC, 2022, : 2703 - 2708
[45] Risk-Sensitive Reinforcement Learning: Near-Optimal Risk-Sample Tradeoff in Regret
Fei, Yingjie
Yang, Zhuoran
Chen, Yudong
Wang, Zhaoran
Xie, Qiaomin
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
[46] A Learning Algorithm for Risk-Sensitive Cost
Basu, Arnab
Bhattacharyya, Tirthankar
Borkar, Vivek S.
MATHEMATICS OF OPERATIONS RESEARCH, 2008, 33 (04) : 880 - 898
[47] Policy Gradient Based Entropic-VaR Optimization in Risk-Sensitive Reinforcement Learning
Ni, Xinyi
Lai, Lifeng
2022 58TH ANNUAL ALLERTON CONFERENCE ON COMMUNICATION, CONTROL, AND COMPUTING (ALLERTON), 2022,
[48] Influence of budget and reinforcement location on risk-sensitive preference
O'Daly, Matthew
Case, David A.
Fantino, Edmund
BEHAVIOURAL PROCESSES, 2006, 73 (02) : 125 - 135
[49] Uncertainty-Aware Reinforcement Learning for Risk-Sensitive Player Evaluation in Sports Game
Liu, Guiliang
Luo, Yudong
Schulte, Oliver
Poupart, Pascal
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
[50] Risk-Sensitive Mobile Robot Navigation in Crowded Environment via Offline Reinforcement Learning
Wu, Jiaxu
Wang, Yusheng
Asama, Hajime
An, Qi
Yamashita, Atsushi
2023 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2023, : 7456 - 7462