Inverse Risk-Sensitive Reinforcement Learning

被引：16

作者：

Ratliff, Lillian J. ^{[1
]}

Mazumdar, Eric ^{[2
]}

机构：

[1] Univ Washington, Dept Elect Engn, Seattle, WA 98195 USA

[2] Univ Calif Berkeley, Dept Elect Engn & Comp Sci, Berkeley, CA 94720 USA

来源：

IEEE TRANSACTIONS ON AUTOMATIC CONTROL | 2020年 / 65卷 / 03期

基金：

美国国家科学基金会;

关键词：

Autonomous systems; Markov processes; optimization; reinforcement learning; CHOICE;

D O I：

10.1109/TAC.2019.2926674

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

This work addresses the problem of inverse reinforcement learning in Markov decision processes where the decision-making agent is risk-sensitive. In particular, a risk-sensitive reinforcement learning algorithm with convergence guarantees that makes use of coherent risk metrics and models of human decision-making which have their origins in behavioral psychology and economics is presented. The risk-sensitive reinforcement learning algorithm provides the theoretical underpinning for a gradient-based inverse reinforcement learning algorithm that seeks to minimize a loss function defined on the observed behavior. It is shown that the gradient of the loss function with respect to the model parameters is well defined and computable via a contraction map argument. Evaluation of the proposed technique is performed on a Grid World example, a canonical benchmark problem.

引用

页码：1256 / 1263

页数：8

共 50 条

[1] Risk-sensitive Inverse Reinforcement Learning via Coherent Risk Models
Majumdar, Anirudha
Singh, Sumeet
Mandlekar, Ajay
Pavone, Marco
ROBOTICS: SCIENCE AND SYSTEMS XIII, 2017,
[2] Gradient-Based Inverse Risk-Sensitive Reinforcement Learning
Mazumdar, Eric
Ratliff, Lillian J.
Fiez, Tanner
Sastry, S. Shankar
2017 IEEE 56TH ANNUAL CONFERENCE ON DECISION AND CONTROL (CDC), 2017,
[3] Risk-Sensitive Reinforcement Learning
Shen, Yun
Tobia, Michael J.
Sommer, Tobias
Obermayer, Klaus
NEURAL COMPUTATION, 2014, 26 (07) : 1298 - 1328
[4] Risk-sensitive reinforcement learning
Mihatsch, O
Neuneier, R
MACHINE LEARNING, 2002, 49 (2-3) : 267 - 290
[5] Risk-Sensitive Reinforcement Learning
Oliver Mihatsch
Ralph Neuneier
Machine Learning, 2002, 49 : 267 - 290
[6] Risk-Sensitive Policy with Distributional Reinforcement Learning
Theate, Thibaut
Ernst, Damien
ALGORITHMS, 2023, 16 (07)
[7] Risk-sensitive Reinforcement Learning and Robust Learning for Control
Noorani, Erfaun
Baras, John S.
2021 60TH IEEE CONFERENCE ON DECISION AND CONTROL (CDC), 2021, : 2976 - 2981
[8] A Probabilistic Perspective on Risk-sensitive Reinforcement Learning
Noorani, Erfaun
Baras, John S.
2022 AMERICAN CONTROL CONFERENCE, ACC, 2022, : 2697 - 2702
[9] Regret Bounds for Risk-Sensitive Reinforcement Learning
Bastani, Osbert
Ma, Yecheng Jason
Shen, Estelle
Xu, Wanqiao
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35, NEURIPS 2022, 2022,
[10] Distributional Reinforcement Learning for Risk-Sensitive Policies
Lim, Shiau Hong
Malik, Ilyas
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,

← 1 2 3 4 5 →