Inverse Risk-Sensitive Reinforcement Learning

被引:16
|
作者
Ratliff, Lillian J. [1 ]
Mazumdar, Eric [2 ]
机构
[1] Univ Washington, Dept Elect Engn, Seattle, WA 98195 USA
[2] Univ Calif Berkeley, Dept Elect Engn & Comp Sci, Berkeley, CA 94720 USA
基金
美国国家科学基金会;
关键词
Autonomous systems; Markov processes; optimization; reinforcement learning; CHOICE;
D O I
10.1109/TAC.2019.2926674
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This work addresses the problem of inverse reinforcement learning in Markov decision processes where the decision-making agent is risk-sensitive. In particular, a risk-sensitive reinforcement learning algorithm with convergence guarantees that makes use of coherent risk metrics and models of human decision-making which have their origins in behavioral psychology and economics is presented. The risk-sensitive reinforcement learning algorithm provides the theoretical underpinning for a gradient-based inverse reinforcement learning algorithm that seeks to minimize a loss function defined on the observed behavior. It is shown that the gradient of the loss function with respect to the model parameters is well defined and computable via a contraction map argument. Evaluation of the proposed technique is performed on a Grid World example, a canonical benchmark problem.
引用
收藏
页码:1256 / 1263
页数:8
相关论文
共 50 条
  • [1] Risk-sensitive Inverse Reinforcement Learning via Coherent Risk Models
    Majumdar, Anirudha
    Singh, Sumeet
    Mandlekar, Ajay
    Pavone, Marco
    ROBOTICS: SCIENCE AND SYSTEMS XIII, 2017,
  • [2] Gradient-Based Inverse Risk-Sensitive Reinforcement Learning
    Mazumdar, Eric
    Ratliff, Lillian J.
    Fiez, Tanner
    Sastry, S. Shankar
    2017 IEEE 56TH ANNUAL CONFERENCE ON DECISION AND CONTROL (CDC), 2017,
  • [3] Risk-Sensitive Reinforcement Learning
    Shen, Yun
    Tobia, Michael J.
    Sommer, Tobias
    Obermayer, Klaus
    NEURAL COMPUTATION, 2014, 26 (07) : 1298 - 1328
  • [4] Risk-sensitive reinforcement learning
    Mihatsch, O
    Neuneier, R
    MACHINE LEARNING, 2002, 49 (2-3) : 267 - 290
  • [5] Risk-Sensitive Reinforcement Learning
    Oliver Mihatsch
    Ralph Neuneier
    Machine Learning, 2002, 49 : 267 - 290
  • [6] Risk-Sensitive Policy with Distributional Reinforcement Learning
    Theate, Thibaut
    Ernst, Damien
    ALGORITHMS, 2023, 16 (07)
  • [7] Risk-sensitive Reinforcement Learning and Robust Learning for Control
    Noorani, Erfaun
    Baras, John S.
    2021 60TH IEEE CONFERENCE ON DECISION AND CONTROL (CDC), 2021, : 2976 - 2981
  • [8] A Probabilistic Perspective on Risk-sensitive Reinforcement Learning
    Noorani, Erfaun
    Baras, John S.
    2022 AMERICAN CONTROL CONFERENCE, ACC, 2022, : 2697 - 2702
  • [9] Regret Bounds for Risk-Sensitive Reinforcement Learning
    Bastani, Osbert
    Ma, Yecheng Jason
    Shen, Estelle
    Xu, Wanqiao
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35, NEURIPS 2022, 2022,
  • [10] Distributional Reinforcement Learning for Risk-Sensitive Policies
    Lim, Shiau Hong
    Malik, Ilyas
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,