RISK-SENSITIVE DECISION MAKING VIA CONSTRAINED EXPECTED RETURNS

被引:0
|
作者
Hahn, Juergen [1 ]
Zoubir, Abdelhak M. [1 ]
机构
[1] Tech Univ Darmstadt, Signal Proc Grp, Merckstr 25, D-64283 Darmstadt, Germany
关键词
Markov decision process; Risk; Decision making; Constrained optimization; Reinforcement Learning; REINFORCEMENT;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Decision making based on Markov decision processes (MDPs) is an emerging research area as MDPs provide a convenient formalism to learn an optimal behavior in terms of a given reward. In many applications there are critical states that might harm the agent or the environment and should therefore be avoided. In practice, those states are often simply penalized with a negative reward where the penalty is set in a trial-anderror approach. For this reason, we propose a modification of the well-known value iteration algorithm that guarantees that critical states are visited with a pre-set probability only. Since this leads to an infeasible problem, we investigate the effect of nonlinear and linear approximations and discuss the effects. Two examples demonstrate the effectiveness of the proposed approach.
引用
收藏
页码:2569 / 2573
页数:5
相关论文
共 50 条
  • [31] Risk-Sensitive and Average Optimality in Markov Decision Processes
    Sladky, Karel
    PROCEEDINGS OF 30TH INTERNATIONAL CONFERENCE MATHEMATICAL METHODS IN ECONOMICS, PTS I AND II, 2012, : 799 - 804
  • [32] THE STOCHASTIC INTERDEPENDENCE OF DYNAMIC RISK-SENSITIVE DECISION RULES
    HALLETT, AJH
    INTERNATIONAL JOURNAL OF SYSTEMS SCIENCE, 1984, 15 (12) : 1301 - 1310
  • [33] On Risk-Sensitive Piecewise Deterministic Markov Decision Processes
    Xin Guo
    Yi Zhang
    Applied Mathematics & Optimization, 2020, 81 : 685 - 710
  • [34] Partially Observable Risk-Sensitive Markov Decision Processes
    Baeuerle, Nicole
    Rieder, Ulrich
    MATHEMATICS OF OPERATIONS RESEARCH, 2017, 42 (04) : 1180 - 1196
  • [35] Risk-Sensitive Average Optimality in Markov Decision Chains
    Sladky, Karel
    Montes-de-Oca, Raul
    OPERATIONS RESEARCH PROCEEDINGS 2007, 2008, : 69 - +
  • [36] Volatility Estimation of Financial Returns Using Risk-Sensitive Particle Filters
    Mundnich, Karel
    Orchard, Marcos E.
    Selva, Jorge F.
    Parada, Patricio
    STUDIES IN INFORMATICS AND CONTROL, 2013, 22 (03): : 297 - 306
  • [37] Risk-Sensitive Markov Decision Under Risk Constraints with Coherent Risk Measures
    Yoshida, Yuji
    MODELING DECISIONS FOR ARTIFICIAL INTELLIGENCE (MDAI 2019), 2019, 11676 : 29 - 40
  • [38] Robust Ranking Models via Risk-Sensitive Optimization
    Wang, Lidan
    Bennett, Paul N.
    Collins-Thompson, Kevyn
    SIGIR 2012: PROCEEDINGS OF THE 35TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2012, : 761 - 770
  • [39] Risk-Sensitive Reinforcement Learning Part I: Constrained Optimization Framework
    Prashanth, L. A.
    2019 FIFTH INDIAN CONTROL CONFERENCE (ICC), 2019, : 9 - 9
  • [40] Risk-Sensitive and Mean Variance Optimality in Markov Decision Processes
    Sladky, Karel
    Sitar, Milan
    PROCEEDINGS OF THE 26TH INTERNATIONAL CONFERENCE ON MATHEMATICAL METHODS IN ECONOMICS 2008, 2008, : 451 - 459