Reinforcement Learning Under Moral Uncertainty

被引:0
|
作者
Ecoffet, Adrien [1 ,2 ]
Lehman, Joel [1 ,2 ]
机构
[1] Uber AI Labs, San Francisco, CA 94103 USA
[2] OpenAI, San Francisco, CA 94110 USA
关键词
SOCIAL RATE;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
An ambitious goal for machine learning is to create agents that behave ethically: The capacity to abide by human moral norms would greatly expand the context in which autonomous agents could be practically and safely deployed, e.g. fully autonomous vehicles will encounter charged moral decisions that complicate their deployment. While ethical agents could be trained by rewarding correct behavior under a specific moral theory (e.g. utilitarianism), there remains widespread disagreement about the nature of morality. Acknowledging such disagreement, recent work in moral philosophy proposes that ethical behavior requires acting under moral uncertainty, i.e. to take into account when acting that one's credence is split across several plausible ethical theories. This paper translates such insights to the field of reinforcement learning, proposes two training methods that realize different points among competing desiderata, and trains agents in simple environments to act under moral uncertainty. The results illustrate (1) how such uncertainty can help curb extreme behavior from commitment to single theories and (2) several technical complications arising from attempting to ground moral philosophy in RL (e.g. how can a principled trade-off between two competing but incomparable reward functions be reached). The aim is to catalyze progress towards morally-competent agents and highlight the potential of RL to contribute towards the computational grounding of moral philosophy.
引用
收藏
页数:11
相关论文
共 50 条
  • [1] Constrained Reinforcement Learning for Dynamic Optimization under Uncertainty
    Petsagkourakis, P.
    Sandoval, I. O.
    Bradford, E.
    Zhang, D.
    del Rio-Chanona, E. A.
    IFAC PAPERSONLINE, 2020, 53 (02): : 11264 - 11270
  • [2] MORAL ENCROACHMENT UNDER MORAL UNCERTAINTY
    Babic, Boris
    King, Zoe Johnson
    PHILOSOPHERS IMPRINT, 2023, 23 (12): : 1 - 28
  • [3] Reinforcement Learning Under Uncertainty: Expected Versus Unexpected Uncertainty and State Versus Reward Uncertainty
    Ez-zizi A.
    Farrell S.
    Leslie D.
    Malhotra G.
    Ludwig C.J.H.
    Computational Brain & Behavior, 2023, 6 (4) : 626 - 650
  • [4] Reinforcement learning for decision-making under deep uncertainty
    Pei, Zhihao
    Rojas-Arevalo, Angela M.
    de Haan, Fjalar J.
    Lipovetzky, Nir
    Moallemi, Enayat A.
    JOURNAL OF ENVIRONMENTAL MANAGEMENT, 2024, 359
  • [5] SKILLFUL CONTROL UNDER UNCERTAINTY VIA DIRECT REINFORCEMENT LEARNING
    GULLAPALLI, V
    ROBOTICS AND AUTONOMOUS SYSTEMS, 1995, 15 (04) : 237 - 246
  • [6] A Deep Reinforcement Learning Approach to Sensor Placement under Uncertainty
    Jabini, Amin
    Johnson, Erik A.
    IFAC PAPERSONLINE, 2022, 55 (27): : 178 - 183
  • [7] Moral Reasoning under Uncertainty
    The Anh Han
    Saptawijaya, Ari
    Pereira, Luis Moniz
    LOGIC FOR PROGRAMMING, ARTIFICIAL INTELLIGENCE, AND REASONING (LPAR-18), 2012, 7180 : 212 - 227
  • [8] Reinforcement Learning Framework for Modeling Spatial Sequential Decisions under Uncertainty
    Truc Viet Le
    Liu, Siyuan
    Lau, Hoong Chuin
    AAMAS'16: PROCEEDINGS OF THE 2016 INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS & MULTIAGENT SYSTEMS, 2016, : 1449 - 1450
  • [9] Autonomous Exploration Under Uncertainty via Deep Reinforcement Learning on Graphs
    Chen, Fanfei
    Martin, John D.
    Huang, Yewei
    Wang, Jinkun
    Englot, Brendan
    2020 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2020, : 6140 - 6147
  • [10] Modelling and Predictive Monitoring of Business Processes under Uncertainty with Reinforcement Learning
    Bousdekis, Alexandros
    Kerasiotis, Athanasios
    Kotsias, Silvester
    Theodoropoulou, Georgia
    Miaoulis, Georgios
    Ghazanfarpour, Djamchid
    SENSORS, 2023, 23 (15)