Moral Gridworlds: A Theoretical Proposal for Modeling Artificial Moral Cognition

被引:0
|
作者
Julia Haas
机构
[1] Rhodes College,Department of Philosophy
来源
Minds and Machines | 2020年 / 30卷
关键词
Artificial intelligence; Moral AI; Moral cognition; Machine ethics; Moral psychology; Reinforcement learning; Fairness;
D O I
暂无
中图分类号
学科分类号
摘要
I describe a suite of reinforcement learning environments in which artificial agents learn to value and respond to moral content and contexts. I illustrate the core principles of the framework by characterizing one such environment, or “gridworld,” in which an agent learns to trade-off between monetary profit and fair dealing, as applied in a standard behavioral economic paradigm. I then highlight the core technical and philosophical advantages of the learning approach for modeling moral cognition, and for addressing the so-called value alignment problem in AI.
引用
收藏
页码:219 / 246
页数:27
相关论文
共 50 条