The Puzzle of Evaluating Moral Cognition in Artificial Agents

Cited by: 4
Authors
Reinecke, Madeline G. [1 ,2 ,3 ]
Mao, Yiran [1 ,4 ]
Kunesch, Markus [1 ]
Duenez-Guzman, Edgar A. [1 ]
Haas, Julia [1 ]
Leibo, Joel Z. [1 ]
Affiliations
[1] Google DeepMind, London, England
[2] Yale Univ, Dept Psychol, New Haven, CT USA
[3] Yale Univ, Dept Psychol, 100 Coll St, New Haven, CT 06510 USA
[4] Google DeepMind, London N1C 4DN, England
Keywords
Moral cognition; Artificial intelligence; Multi-agent reinforcement learning; Judgments; Intent
DOI
10.1111/cogs.13315
Chinese Library Classification (CLC)
B84 [Psychology]
Subject Classification Codes
04; 0402
Abstract
In developing artificial intelligence (AI), researchers often benchmark against human performance as a measure of progress. Is this kind of comparison possible for moral cognition? Given that human moral judgment often hinges on intangible properties like "intention," which may have no natural analog in artificial agents, it may prove difficult to design a "like-for-like" comparison between the moral behavior of artificial and human agents. What would a measure of moral behavior for both humans and AI look like? We unravel the complexity of this question by discussing examples within reinforcement learning and generative AI, and we examine how the puzzle of evaluating artificial agents' moral cognition remains open for further investigation within cognitive science.
Pages: 7