SOTOPIA-Π: Interactive Learning of Socially Intelligent Language Agents

被引：0

作者：

Wang, Ruiyi ^{[1
]}

Yu, Haofei ^{[1
]}

Zhang, Wenxin ^{[1
]}

Qi, Zhengyang ^{[1
]}

Sap, Maarten ^{[1
]}

Neubig, Graham ^{[1
]}

Bisk, Yonatan ^{[1
]}

Zhu, Hao ^{[1
]}

机构：

[1] Carnegie Mellon Univ, Language Technol Inst, Pittsburgh, PA 15213 USA

来源：

PROCEEDINGS OF THE 62ND ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 1: LONG PAPERS | 2024年

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Humans learn social skills through both imitation and social interaction. This social learning process is largely understudied by existing research on building language agents. Motivated by this gap, we propose an interactive learning method, SOTOPIA-p, improving the social intelligence of language agents. This method leverages behavior cloning and self-reinforcement training on filtered social interaction data according to large language model (LLM) ratings. We show that our training method allows a 7B LLM to reach the social goal completion ability of an expert model (GPT-4-based agent), while improving the safety of language agents and maintaining general QA ability on the MMLU benchmark. We also find that this training paradigm uncovers some difficulties in LLM-based evaluation of social intelligence: LLM-based evaluators overestimate the abilities of the language agents trained specifically for social interaction.

引用

页码：12912 / 12940

页数：29

共 50 条

[1] Let's talk! Socially intelligent agents for language conversation training
Prendinger, H
Ishizuka, M
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART A-SYSTEMS AND HUMANS, 2001, 31 (05): : 465 - 471
[2] Socially intelligent tutor agents
Heylen, D
Nijholt, A
den Akker, RO
Vissers, M
INTELLIGENT VIRTUAL AGENTS, 2003, 2792 : 341 - 347
[3] Modeling socially intelligent agents
Edmonds, B
APPLIED ARTIFICIAL INTELLIGENCE, 1998, 12 (7-8) : 677 - 699
[4] Using Reinforcement Learning to Optimize the Policies of an Intelligent Tutoring System for Interpersonal Skills Training Socially Interactive Agents Track
Georgila, Kallirroi
Core, Mark G.
Nye, Benjamin D.
Karumbaiah, Shamya
Auerbach, Daniel
Ram, Maya
AAMAS '19: PROCEEDINGS OF THE 18TH INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS AND MULTIAGENT SYSTEMS, 2019, : 737 - 745
[5] Explanatory style for socially interactive agents
Oh, Sejin
Gratch, Jonathan
Woo, Woontack
AFFECTIVE COMPUTING AND INTELLIGENT INTERACTION, PROCEEDINGS, 2007, 4738 : 534 - +
[6] Prompting for Socially Intelligent Agents with ChatGPT
Antunes, Ana
Campos, Joana
Guimaraes, Manuel
Dias, Joao
Santos, Pedro A.
PROCEEDINGS OF THE 23RD ACM INTERNATIONAL CONFERENCE ON INTELLIGENT VIRTUAL AGENTS, IVA 2023, 2023,
[7] Socially intelligent agents - The human in the loop
Dautenhahn, K
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART A-SYSTEMS AND HUMANS, 2001, 31 (05): : 345 - 348
[8] Socially intelligent reasoning for autonomous agents
Hogg, LMJ
Jennings, NR
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART A-SYSTEMS AND HUMANS, 2001, 31 (05): : 381 - 393
[9] Perspectives on Socially Intelligent Conversational Agents
Brinkschulte, Luisa
Schloegl, Stephan
Monz, Alexander
Schottle, Pascal
Janetschek, Matthias
MULTIMODAL TECHNOLOGIES AND INTERACTION, 2022, 6 (08)
[10] Animating groups of socially intelligent agents
Grimaldo, Francisco
Lozano, Miguel
Barber, Fernando
Vigueras, Guillermo
2007 INTERNATIONAL CONFERENCE ON CYBERWORLDS, PROCEEDINGS, 2007, : 136 - 143

← 1 2 3 4 5 →