SOTOPIA-π: Interactive Learning of Socially Intelligent Language Agents

Cited by: 0

Authors:

Wang, Ruiyi [1]

Yu, Haofei [1]

Zhang, Wenxin [1]

Qi, Zhengyang [1]

Sap, Maarten [1]

Neubig, Graham [1]

Bisk, Yonatan [1]

Zhu, Hao [1]

Affiliations:

[1] Carnegie Mellon Univ, Language Technol Inst, Pittsburgh, PA 15213 USA

Source:

PROCEEDINGS OF THE 62ND ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 1: LONG PAPERS | 2024

Keywords:

DOI:

Not available

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Humans learn social skills through both imitation and social interaction. This social learning process is largely understudied by existing research on building language agents. Motivated by this gap, we propose an interactive learning method, SOTOPIA-π, improving the social intelligence of language agents. This method leverages behavior cloning and self-reinforcement training on filtered social interaction data according to large language model (LLM) ratings. We show that our training method allows a 7B LLM to reach the social goal completion ability of an expert model (GPT-4-based agent), while improving the safety of language agents and maintaining general QA ability on the MMLU benchmark. We also find that this training paradigm uncovers some difficulties in LLM-based evaluation of social intelligence: LLM-based evaluators overestimate the abilities of the language agents trained specifically for social interaction.
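
The data-filtering step mentioned in the abstract (keeping social interaction data according to LLM ratings, then using it for behavior cloning and self-reinforcement) might be sketched roughly as follows. This is only an illustration under stated assumptions: the `Episode` record, its field names, the `source` labels, and the threshold value of 7.0 are all hypothetical and are not taken from the paper, which may filter differently.

```python
from dataclasses import dataclass


@dataclass
class Episode:
    """One simulated social interaction (hypothetical record layout)."""
    source: str        # "expert" (for behavior cloning) or "self" (for self-reinforcement)
    dialogue: str      # transcript of the interaction
    llm_rating: float  # score assigned by an LLM rater (scale is an assumption)


def filter_by_rating(episodes, threshold):
    """Keep only episodes whose LLM rating is at or above the threshold."""
    return [ep for ep in episodes if ep.llm_rating >= threshold]


def build_training_set(episodes, threshold=7.0):
    """Split the filtered episodes into the two training pools.

    The threshold default is an illustrative assumption, not a value
    reported in the abstract.
    """
    kept = filter_by_rating(episodes, threshold)
    return {
        "behavior_cloning": [ep for ep in kept if ep.source == "expert"],
        "self_reinforcement": [ep for ep in kept if ep.source == "self"],
    }
```

In this sketch the expert-generated episodes feed behavior cloning and the agent's own episodes feed self-reinforcement; both pools pass through the same rating filter before any fine-tuning would happen.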


Pages: 12912-12940

Page count: 29

