SOTOPIA-π: Interactive Learning of Socially Intelligent Language Agents

Cited by: 0
Authors
Wang, Ruiyi [1]
Yu, Haofei [1]
Zhang, Wenxin [1]
Qi, Zhengyang [1]
Sap, Maarten [1]
Neubig, Graham [1]
Bisk, Yonatan [1]
Zhu, Hao [1]
Affiliations
[1] Carnegie Mellon Univ, Language Technol Inst, Pittsburgh, PA 15213 USA
Keywords
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Humans learn social skills through both imitation and social interaction. This social learning process is largely understudied by existing research on building language agents. Motivated by this gap, we propose an interactive learning method, SOTOPIA-π, improving the social intelligence of language agents. This method leverages behavior cloning and self-reinforcement training on filtered social interaction data according to large language model (LLM) ratings. We show that our training method allows a 7B LLM to reach the social goal completion ability of an expert model (GPT-4-based agent), while improving the safety of language agents and maintaining general QA ability on the MMLU benchmark. We also find that this training paradigm uncovers some difficulties in LLM-based evaluation of social intelligence: LLM-based evaluators overestimate the abilities of the language agents trained specifically for social interaction.
Pages: 12912-12940
Page count: 29
Related papers
50 in total
  • [21] Engineering Socially Intelligent Personal Agents via Norms
    Ajmeri, Nirav
    AAMAS'17: PROCEEDINGS OF THE 16TH INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS AND MULTIAGENT SYSTEMS, 2017, : 1822 - 1823
  • [22] Simulating socially intelligent agents in semantic virtual environments
    Grimaldo, Francisco
    Lozano, Miguel
    Barber, Fernando
    Vigueras, Guillermo
    KNOWLEDGE ENGINEERING REVIEW, 2008, 23 (04): : 369 - 388
  • [23] Socially Intelligent Genetic Agents for the Emergence of Explicit Norms
    Agrawal, Rishabh
    Ajmeri, Nirav
    Singh, Munindar P.
    PROCEEDINGS OF THE THIRTY-FIRST INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2022, 2022, : 10 - 16
  • [24] ESL GENIE: AN INTERACTIVE, INTELLIGENT SYSTEM FOR SECOND LANGUAGE LEARNING AND TEACHING ON THE WEB
    Alexandre, Jamie
    Elman, Jeff
    INTED2012: INTERNATIONAL TECHNOLOGY, EDUCATION AND DEVELOPMENT CONFERENCE, 2012, : 1300 - 1305
  • [25] Newtonian Action Advice: Integrating Human Verbal Instruction with Reinforcement Learning Socially Interactive Agents Track
    Krening, Samantha
    Feigh, Karen M.
    AAMAS '19: PROCEEDINGS OF THE 18TH INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS AND MULTIAGENT SYSTEMS, 2019, : 720 - 727
  • [26] Towards Investigating Gaze and Laughter Coordination in Socially Interactive Agents
    Maraev, Vladislav
    Mazzocconi, Chiara
    Howes, Christine
    Pelachaud, Catherine
    PROCEEDINGS OF THE 11TH CONFERENCE ON HUMAN-AGENT INTERACTION, HAI 2023, 2023, : 473 - 475
  • [27] Simulating Interactive Learning Scenarios with Intelligent Pedagogical Agents in a Virtual World through BDI-Based Agents
    Soliman, Mohamed
    Guetl, Christian
    INTERNATIONAL JOURNAL OF ENGINEERING PEDAGOGY, 2013, 3 (02): : 41 - 47
  • [28] Socially intelligent agents: Creating relationships with computers and robots.
    Yang, Y
    Wang, R
    INFORMATION PROCESSING & MANAGEMENT, 2004, 40 (02) : 379 - 381
  • [29] Intelligent Affect: Rational Decision Making for Socially Aligned Agents
    Asghar, Nabiha
    Hoey, Jesse
    UNCERTAINTY IN ARTIFICIAL INTELLIGENCE, 2015, : 72 - 81
  • [30] Modeling volunteer computing grid on the Web by socially intelligent agents
    Li, W
    Yu, XH
    COMPUTERS AND THEIR APPLICATIONS, 2003, : 450 - 455