SOTOPIA-Π: Interactive Learning of Socially Intelligent Language Agents

被引:0
|
作者
Wang, Ruiyi [1 ]
Yu, Haofei [1 ]
Zhang, Wenxin [1 ]
Qi, Zhengyang [1 ]
Sap, Maarten [1 ]
Neubig, Graham [1 ]
Bisk, Yonatan [1 ]
Zhu, Hao [1 ]
机构
[1] Carnegie Mellon Univ, Language Technol Inst, Pittsburgh, PA 15213 USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Humans learn social skills through both imitation and social interaction. This social learning process is largely understudied by existing research on building language agents. Motivated by this gap, we propose an interactive learning method, SOTOPIA-p, improving the social intelligence of language agents. This method leverages behavior cloning and self-reinforcement training on filtered social interaction data according to large language model (LLM) ratings. We show that our training method allows a 7B LLM to reach the social goal completion ability of an expert model (GPT-4-based agent), while improving the safety of language agents and maintaining general QA ability on the MMLU benchmark. We also find that this training paradigm uncovers some difficulties in LLM-based evaluation of social intelligence: LLM-based evaluators overestimate the abilities of the language agents trained specifically for social interaction.
引用
收藏
页码:12912 / 12940
页数:29
相关论文
共 50 条
  • [1] Let's talk! Socially intelligent agents for language conversation training
    Prendinger, H
    Ishizuka, M
    IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART A-SYSTEMS AND HUMANS, 2001, 31 (05): : 465 - 471
  • [2] Socially intelligent tutor agents
    Heylen, D
    Nijholt, A
    den Akker, RO
    Vissers, M
    INTELLIGENT VIRTUAL AGENTS, 2003, 2792 : 341 - 347
  • [3] Modeling socially intelligent agents
    Edmonds, B
    APPLIED ARTIFICIAL INTELLIGENCE, 1998, 12 (7-8) : 677 - 699
  • [4] Using Reinforcement Learning to Optimize the Policies of an Intelligent Tutoring System for Interpersonal Skills Training Socially Interactive Agents Track
    Georgila, Kallirroi
    Core, Mark G.
    Nye, Benjamin D.
    Karumbaiah, Shamya
    Auerbach, Daniel
    Ram, Maya
    AAMAS '19: PROCEEDINGS OF THE 18TH INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS AND MULTIAGENT SYSTEMS, 2019, : 737 - 745
  • [5] Explanatory style for socially interactive agents
    Oh, Sejin
    Gratch, Jonathan
    Woo, Woontack
    AFFECTIVE COMPUTING AND INTELLIGENT INTERACTION, PROCEEDINGS, 2007, 4738 : 534 - +
  • [6] Prompting for Socially Intelligent Agents with ChatGPT
    Antunes, Ana
    Campos, Joana
    Guimaraes, Manuel
    Dias, Joao
    Santos, Pedro A.
    PROCEEDINGS OF THE 23RD ACM INTERNATIONAL CONFERENCE ON INTELLIGENT VIRTUAL AGENTS, IVA 2023, 2023,
  • [7] Socially intelligent agents - The human in the loop
    Dautenhahn, K
    IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART A-SYSTEMS AND HUMANS, 2001, 31 (05): : 345 - 348
  • [8] Socially intelligent reasoning for autonomous agents
    Hogg, LMJ
    Jennings, NR
    IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART A-SYSTEMS AND HUMANS, 2001, 31 (05): : 381 - 393
  • [9] Perspectives on Socially Intelligent Conversational Agents
    Brinkschulte, Luisa
    Schloegl, Stephan
    Monz, Alexander
    Schottle, Pascal
    Janetschek, Matthias
    MULTIMODAL TECHNOLOGIES AND INTERACTION, 2022, 6 (08)
  • [10] Animating groups of socially intelligent agents
    Grimaldo, Francisco
    Lozano, Miguel
    Barber, Fernando
    Vigueras, Guillermo
    2007 INTERNATIONAL CONFERENCE ON CYBERWORLDS, PROCEEDINGS, 2007, : 136 - 143