Emergent Social Learning via Multi-agent Reinforcement Learning

被引:0
|
作者
Ndousse, Kamal [1 ]
Eck, Douglas [2 ]
Levine, Sergey [2 ,3 ]
Jaques, Natasha [2 ,3 ]
机构
[1] OpenAI, San Francisco, CA 94110 USA
[2] Google Res, Brain Team, Mountain View, CA USA
[3] Univ Calif Berkeley, Berkeley, CA USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Social learning is a key component of human and animal intelligence. By taking cues from the behavior of experts in their environment, social learners can acquire sophisticated behavior and rapidly adapt to new circumstances. This paper investigates whether independent reinforcement learning (RL) agents in a multi-agent environment can learn to use social learning to improve their performance. We find that in most circumstances, vanilla model-free RL agents do not use social learning. We analyze the reasons for this deficiency, and show that by imposing constraints on the training environment and introducing a model-based auxiliary loss we are able to obtain generalized social learning policies which enable agents to: i) discover complex skills that are not learned from single-agent training, and ii) adapt online to novel environments by taking cues from experts present in the new environment. In contrast, agents trained with model-free RL or imitation learning generalize poorly and do not succeed in the transfer tasks. By mixing multi-agent and solo training, we can obtain agents that use social learning to gain skills that they can deploy when alone, even out-performing agents trained alone from the start.
引用
收藏
页数:14
相关论文
共 50 条
  • [41] Learning adversarial policy in multiple scenes environment via multi-agent reinforcement learning
    Li, Yang
    Wang, Xinzhi
    Wang, Wei
    Zhang, Zhenyu
    Wang, Jianshu
    Luo, Xiangfeng
    Xie, Shaorong
    CONNECTION SCIENCE, 2021, 33 (03) : 407 - 426
  • [42] FedQMIX: Communication-efficient federated learning via multi-agent reinforcement learning
    Cao, Shaohua
    Zhang, Hanqing
    Wen, Tian
    Zhao, Hongwei
    Zheng, Quancheng
    Zhang, Weishan
    Zheng, Danyang
    HIGH-CONFIDENCE COMPUTING, 2024, 4 (02):
  • [43] Emergent Collective Behaviors in a Multi-agent Reinforcement Learning Pedestrian Simulation: A Case Study
    Martinez-Gil, Francisco
    Lozano, Miguel
    Fernandez, Fernando
    MULTI-AGENT-BASED SIMULATION XV, 2015, 9002 : 228 - 238
  • [44] Emergent behaviors and scalability for multi-agent reinforcement learning-based pedestrian models
    Martinez-Gil, Francisco
    Lozano, Miguel
    Fernandez, Fernando
    SIMULATION MODELLING PRACTICE AND THEORY, 2017, 74 : 117 - 133
  • [45] Emergent Escape-based Flocking Behavior using Multi-Agent Reinforcement Learning
    Hahn, Carsten
    Phan, Thomy
    Gabor, Thomas
    Belzner, Lenz
    Linnhoff-Popien, Claudia
    ALIFE 2019: THE 2019 CONFERENCE ON ARTIFICIAL LIFE, 2019, : 598 - 605
  • [46] Modeling Moral Choices in Social Dilemmas with Multi-Agent Reinforcement Learning
    Tennant, Elizaveta
    Hailes, Stephen
    Musolesi, Mirco
    PROCEEDINGS OF THE THIRTY-SECOND INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2023, 2023, : 317 - 325
  • [47] Social Influence as Intrinsic Motivation for Multi-Agent Deep Reinforcement Learning
    Jaques, Natasha
    Lazaridou, Angeliki
    Hughes, Edward
    Gulcehre, Caglar
    Ortega, Pedro A.
    Strouse, D. J.
    Leibo, Joel Z.
    de Freitas, Nando
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 97, 2019, 97
  • [48] Formal contracts mitigate social dilemmas in multi-agent reinforcement learning
    Haupt, Andreas
    Christoffersen, Phillip
    Damani, Mehul
    Hadfield-Menell, Dylan
    AUTONOMOUS AGENTS AND MULTI-AGENT SYSTEMS, 2024, 38 (02)
  • [49] Automating Feature Subspace Exploration via Multi-Agent Reinforcement Learning
    Liu, Kunpeng
    Fu, Yanjie
    Wang, Pengfei
    Wu, Le
    Bo, Rui
    Li, Xiaolin
    KDD'19: PROCEEDINGS OF THE 25TH ACM SIGKDD INTERNATIONAL CONFERENCCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2019, : 207 - 215
  • [50] Collaborative Multi-Agent Dialogue Model Training Via Reinforcement Learning
    Papangelis, Alexandros
    Wang, Yi-Chia
    Molino, Piero
    Tur, Gokhan
    20TH ANNUAL MEETING OF THE SPECIAL INTEREST GROUP ON DISCOURSE AND DIALOGUE (SIGDIAL 2019), 2019, : 92 - 102