A Hierarchical Approach to Population Training for Human-AI Collaboration

被引:0
|
作者
Loo, Yi [1 ]
Gong, Chen [1 ]
Meghjani, Malika [1 ]
机构
[1] Singapore Univ Technol & Design SUTD, Singapore, Singapore
基金
新加坡国家研究基金会;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A major challenge for deep reinforcement learning (DRL) agents is to collaborate with novel partners that were not encountered by them during the training phase. This is specifically worsened by an increased variance in action responses when the DRL agents collaborate with human partners due to the lack of consistency in human behaviors. Recent work have shown that training a single agent as the best response to a diverse population of training partners significantly increases an agent's robustness to novel partners. We further enhance the population-based training approach by introducing a Hierarchical Reinforcement Learning (HRL) based method for Human-AI Collaboration. Our agent is able to learn multiple best-response policies as its low-level policy while at the same time, it learns a high-level policy that acts as a manager which allows the agent to dynamically switch between the low-level best-response policies based on its current partner. We demonstrate that our method is able to dynamically adapt to novel partners of different play styles and skill levels in the 2-player collaborative Overcooked game environment. We also conducted a human study in the same environment to test the effectiveness of our method when partnering with real human subjects. Code is available at https://gitlab.com/marvl-hipt/hipt.
引用
收藏
页码:3011 / 3019
页数:9
相关论文
共 50 条
  • [1] Emotions in Human-AI Collaboration
    Ferrada, Filipa
    Camarinha-Matos, Luis M.
    NAVIGATING UNPREDICTABILITY: COLLABORATIVE NETWORKS IN NON-LINEAR WORLDS, PRO-VE 2024, PT I, 2024, 726 : 101 - 117
  • [2] Human-AI Collaboration with Bandit Feedback
    Gao, Ruijiang
    Saar-Tsechansky, Maytal
    De-Arteaga, Maria
    Han, Ligong
    Lee, Min Kyung
    Lease, Matthew
    PROCEEDINGS OF THE THIRTIETH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2021, 2021, : 1722 - 1728
  • [3] Human-AI Collaboration in Recruitment and Selection
    Natarajan, Neil
    PROCEEDINGS OF THE THIRTY-SECOND INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2023, 2023, : 7089 - 7090
  • [4] Rethinking Fairness for Human-AI Collaboration
    Ge, Haosen
    Bastani, Hamsa
    Bastani, Osbert
    15TH INNOVATIONS IN THEORETICAL COMPUTER SCIENCE CONFERENCE, ITCS 2024, 2024,
  • [5] Diverse Conventions for Human-AI Collaboration
    Sarkar, Bidipta
    Shih, Andy
    Sadigh, Dorsa
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [6] AI in Education, Learner Control, and Human-AI Collaboration
    Brusilovsky, Peter
    INTERNATIONAL JOURNAL OF ARTIFICIAL INTELLIGENCE IN EDUCATION, 2024, 34 (01) : 122 - 135
  • [7] Improving Human-AI Collaboration With Descriptions of AI Behavior
    Cabrera Á.A.
    Perer A.
    Hong J.I.
    Proc. ACM Hum. Comput. Interact., 2023, CSCW1
  • [8] AI in Education, Learner Control, and Human-AI Collaboration
    Peter Brusilovsky
    International Journal of Artificial Intelligence in Education, 2024, 34 : 122 - 135
  • [9] Specifying AI Objectives as a Human-AI Collaboration Problem
    Dragan, Anca
    AIES '19: PROCEEDINGS OF THE 2019 AAAI/ACM CONFERENCE ON AI, ETHICS, AND SOCIETY, 2019, : 329 - 329
  • [10] Human-AI Collaboration to Increase the Perception of VR
    Jaszcz, Antoni
    Prokop, Katarzyna
    Polap, Dawid
    Srivastava, Gautam
    Lin, Jerry Chun-Wei
    ARTIFICIAL INTELLIGENCE AND SOFT COMPUTING, ICAISC 2022, PT I, 2023, 13588 : 51 - 60