Large Language Models as Zero-Shot Human Models for Human-Robot Interaction

Cited by: 16
Authors
Zhang, Bowen [1]
Soh, Harold [2]
Affiliations
[1] Natl Univ Singapore, Dept Comp Sci, Singapore, Singapore
[2] NUS, Smart Syst Inst SSI, Singapore, Singapore
Source
2023 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS) | 2023
Funding
National Research Foundation, Singapore
DOI
10.1109/IROS55552.2023.10341488
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Human models play a crucial role in human-robot interaction (HRI), enabling robots to consider the impact of their actions on people and plan their behavior accordingly. However, crafting good human models is challenging; capturing context-dependent human behavior requires significant prior knowledge and/or large amounts of interaction data, both of which are difficult to obtain. In this work, we explore the potential of large language models (LLMs), which have consumed vast amounts of human-generated text data, to act as zero-shot human models for HRI. Our experiments on three social datasets yield promising results; the LLMs are able to achieve performance comparable to purpose-built models. That said, we also discuss current limitations, such as sensitivity to prompts and spatial/numerical reasoning mishaps. Based on our findings, we demonstrate how LLM-based human models can be integrated into a social robot's planning process and applied in HRI scenarios focused on the important element of trust. Specifically, we present one case study on a simulated trust-based table-clearing task and replicate past results that relied on custom models. Next, we conduct a new robot utensil-passing experiment (n = 65) where preliminary results show that planning with an LLM-based human model can achieve gains over a basic myopic plan. In summary, our results show that LLMs offer a promising (but incomplete) approach to human modeling for HRI.
Pages: 7961-7968
Page count: 8
Related papers
50 in total
  • [11] Large Language Models Are Zero-Shot Time Series Forecasters
    Gruver, Nate
    Finzi, Marc
    Qiu, Shikai
    Wilson, Andrew Gordon
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023
  • [12] Examining Zero-Shot Vulnerability Repair with Large Language Models
    Pearce, Hammond
    Tan, Benjamin
    Ahmad, Baleegh
    Karri, Ramesh
    Dolan-Gavitt, Brendan
    2023 IEEE SYMPOSIUM ON SECURITY AND PRIVACY, SP, 2023: 2339-2356
  • [14] Revisiting Large Language Models as Zero-shot Relation Extractors
    Li, Guozheng
    Wang, Peng
    Ke, Wenjun
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS - EMNLP 2023, 2023: 6877-6892
  • [15] Multi-modal Language Models for Human-Robot Interaction
    Janssens, Ruben
    COMPANION OF THE 2024 ACM/IEEE INTERNATIONAL CONFERENCE ON HUMAN-ROBOT INTERACTION, HRI 2024 COMPANION, 2024: 109-111
  • [16] Multi-turn Instruction Invocation on Human-Robot Interaction by Large Language Models
    Cheng, Baoping
    Huang, Yong
    Sun, Xiaoran
    Hu, Jingxi
    Li, Bo
    Pu, Qiran
    Wu, Zijian
    Tao, Xiaoming
    INTELLIGENT ROBOTICS AND APPLICATIONS, ICIRA 2024, PT VII, 2025, 15207: 207-219
  • [17] Comparison of various models of robot and human in human-robot interaction
    Luh, JYS
    Hu, SY
    1998 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS, VOLS 1-5, 1998: 1139-1144
  • [18] Language Models as Zero-Shot Trajectory Generators
    Kwon, Teyun
    Di Palo, Norman
    Johns, Edward
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2024, 9 (7): 6728-6735
  • [19] MEDAGENTS: Large Language Models as Collaborators for Zero-shot Medical Reasoning
    Tang, Xiangru
    Zou, Anni
    Zhang, Zhuosheng
    Li, Ziming
    Zhao, Yilun
    Zhang, Xingyao
    Cohen, Arman
    Gerstein, Mark
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: ACL 2024, 2024: 599-621
  • [20] Zero-shot Bilingual App Reviews Mining with Large Language Models
    Wei, Jialiang
    Courbis, Anne-Lise
    Lambolais, Thomas
    Xu, Binbin
    Bernard, Pierre Louis
    Dray, Gerard
    2023 IEEE 35TH INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE, ICTAI, 2023: 898-904