Preference-Conditioned Language-Guided Abstraction

被引：2

作者：

Peng, Andi ^{[1
]}

Bobu, Andreea ^{[2
]}

Li, Belinda Z. ^{[1
]}

Sumers, Theodore R. ^{[3
]}

Sucholutsky, Ilia ^{[3
]}

Kumar, Nishanth ^{[1
]}

Grifths, Thomas L. ^{[3
]}

Shah, Julie A. ^{[1
]}

机构：

[1] MIT, Cambridge, MA 02139 USA

[2] Boston Dynam AI Inst, Cambridge, MA USA

[3] Princeton, Princeton, NJ USA

来源：

PROCEEDINGS OF THE 2024 ACM/IEEE INTERNATIONAL CONFERENCE ON HUMAN-ROBOT INTERACTION, HRI 2024 | 2024年

关键词：

state abstraction; learning from human input; human preferences;

D O I：

10.1145/3610977.3634930

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Learning from demonstrations is a common way for users to teach robots, but it is prone to spurious feature correlations. Recent work constructs state abstractions, i.e. visual representations containing task-relevant features, from language as a way to perform more generalizable learning. However, these abstractions also depend on a user's preference for what matters in a task, which may be hard to describe or infeasible to exhaustively specify using language alone. How do we construct abstractions to capture these latent preferences? We observe that how humans behave reveals how they see the world. Our key insight is that changes in human behavior inform us that there are diferences in preferences for how humans see the world, i.e. their state abstractions. In this work, we propose using language models (LMs) to query for those preferences directly given knowledge that a change in behavior has occurred. In our framework, we use the LM in two ways: frst, given a text description of the task and knowledge of behavioral change between states, we query the LM for possible hidden preferences; second, given the most likely preference, we query the LM to construct the state abstraction. In this framework, the LM is also able to ask the human directly when uncertain about its own estimate. We demonstrate our framework's ability to construct efective preference-conditioned abstractions in simulated experiments, a user study, as well as on a real Spot robot performing mobile manipulation tasks.

引用

页码：572 / 581

页数：10

共 50 条

[1] PREFERENCE-CONDITIONED NECESSITIES: DETACHMENT AND PRACTICAL REASONING
Lauer, Sven
Condoravdi, Cleo
PACIFIC PHILOSOPHICAL QUARTERLY, 2014, 95 (04) : 584 - 621
[2] PACER: Preference-Conditioned All-Terrain Costmap Generation
Mao, Luisa
Warnell, Garrett
Stone, Peter
Biswas, Joydeep
IEEE ROBOTICS AND AUTOMATION LETTERS, 2025, 10 (05): : 4572 - 4579
[3] Language-guided Image Reflection Separation
Zhong, Haofeng
Hong, Yuchen
Weng, Shuchen
Liang, Jinxiu
Shi, Boxin
2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2024, : 24913 - 24922
[4] LAGOON: Language-Guided Motion Control
Xu, Shusheng
Wang, Huaijie
Ouyang, Yutao
Gao, Jiaxuan
Meng, Zhiyu
Yu, Chao
Wu, Yi
2024 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2024), 2024, : 9743 - 9750
[5] DocEdit: Language-Guided Document Editing
Mathur, Puneet
Jain, Rajiv
Gu, Jiuxiang
Dernoncourt, Franck
Manocha, Dinesh
Morariu, Vlad I.
THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 2, 2023, : 1914 - 1922
[6] CLIP-It! Language-Guided Video Summarization
Narasimhan, Medhini
Rohrbach, Anna
Darrell, Trevor
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
[7] LIDNeRF: Language-Guided NeRF Editing With InstructDiffusion
Kulkarni, Vaishali
Sharma, Khushal Hemant
Shah, Manan
Vinay, Aniruddh
INTERNATIONAL JOURNAL OF INTELLIGENT INFORMATION TECHNOLOGIES, 2025, 21 (01)
[8] Language-Guided Controller Synthesis for Linear Systems
Gol, Ebru Aydin
Lazar, Mircea
Belta, Calin
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2014, 59 (05) : 1163 - 1176
[9] mmFilter: Language-Guided Video Analytics at the Edge
Hu, Zhiming
Ye, Ning
Phillips, Caleb
Capes, Tim
Mohomed, Iqbal
PROCEEDINGS OF THE 2020 21ST INTERNATIONAL MIDDLEWARE CONFERENCE INDUSTRIAL TRACK (MIDDLEWARE INDUSTRY '20), 2020, : 1 - 7
[10] A Hardware Accelerator for Language-Guided Reinforcement Learning
Shiri, Aidin
Mazumder, Arnab Neelim
Prakash, Bharat
Homayoun, Houman
Waytowich, Nicholas R.
Mohsenin, Tinoosh
IEEE DESIGN & TEST, 2022, 39 (03) : 37 - 44

← 1 2 3 4 5 →