Preference-Conditioned Language-Guided Abstraction

被引:2
|
作者
Peng, Andi [1 ]
Bobu, Andreea [2 ]
Li, Belinda Z. [1 ]
Sumers, Theodore R. [3 ]
Sucholutsky, Ilia [3 ]
Kumar, Nishanth [1 ]
Grifths, Thomas L. [3 ]
Shah, Julie A. [1 ]
机构
[1] MIT, Cambridge, MA 02139 USA
[2] Boston Dynam AI Inst, Cambridge, MA USA
[3] Princeton, Princeton, NJ USA
关键词
state abstraction; learning from human input; human preferences;
D O I
10.1145/3610977.3634930
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Learning from demonstrations is a common way for users to teach robots, but it is prone to spurious feature correlations. Recent work constructs state abstractions, i.e. visual representations containing task-relevant features, from language as a way to perform more generalizable learning. However, these abstractions also depend on a user's preference for what matters in a task, which may be hard to describe or infeasible to exhaustively specify using language alone. How do we construct abstractions to capture these latent preferences? We observe that how humans behave reveals how they see the world. Our key insight is that changes in human behavior inform us that there are diferences in preferences for how humans see the world, i.e. their state abstractions. In this work, we propose using language models (LMs) to query for those preferences directly given knowledge that a change in behavior has occurred. In our framework, we use the LM in two ways: frst, given a text description of the task and knowledge of behavioral change between states, we query the LM for possible hidden preferences; second, given the most likely preference, we query the LM to construct the state abstraction. In this framework, the LM is also able to ask the human directly when uncertain about its own estimate. We demonstrate our framework's ability to construct efective preference-conditioned abstractions in simulated experiments, a user study, as well as on a real Spot robot performing mobile manipulation tasks.
引用
收藏
页码:572 / 581
页数:10
相关论文
共 50 条
  • [1] PREFERENCE-CONDITIONED NECESSITIES: DETACHMENT AND PRACTICAL REASONING
    Lauer, Sven
    Condoravdi, Cleo
    PACIFIC PHILOSOPHICAL QUARTERLY, 2014, 95 (04) : 584 - 621
  • [2] PACER: Preference-Conditioned All-Terrain Costmap Generation
    Mao, Luisa
    Warnell, Garrett
    Stone, Peter
    Biswas, Joydeep
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2025, 10 (05): : 4572 - 4579
  • [3] Language-guided Image Reflection Separation
    Zhong, Haofeng
    Hong, Yuchen
    Weng, Shuchen
    Liang, Jinxiu
    Shi, Boxin
    2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2024, : 24913 - 24922
  • [4] LAGOON: Language-Guided Motion Control
    Xu, Shusheng
    Wang, Huaijie
    Ouyang, Yutao
    Gao, Jiaxuan
    Meng, Zhiyu
    Yu, Chao
    Wu, Yi
    2024 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2024), 2024, : 9743 - 9750
  • [5] DocEdit: Language-Guided Document Editing
    Mathur, Puneet
    Jain, Rajiv
    Gu, Jiuxiang
    Dernoncourt, Franck
    Manocha, Dinesh
    Morariu, Vlad I.
    THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 2, 2023, : 1914 - 1922
  • [6] CLIP-It! Language-Guided Video Summarization
    Narasimhan, Medhini
    Rohrbach, Anna
    Darrell, Trevor
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [7] LIDNeRF: Language-Guided NeRF Editing With InstructDiffusion
    Kulkarni, Vaishali
    Sharma, Khushal Hemant
    Shah, Manan
    Vinay, Aniruddh
    INTERNATIONAL JOURNAL OF INTELLIGENT INFORMATION TECHNOLOGIES, 2025, 21 (01)
  • [8] Language-Guided Controller Synthesis for Linear Systems
    Gol, Ebru Aydin
    Lazar, Mircea
    Belta, Calin
    IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2014, 59 (05) : 1163 - 1176
  • [9] mmFilter: Language-Guided Video Analytics at the Edge
    Hu, Zhiming
    Ye, Ning
    Phillips, Caleb
    Capes, Tim
    Mohomed, Iqbal
    PROCEEDINGS OF THE 2020 21ST INTERNATIONAL MIDDLEWARE CONFERENCE INDUSTRIAL TRACK (MIDDLEWARE INDUSTRY '20), 2020, : 1 - 7
  • [10] A Hardware Accelerator for Language-Guided Reinforcement Learning
    Shiri, Aidin
    Mazumder, Arnab Neelim
    Prakash, Bharat
    Homayoun, Houman
    Waytowich, Nicholas R.
    Mohsenin, Tinoosh
    IEEE DESIGN & TEST, 2022, 39 (03) : 37 - 44