Navigation with Large Language Models: Semantic Guesswork as a Heuristic for Planning

被引:0
|
作者
Shah, Dhruv [1 ]
Equi, Michael [1 ]
Osinski, Blazej [3 ]
Xia, Fei [2 ]
Ichter, Brian [2 ]
Levine, Sergey [1 ,2 ]
机构
[1] Univ Calif Berkeley, Berkeley, CA 94720 USA
[2] Google DeepMind, London, England
[3] Univ Warsaw, Warsaw, Poland
来源
关键词
navigation; language models; planning; semantic scene understanding; VISION;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Navigation in unfamiliar environments presents a major challenge for robots: while mapping and planning techniques can be used to build up a representation of the world, quickly discovering a path to a desired goal in unfamiliar settings with such methods often requires lengthy mapping and exploration. Humans can rapidly navigate new environments, particularly indoor environments that are laid out logically, by leveraging semantics-e.g., a kitchen often adjoins a living room, an exit sign indicates the way out, and so forth. Language models can provide robots with such knowledge, but directly using language models to instruct a robot how to reach some destination can also be impractical: while language models might produce a narrative about how to reach some goal, because they are not grounded in real-world observations, this narrative might be arbitrarily wrong. Therefore, in this paper we study how the "semantic guesswork" produced by language models can be utilized as a guiding heuristic for planning algorithms. Our method, Language Frontier Guide (LFG), uses the language model to bias exploration of novel real-world environments by incorporating the semantic knowledge stored in language models as a search heuristic for planning with either topological or metric maps. We evaluate LFG in challenging real-world environments and simulated benchmarks, outperforming uninformed exploration and other ways of using language models.
引用
收藏
页数:17
相关论文
共 50 条
  • [21] Distilling Script Knowledge from Large Language Models for Constrained Language Planning
    Yuan, Siyu
    Chen, Jiangjie
    Fu, Ziquan
    Ge, Xuyang
    Shah, Soham
    Jankowski, Charles Robert
    Xiao, Yanghua
    Yang, Deqing
    PROCEEDINGS OF THE 61ST ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL 2023, VOL 1, 2023, : 4303 - 4325
  • [22] Understanding the Importance of Evolutionary Search in Automated Heuristic Design with Large Language Models
    Zhang, Rui
    Liu, Fei
    Lin, Xi
    Wang, Zhenkun
    Lu, Zhichao
    Zhang, Qingfu
    PARALLEL PROBLEM SOLVING FROM NATURE-PPSN XVIII, PT II, PPSN 2024, 2024, 15149 : 185 - 202
  • [23] Probing the "Creativity" of Large Language Models: Can Models Produce Divergent Semantic Association?
    Chen, Honghua
    Ding, Nai
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (EMNLP 2023), 2023, : 12881 - 12888
  • [24] Web-Scale Semantic Product Search with Large Language Models
    Muhamed, Aashiq
    Srinivasan, Sriram
    Teo, Choon-Hui
    Cui, Qingjun
    Zeng, Belinda
    Chilimbi, Trishul
    Vishwanathan, S. V. N.
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2023, PT III, 2023, 13937 : 73 - 85
  • [25] Semantic Scene Understanding with Large Language Models on Unmanned Aerial Vehicles
    de Curto, J.
    de Zarza, I.
    Calafate, Carlos T.
    DRONES, 2023, 7 (02)
  • [26] Reducing hallucinations of large language models via hierarchical semantic piece
    Liu, Yanyi
    Yang, Qingwen
    Tang, Jiawei
    Guo, Tiezheng
    Wang, Chen
    Li, Pan
    Xu, Sai
    Gao, Xianlin
    Li, Zhi
    Liu, Jun
    Wen, Yingyou
    COMPLEX & INTELLIGENT SYSTEMS, 2025, 11 (05)
  • [27] Extending Context Window of Large Language Models via Semantic Compression
    Fei, Weizhi
    Niu, Xueyan
    Zhou, Pingyi
    Hou, Lu
    Bai, Bo
    Deng, Lei
    Han, Wei
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: ACL 2024, 2024, : 5169 - 5181
  • [28] Large Language Models as Commonsense Knowledge for Large-Scale Task Planning
    Zhao, Zirui
    Lee, Wee Sun
    Hsu, David
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [29] Task and Motion Planning with Large Language Models for Object Rearrangement
    Ding, Yan
    Zhang, Xiaohan
    Paxton, Chris
    Zhang, Shiqi
    2023 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS, IROS, 2023, : 2086 - 2092
  • [30] Evaluation of Pretrained Large Language Models in Embodied Planning Tasks
    Sarkisyan, Christina
    Korchemnyi, Alexandr
    Kovalev, Alexey K.
    Panov, Aleksandr, I
    ARTIFICIAL GENERAL INTELLIGENCE, AGI 2023, 2023, 13921 : 222 - 232