Navigation with Large Language Models: Semantic Guesswork as a Heuristic for Planning

被引:0
|
作者
Shah, Dhruv [1 ]
Equi, Michael [1 ]
Osinski, Blazej [3 ]
Xia, Fei [2 ]
Ichter, Brian [2 ]
Levine, Sergey [1 ,2 ]
机构
[1] Univ Calif Berkeley, Berkeley, CA 94720 USA
[2] Google DeepMind, London, England
[3] Univ Warsaw, Warsaw, Poland
来源
关键词
navigation; language models; planning; semantic scene understanding; VISION;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Navigation in unfamiliar environments presents a major challenge for robots: while mapping and planning techniques can be used to build up a representation of the world, quickly discovering a path to a desired goal in unfamiliar settings with such methods often requires lengthy mapping and exploration. Humans can rapidly navigate new environments, particularly indoor environments that are laid out logically, by leveraging semantics-e.g., a kitchen often adjoins a living room, an exit sign indicates the way out, and so forth. Language models can provide robots with such knowledge, but directly using language models to instruct a robot how to reach some destination can also be impractical: while language models might produce a narrative about how to reach some goal, because they are not grounded in real-world observations, this narrative might be arbitrarily wrong. Therefore, in this paper we study how the "semantic guesswork" produced by language models can be utilized as a guiding heuristic for planning algorithms. Our method, Language Frontier Guide (LFG), uses the language model to bias exploration of novel real-world environments by incorporating the semantic knowledge stored in language models as a search heuristic for planning with either topological or metric maps. We evaluate LFG in challenging real-world environments and simulated benchmarks, outperforming uninformed exploration and other ways of using language models.
引用
收藏
页数:17
相关论文
共 50 条
  • [1] SayCanPay: Heuristic Planning with Large Language Models Using Learnable Domain Knowledge
    Hazra, Rishi
    Dos Martires, Pedro Zuidberg
    De Raedt, Luc
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 18, 2024, : 20123 - 20133
  • [2] Semantic anomaly detection with large language models
    Elhafsi, Amine
    Sinha, Rohan
    Agia, Christopher
    Schmerling, Edward
    Nesnas, Issa A. D.
    Pavone, Marco
    AUTONOMOUS ROBOTS, 2023, 47 (08) : 1035 - 1055
  • [3] Semantic anomaly detection with large language models
    Amine Elhafsi
    Rohan Sinha
    Christopher Agia
    Edward Schmerling
    Issa A. D. Nesnas
    Marco Pavone
    Autonomous Robots, 2023, 47 : 1035 - 1055
  • [4] Leveraging Large Language Models for Effective Organizational Navigation
    Chandrasekar, Haresh
    Gupta, Srishti
    Liu, Chun-Tzu
    Tsai, Chun-Hua
    PROCEEDINGS OF THE 25TH ANNUAL INTERNATIONAL CONFERENCE ON DIGITAL GOVERNMENT RESEARCH, DGO 2024, 2024, : 1020 - 1022
  • [5] Semantic Mechanical Search with Large Vision and Language Models
    Sharma, Satvik
    Huang, Huang
    Shivakumar, Kaushik
    Chen, Lawrence Yunliang
    Hoque, Ryan
    Ichter, Brian
    Goldberg, Ken
    CONFERENCE ON ROBOT LEARNING, VOL 229, 2023, 229
  • [6] NavGPT: Explicit Reasoning in Vision-and-Language Navigation with Large Language Models
    Zhou, Gengze
    Hong, Yicong
    Wu, Qi
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 7, 2024, : 7641 - 7649
  • [7] Enhancing Large Language Models with RAG for Visual Language Navigation in Continuous Environments
    Bao, Xiaoan
    Lv, Zhiqiang
    Wu, Biao
    ELECTRONICS, 2025, 14 (05):
  • [8] Leveraging large language models for autonomous robotic mapping and navigation
    Espada, Jordan Pascual
    Qiu, Sofia Yiyu
    Crespo, Ruben Gonzalez
    Carus, Juan Luis
    INTERNATIONAL JOURNAL OF ADVANCED ROBOTIC SYSTEMS, 2025, 22 (02):
  • [9] Navigation Instruction Generation with BEV Perception and Large Language Models
    Fan, Sheng
    Liu, Rui
    Wang, Wenguan
    Yang, Yi
    COMPUTER VISION-ECCV 2024, PT XXII, 2025, 15080 : 368 - 387
  • [10] Cocoon: Semantic Table Profiling Using Large Language Models
    Huang, Zezhou
    Wu, Eugene
    WORKSHOP ON HUMAN-IN-THE-LOOP DATA ANALYTICS, HILDA 2024, 2024,