Enhancing Robot Task Planning and Execution through Multi-Layer Large Language Models

被引:0
|
作者
Luan, Zhirong [1 ]
Lai, Yujun [1 ]
Huang, Rundong [1 ]
Bai, Shuanghao [2 ]
Zhang, Yuedi [2 ]
Zhang, Haoran [2 ]
Wang, Qian [1 ]
机构
[1] Xian Univ Technol, Sch Elect Engn, Xian 710000, Peoples R China
[2] Xi An Jiao Tong Univ, Coll Artificial Intelligence, Xian 710000, Peoples R China
基金
中国国家自然科学基金;
关键词
robots; large language models; natural language; semantic alignment method;
D O I
10.3390/s24051687
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
Large language models have found utility in the domain of robot task planning and task decomposition. Nevertheless, the direct application of these models for instructing robots in task execution is not without its challenges. Limitations arise in handling more intricate tasks, encountering difficulties in effective interaction with the environment, and facing constraints in the practical executability of machine control instructions directly generated by such models. In response to these challenges, this research advocates for the implementation of a multi-layer large language model to augment a robot's proficiency in handling complex tasks. The proposed model facilitates a meticulous layer-by-layer decomposition of tasks through the integration of multiple large language models, with the overarching goal of enhancing the accuracy of task planning. Within the task decomposition process, a visual language model is introduced as a sensor for environment perception. The outcomes of this perception process are subsequently assimilated into the large language model, thereby amalgamating the task objectives with environmental information. This integration, in turn, results in the generation of robot motion planning tailored to the specific characteristics of the current environment. Furthermore, to enhance the executability of task planning outputs from the large language model, a semantic alignment method is introduced. This method aligns task planning descriptions with the functional requirements of robot motion, thereby refining the overall compatibility and coherence of the generated instructions. To validate the efficacy of the proposed approach, an experimental platform is established utilizing an intelligent unmanned vehicle. This platform serves as a means to empirically verify the proficiency of the multi-layer large language model in addressing the intricate challenges associated with both robot task planning and execution.
引用
收藏
页数:20
相关论文
共 50 条
  • [1] Neural Symbol Grounding with Multi-Layer Attention for Robot Task Planning
    Lv, Pinxin
    Ning, Li
    Jiang, Hao
    Huang, Yushuang
    Liu, Jing
    Wang, Zhaoqi
    2022 IEEE-RAS 21ST INTERNATIONAL CONFERENCE ON HUMANOID ROBOTS (HUMANOIDS), 2022, : 155 - 162
  • [2] Multi-Layer Ranking with Large Language Models for News Source Recommendation
    Zhang, Wenjia
    Gui, Lin
    Procter, Rob
    He, Yulan
    PROCEEDINGS OF THE 47TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, SIGIR 2024, 2024, : 2537 - 2542
  • [3] Human-robot interaction through joint robot planning with large language models
    Asuzu, Kosi
    Singh, Harjinder
    Idrissi, Moad
    INTELLIGENT SERVICE ROBOTICS, 2025, : 261 - 277
  • [4] PROGPROMPT: program generation for situated robot task planning using large language models
    Singh, Ishika
    Blukis, Valts
    Mousavian, Arsalan
    Goyal, Ankit
    Xu, Danfei
    Tremblay, Jonathan
    Fox, Dieter
    Thomason, Jesse
    Garg, Animesh
    AUTONOMOUS ROBOTS, 2023, 47 (08) : 999 - 1012
  • [5] ProgPrompt: program generation for situated robot task planning using large language models
    Ishika Singh
    Valts Blukis
    Arsalan Mousavian
    Ankit Goyal
    Danfei Xu
    Jonathan Tremblay
    Dieter Fox
    Jesse Thomason
    Animesh Garg
    Autonomous Robots, 2023, 47 : 999 - 1012
  • [6] TPML: Task Planning for Multi-UAV System with Large Language Models
    Cui, Jinqiang
    Liu, Guocai
    Wang, Hui
    Yu, Yue
    Yang, Jiankun
    2024 IEEE 18TH INTERNATIONAL CONFERENCE ON CONTROL & AUTOMATION, ICCA 2024, 2024, : 886 - 891
  • [7] Task Planning for a Factory Robot Using Large Language Model
    Tsushima, Yosuke
    Yamamoto, Shu
    Ravankar, Ankit A.
    Luces, Jose Victorio Salazar
    Hirata, Yasuhisa
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2025, 10 (03): : 2383 - 2390
  • [8] LiP-LLM: Integrating Linear Programming and Dependency Graph With Large Language Models for Multi-Robot Task Planning
    Obata, Kazuma
    Aoki, Tatsuya
    Horii, Takato
    Taniguchi, Tadahiro
    Nagai, Takayuki
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2025, 10 (02): : 1122 - 1129
  • [9] Optimal Multi-robot Task Planning: from Synthesis to Execution (and Back)
    Leofante, Francesco
    PROCEEDINGS OF THE TWENTY-SEVENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2018, : 5771 - 5772
  • [10] Task and Motion Planning with Large Language Models for Object Rearrangement
    Ding, Yan
    Zhang, Xiaohan
    Paxton, Chris
    Zhang, Shiqi
    2023 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS, IROS, 2023, : 2086 - 2092