Enhancing Robot Task Planning and Execution through Multi-Layer Large Language Models

Cited by: 0
Authors
Luan, Zhirong [1]
Lai, Yujun [1]
Huang, Rundong [1]
Bai, Shuanghao [2]
Zhang, Yuedi [2]
Zhang, Haoran [2]
Wang, Qian [1]
Affiliations
[1] Xian Univ Technol, Sch Elect Engn, Xian 710000, Peoples R China
[2] Xi An Jiao Tong Univ, Coll Artificial Intelligence, Xian 710000, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
robots; large language models; natural language; semantic alignment method
DOI
10.3390/s24051687
Chinese Library Classification
O65 [Analytical Chemistry]
Discipline classification codes
070302; 081704
Abstract
Large language models (LLMs) are useful for robot task planning and task decomposition, but using them directly to instruct robots remains challenging. They struggle with more complex tasks, interact poorly with the environment, and generate machine control instructions of limited practical executability. To address these challenges, this work proposes a multi-layer large language model that improves a robot's ability to handle complex tasks. The model combines multiple LLMs to decompose tasks layer by layer, with the goal of improving task-planning accuracy. During decomposition, a visual language model serves as a sensor for environment perception; its output is fed into the LLM so that task objectives are combined with environmental information, yielding robot motion plans tailored to the current environment. To make the LLM's task-planning output more executable, a semantic alignment method aligns task-planning descriptions with the functional requirements of robot motion, improving the compatibility and coherence of the generated instructions. The approach is validated on an experimental platform built around an intelligent unmanned vehicle, which is used to verify empirically that the multi-layer large language model handles both robot task planning and execution.
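The pipeline described in the abstract can be illustrated with a minimal Python sketch. This is not the authors' implementation: the planner layers and the perception step are hard-coded stand-ins for LLM and visual-language-model calls, the primitive set `ROBOT_FUNCTIONS` is hypothetical, and plain string similarity (`difflib.SequenceMatcher`) is swapped in for whatever semantic alignment method the paper actually uses.

```python
from difflib import SequenceMatcher
from typing import Callable, Dict, List

# Hypothetical robot primitives and their descriptions (an assumption;
# the abstract does not list the vehicle's actual function set).
ROBOT_FUNCTIONS: Dict[str, str] = {
    "move_forward": "drive the vehicle forward",
    "turn_left": "turn the vehicle left",
    "turn_right": "turn the vehicle right",
    "stop": "stop the vehicle",
}

Layer = Callable[[str], List[str]]


def perceive() -> Dict[str, str]:
    """Stand-in for a visual language model describing the scene."""
    return {"door_side": "left"}


def high_level(task: str) -> List[str]:
    """Stand-in for the first LLM layer: task -> coarse sub-tasks."""
    table = {"go to the door": ["turn toward the door", "approach the door"]}
    return table.get(task, [task])


def make_low_level(scene: Dict[str, str]) -> Layer:
    """Stand-in for a second LLM layer that also sees the perceived scene."""
    def low_level(step: str) -> List[str]:
        table = {
            "turn toward the door": [f"turn the vehicle to the {scene['door_side']}"],
            "approach the door": ["drive the vehicle forward", "stop the vehicle"],
        }
        return table.get(step, [step])
    return low_level


def decompose(task: str, layers: List[Layer]) -> List[str]:
    """Apply each layer in turn, expanding every step into its sub-steps."""
    steps = [task]
    for layer in layers:
        steps = [sub for step in steps for sub in layer(step)]
    return steps


def align(step: str) -> str:
    """Map a free-text step to the primitive with the most similar description."""
    def score(name: str) -> float:
        return SequenceMatcher(None, step.lower(), ROBOT_FUNCTIONS[name]).ratio()
    return max(ROBOT_FUNCTIONS, key=score)


def plan(task: str) -> List[str]:
    """Perceive, decompose layer by layer, then align steps to primitives."""
    scene = perceive()
    steps = decompose(task, [high_level, make_low_level(scene)])
    return [align(step) for step in steps]


if __name__ == "__main__":
    print(plan("go to the door"))
```

Calling `plan("go to the door")` runs perception once, expands the task through both layers, and maps each resulting description onto one of the assumed primitives, so only functions the robot actually exposes reach the executor.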
Pages: 20