GameVLM: A Decision-making Framework for Robotic Task Planning Based on Visual Language Models and Zero-sum Games

被引:1
|
作者
Mei, Aoran [1 ]
Wang, Jianhua [1 ]
Zhu, Guo-Niu [1 ]
Gan, Zhongxue [1 ]
机构
[1] Fudan Univ, Acad Engn & Technol, Shanghai 200433, Peoples R China
关键词
Task planning; Multi-agent; Visual language models; Zero-sum game theory; Decision-making;
D O I
10.1109/ICMA61710.2024.10633088
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
With their prominent scene understanding and reasoning capabilities, pre-trained visual-language models (VLMs) such as GPT-4V have attracted increasing attention in robotic task planning. Compared with traditional task planning strategies, VLMs are strong in multimodal information parsing and code generation and show remarkable efficiency. Although VLMs demonstrate great potential in robotic task planning, they suffer from challenges like hallucination, semantic complexity, and limited context. To handle such issues, this paper proposes a multi-agent framework, i.e., GameVLM, to enhance the decision-making process in robotic task planning. In this study, VLM-based decision and expert agents are presented to conduct the task planning. Specifically, decision agents are used to plan the task, and the expert agent is employed to evaluate these task plans. Zero-sum game theory is introduced to resolve inconsistencies among different agents and determine the optimal solution. Experimental results on real robots demonstrate the efficacy of the proposed framework, with an average success rate of 83.3%. Videos of our experiments are available at https://youtu.be/sam-MKCPP7Y.
引用
收藏
页码:1771 / 1776
页数:6
相关论文
共 38 条
  • [21] Optimization-based decision-making models for disaster recovery and reconstruction planning of transportation networks
    Milad Zamanifar
    Timo Hartmann
    Natural Hazards, 2020, 104 : 1 - 25
  • [22] Optimization-based decision-making models for disaster recovery and reconstruction planning of transportation networks
    Zamanifar, Milad
    Hartmann, Timo
    NATURAL HAZARDS, 2020, 104 (01) : 1 - 25
  • [23] Digitized Decision-Making System Research for Warehouse Planning Using Navigation Patterns Based on Visual LISP
    Yao, H.
    Fang, Y. D.
    Cao, Y.
    INTERNATIONAL JOURNAL OF ONLINE ENGINEERING, 2013, 9 (05) : 68 - 72
  • [24] A decision-making framework for promoting the optimum design and planning of Nature-based Solutions at local scale
    Semeraro, Teodoro
    Gatto, Elisa
    De Bellis, Luigi
    Luvisi, Andrea
    Emmanuel, Rohinton
    Buccolieri, Riccardo
    URBAN FORESTRY & URBAN GREENING, 2023, 84
  • [25] A framework for hydro-power vibration dynamic measurement and decision-making based on natural language processing
    Yang, Peng
    Jiao, Jiangming
    Zhang, Xiaoyu
    Liu, Xianke
    Duan, Peng
    SOUND AND VIBRATION, 2025, 59 (01):
  • [26] A Multi-Task Fusion Strategy-Based Decision-Making and Planning Method for Autonomous Driving Vehicles
    Liu, Weiguo
    Xiang, Zhiyu
    Fang, Han
    Huo, Ke
    Wang, Zixu
    SENSORS, 2023, 23 (16)
  • [27] A Utility-Based Path Planning for Safe UAS Operations with a Task-Level Decision-Making Capability
    Kaya, Uluhan C.
    Dogan, Atilla
    Huber, Manfred
    2019 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS (SMC), 2019, : 1227 - 1233
  • [28] LLM-MANUF: An integrated framework of Fine-Tuning large language models for intelligent Decision-Making in manufacturing
    Du, Kaze
    Yang, Bo
    Xie, Keqiang
    Dong, Nan
    Zhang, Zhengping
    Wang, Shilong
    Mo, Fan
    ADVANCED ENGINEERING INFORMATICS, 2025, 65
  • [29] A multi-agent decision-making framework for planning and operating human-factor-based rural community
    Liu, Yi
    Xu, Xiao
    Liu, Youbo
    Liu, Junyong
    Hu, Weihao
    Yang, Nan
    Jawad, Shafqat
    Wei, Zhaobin
    JOURNAL OF CLEANER PRODUCTION, 2024, 440
  • [30] Integration of Decision-Making and Motion Planning for Autonomous Driving Based on Double-Layer Reinforcement Learning Framework
    Liao, Yaping
    Yu, Guizhen
    Chen, Peng
    Zhou, Bin
    Li, Han
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2024, 73 (03) : 3142 - 3158