GameVLM: A Decision-making Framework for Robotic Task Planning Based on Visual Language Models and Zero-sum Games

被引:1
|
作者
Mei, Aoran [1 ]
Wang, Jianhua [1 ]
Zhu, Guo-Niu [1 ]
Gan, Zhongxue [1 ]
机构
[1] Fudan Univ, Acad Engn & Technol, Shanghai 200433, Peoples R China
关键词
Task planning; Multi-agent; Visual language models; Zero-sum game theory; Decision-making;
D O I
10.1109/ICMA61710.2024.10633088
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
With their prominent scene understanding and reasoning capabilities, pre-trained visual-language models (VLMs) such as GPT-4V have attracted increasing attention in robotic task planning. Compared with traditional task planning strategies, VLMs are strong in multimodal information parsing and code generation and show remarkable efficiency. Although VLMs demonstrate great potential in robotic task planning, they suffer from challenges like hallucination, semantic complexity, and limited context. To handle such issues, this paper proposes a multi-agent framework, i.e., GameVLM, to enhance the decision-making process in robotic task planning. In this study, VLM-based decision and expert agents are presented to conduct the task planning. Specifically, decision agents are used to plan the task, and the expert agent is employed to evaluate these task plans. Zero-sum game theory is introduced to resolve inconsistencies among different agents and determine the optimal solution. Experimental results on real robots demonstrate the efficacy of the proposed framework, with an average success rate of 83.3%. Videos of our experiments are available at https://youtu.be/sam-MKCPP7Y.
引用
收藏
页码:1771 / 1776
页数:6
相关论文
共 38 条
  • [31] Hierarchical Sliding-Mode Surface-Based Adaptive Critic Tracking Control for Nonlinear Multiplayer Zero-Sum Games via Generalized Fuzzy Hyperbolic Models
    Zhao, Heng
    Zong, Guangdeng
    Zhao, Xudong
    Wang, Huanqing
    Xu, Ning
    Zhao, Ning
    IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2023, 31 (11) : 4010 - 4023
  • [32] Graph-based algorithmic design and decision-making framework for district heating and cooling plant positioning and network planning
    Ho, Chi-On
    Nie, Ting
    Su, Lingqi
    Yang, Zheng
    Schwegler, Ben
    Calvez, Philippe
    ADVANCED ENGINEERING INFORMATICS, 2021, 50
  • [33] The development of an integrated BIM-based visual demolition waste management planning system for sustainability-oriented decision-making
    Han, Dongchen
    Kalantari, Mohsen
    Rajabifard, Abbas
    JOURNAL OF ENVIRONMENTAL MANAGEMENT, 2024, 351
  • [34] Towards on Develop a Framework for the Evaluation and Benchmarking of Skin Detectors Based on Artificial Intelligent Models Using Multi-Criteria Decision-Making Techniques
    Yas, Qahtan M.
    Zadain, A. A.
    Zaidan, B. B.
    Lakulu, M. B.
    Rahmatullah, Bahbibi
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2017, 31 (03)
  • [35] Optimized Large Language Models Versus Multiple Sclerosis Specialists: Evaluating Answering Questions of Clinical Decision-Making, A Comparative Study based on clinical scenarios
    Inojosa, Hernan
    Weicken, Eva
    Voigt, Isabel
    Wenk, Judith
    Wiest, Isabella
    Ferber, Dyke
    Gilbert, Stephen
    Kather, Jakob
    Akguen, Katja
    Ziemssen, Tjalf
    MULTIPLE SCLEROSIS JOURNAL, 2024, 30 (03) : 999 - 1000
  • [36] Assessment of decision-making with locally run and web-based large language models versus human board recommendations in otorhinolaryngology, head and neck surgery
    Buhr, Christoph Raphael
    Ernst, Benjamin Philipp
    Blaikie, Andrew
    Smith, Harry
    Kelsey, Tom
    Matthias, Christoph
    Fleischmann, Maximilian
    Jungmann, Florian
    Alt, Juergen
    Brandts, Christian
    Kaemmerer, Peer W.
    Foersch, Sebastian
    Kuhn, Sebastian
    Eckrich, Jonas
    EUROPEAN ARCHIVES OF OTO-RHINO-LARYNGOLOGY, 2025, 282 (03) : 1593 - 1607
  • [37] A multi-criteria decision-making framework for compressed air energy storage power site selection based on the probabilistic language term sets and regret theory
    Gao, Jianwei
    Men, Huijuan
    Guo, Fengjia
    Liu, Huihui
    Li, Xiangzhen
    Huang, Xin
    JOURNAL OF ENERGY STORAGE, 2021, 37
  • [38] Hybrid Diagnosis Models for Autism Patients Based on Medical and Sociodemographic Features Using Machine Learning and Multicriteria Decision-Making (MCDM) Techniques: An Evaluation and Benchmarking Framework
    Alqaysi M.E.
    Albahri A.S.
    Hamid R.A.
    Computational and Mathematical Methods in Medicine, 2022, 2022