GameVLM: A Decision-making Framework for Robotic Task Planning Based on Visual Language Models and Zero-sum Games

Cited by: 1
Authors
Mei, Aoran [1]
Wang, Jianhua [1]
Zhu, Guo-Niu [1]
Gan, Zhongxue [1]
Affiliations
[1] Fudan Univ, Acad Engn & Technol, Shanghai 200433, Peoples R China
Keywords
Task planning; Multi-agent; Visual language models; Zero-sum game theory; Decision-making
DOI
10.1109/ICMA61710.2024.10633088
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
With their prominent scene understanding and reasoning capabilities, pre-trained visual-language models (VLMs) such as GPT-4V have attracted increasing attention in robotic task planning. Compared with traditional task planning strategies, VLMs are strong in multimodal information parsing and code generation and show remarkable efficiency. Although VLMs demonstrate great potential in robotic task planning, they suffer from challenges like hallucination, semantic complexity, and limited context. To handle such issues, this paper proposes a multi-agent framework, i.e., GameVLM, to enhance the decision-making process in robotic task planning. In this study, VLM-based decision and expert agents are presented to conduct the task planning. Specifically, decision agents are used to plan the task, and the expert agent is employed to evaluate these task plans. Zero-sum game theory is introduced to resolve inconsistencies among different agents and determine the optimal solution. Experimental results on real robots demonstrate the efficacy of the proposed framework, with an average success rate of 83.3%. Videos of our experiments are available at https://youtu.be/sam-MKCPP7Y.
Pages: 1771-1776
Number of pages: 6