Semantic Tracklets: An Object-Centric Representation for Visual Multi-Agent Reinforcement Learning

被引:2
|
作者
Liu, Iou-Jen [1 ]
Ren, Zhongzheng [1 ]
Yeh, Raymond A. [1 ]
Schwing, Alexander G. [1 ]
机构
[1] Univ Illinois, Champaign, IL 61820 USA
关键词
D O I
10.1109/IROS51168.2021.9636592
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Solving complex real-world tasks, e.g., autonomous fleet control, often involves a coordinated team of multiple agents which learn strategies from visual inputs via reinforcement learning. Many existing multi-agent reinforcement learning (MARL) algorithms however don't scale to environments where agents operate on visual inputs. To address this issue, algorithmically, recent works have focused on non-stationarity and exploration. In contrast, we study whether scalability can also be achieved via a disentangled representation. For this, we explicitly construct an object-centric intermediate representation to characterize the states of an environment, which we refer to as 'semantic tracklets.' We evaluate 'semantic tracklets' on the visual multi-agent particle environment (VMPE) and on the challenging visual multi-agent GFootball environment. 'Semantic tracklets' consistently outperform baselines on VMPE, and achieve a +2.4 higher score difference than baselines on GFootball. Notably, this method is the first to successfully learn a strategy for five players in the GFootball environment using only visual data. For more, please see our project page: https://ioujenliu.github.io/SemanticTracklets
引用
收藏
页码:5603 / 5610
页数:8
相关论文
共 50 条
  • [21] Slot-VPS: Object-centric Representation Learning for Video Panoptic Segmentation
    Zhou, Yi
    Zhang, Hui
    Lee, Hana
    Sun, Shuyang
    Li, Pingjun
    Zhu, Yangguang
    Yoo, ByungIn
    Qi, Xiaojuan
    Han, Jae-Joon
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 3083 - 3093
  • [22] COMPETITIVE MULTI-AGENT REINFORCEMENT LEARNING WITH SELF-SUPERVISED REPRESENTATION
    Su, DiJia
    Lee, Jason D.
    Mulvey, John M.
    Poor, H. Vincent
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 4098 - 4102
  • [23] Learning Latent Object-Centric Representations for Visual-Based Robot Manipulation
    Wang, Yunan
    Wang, Jiayu
    Li, Yixiao
    Hu, Chuxiong
    Zhu, Yu
    2022 INTERNATIONAL CONFERENCE ON ADVANCED ROBOTICS AND MECHATRONICS (ICARM 2022), 2022, : 138 - 143
  • [24] Learning Object-Centric Representations of Multi-Object Scenes from Multiple Views
    Nanbo, Li
    Eastwood, Cian
    Fisher, Robert B.
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
  • [25] Generalization and Robustness Implications in Object-Centric Learning
    Dittadi, Andrea
    Papa, Samuele
    De Vita, Michele
    Scholkopf, Bernhard
    Winther, Ole
    Locatello, Francesco
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,
  • [26] Multi-Agent Cognition Difference Reinforcement Learning for Multi-Agent Cooperation
    Wang, Huimu
    Qiu, Tenghai
    Liu, Zhen
    Pu, Zhiqiang
    Yi, Jianqiang
    Yuan, Wanmai
    2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
  • [27] Multi-Agent Reinforcement Learning With Distributed Targeted Multi-Agent Communication
    Xu, Chi
    Zhang, Hui
    Zhang, Ya
    2023 35TH CHINESE CONTROL AND DECISION CONFERENCE, CCDC, 2023, : 2915 - 2920
  • [28] Multi-Agent Uncertainty Sharing for Cooperative Multi-Agent Reinforcement Learning
    Chen, Hao
    Yang, Guangkai
    Zhang, Junge
    Yin, Qiyue
    Huang, Kaiqi
    2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,
  • [29] Hierarchical multi-agent reinforcement learning
    Mohammad Ghavamzadeh
    Sridhar Mahadevan
    Rajbala Makar
    Autonomous Agents and Multi-Agent Systems, 2006, 13 : 197 - 229
  • [30] Learning to Share in Multi-Agent Reinforcement Learning
    Yi, Yuxuan
    Li, Ge
    Wang, Yaowei
    Lu, Zongqing
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,