Semantic Tracklets: An Object-Centric Representation for Visual Multi-Agent Reinforcement Learning

被引:2
|
作者
Liu, Iou-Jen [1 ]
Ren, Zhongzheng [1 ]
Yeh, Raymond A. [1 ]
Schwing, Alexander G. [1 ]
机构
[1] Univ Illinois, Champaign, IL 61820 USA
关键词
D O I
10.1109/IROS51168.2021.9636592
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Solving complex real-world tasks, e.g., autonomous fleet control, often involves a coordinated team of multiple agents which learn strategies from visual inputs via reinforcement learning. Many existing multi-agent reinforcement learning (MARL) algorithms however don't scale to environments where agents operate on visual inputs. To address this issue, algorithmically, recent works have focused on non-stationarity and exploration. In contrast, we study whether scalability can also be achieved via a disentangled representation. For this, we explicitly construct an object-centric intermediate representation to characterize the states of an environment, which we refer to as 'semantic tracklets.' We evaluate 'semantic tracklets' on the visual multi-agent particle environment (VMPE) and on the challenging visual multi-agent GFootball environment. 'Semantic tracklets' consistently outperform baselines on VMPE, and achieve a +2.4 higher score difference than baselines on GFootball. Notably, this method is the first to successfully learn a strategy for five players in the GFootball environment using only visual data. For more, please see our project page: https://ioujenliu.github.io/SemanticTracklets
引用
收藏
页码:5603 / 5610
页数:8
相关论文
共 50 条
  • [41] Enhancing cooperation by cognition differences and consistent representation in multi-agent reinforcement learning
    Ge, Hongwei
    Ge, Zhixin
    Sun, Liang
    Wang, Yuxin
    APPLIED INTELLIGENCE, 2022, 52 (09) : 9701 - 9716
  • [42] Greedy based Value Representation for Optimal Coordination in Multi-agent Reinforcement Learning
    Wan, Lipeng
    Liu, Zeyang
    Chen, Xingyu
    Lan, Xuguang
    Zheng, Nanning
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,
  • [43] Learning global spatial information for multi-view object-centric models
    Kobayashi, Yuya
    Suzuki, Masahiro
    Matsuo, Yutaka
    ADVANCED ROBOTICS, 2023, 37 (13) : 828 - 839
  • [44] Rethinking Amodal Video Segmentation from Learning Supervised Signals with Object-centric Representation
    Fan, Ke
    Lei, Jingshi
    Qian, Xuelin
    Yu, Miaopeng
    Xiao, Tianjun
    He, Tong
    Zhang, Zheng
    Fu, Yanwei
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION, ICCV, 2023, : 1272 - 1281
  • [45] MAGNet: Multi-agent Graph Network for Deep Multi-agent Reinforcement Learning
    Malysheva, Aleksandra
    Kudenko, Daniel
    Shpilman, Aleksei
    2019 XVI INTERNATIONAL SYMPOSIUM PROBLEMS OF REDUNDANCY IN INFORMATION AND CONTROL SYSTEMS (REDUNDANCY), 2019, : 171 - 176
  • [46] Finding Things in the Unknown: Semantic Object-Centric Exploration with an MAV
    Papatheodorou, Sotiris
    Funk, Nils
    Tzoumanikas, Dimos
    Choi, Christopher
    Xu, Binbin
    Leutenegger, Stefan
    2023 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, ICRA, 2023, : 3339 - 3345
  • [47] MADDPGViz: a visual analytics approach to understand multi-agent deep reinforcement learning
    Shi, Xiaoying
    Zhang, Jiaming
    Liang, Ziyi
    Seng, Dewen
    JOURNAL OF VISUALIZATION, 2023, 26 (05) : 1189 - 1205
  • [48] MADDPGViz: a visual analytics approach to understand multi-agent deep reinforcement learning
    Xiaoying Shi
    Jiaming Zhang
    Ziyi Liang
    Dewen Seng
    Journal of Visualization, 2023, 26 : 1189 - 1205
  • [49] Multi-agent reinforcement learning for prostate localization based on multi-scale image representation
    Zheng, Chenyang
    Si, Xiangyu
    Sun, Lei
    Chen, Zhang
    Yu, Linghao
    Tian, Zhiqiang
    INTERNATIONAL SYMPOSIUM ON ARTIFICIAL INTELLIGENCE AND ROBOTICS 2021, 2021, 11884
  • [50] Segmenting Moving Objects via an Object-Centric Layered Representation
    Xie, Junyu
    Xie, Weidi
    Zisserman, Andrew
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,