Heterogeneous Policy Networks for Composite Robot Team Communication and Coordination

Cited by: 1
Authors
Seraj, Esmaeil [1]
Paleja, Rohan [1]
Pimentel, Luis [1]
Lee, Kin Man [1]
Wang, Zheyuan [1]
Martin, Daniel [1]
Sklar, Matthew [1]
Zhang, John [1]
Kakish, Zahi [2]
Gombolay, Matthew [1]
Affiliations
[1] Georgia Inst Technol, Atlanta, GA 30332 USA
[2] Sandia Natl Labs, Albuquerque, NM 87123 USA
Keywords
Heterogeneous robot teaming; multirobot systems; neural graph-based communication
DOI
10.1109/TRO.2024.3431829
Chinese Library Classification (CLC) number
TP24 [Robotics]
Discipline classification code
080202; 1405
Abstract
High-performing human-human teams learn intelligent and efficient communication and coordination strategies to maximize their joint utility. These teams implicitly understand the different roles of heterogeneous team members and adapt their communication protocols accordingly. Multiagent reinforcement learning (MARL) has attempted to develop computational methods for synthesizing such joint coordination-communication strategies, but emulating heterogeneous communication patterns across agents with different state, action, and observation spaces has remained a challenge. Without properly modeling agent heterogeneity, as in prior MARL work that leverages homogeneous graph networks, communication becomes less helpful and can even deteriorate the team's performance. In the past, we proposed heterogeneous policy networks (HetNet) to learn efficient and diverse communication models for coordinating cooperative heterogeneous teams. In this work, we extend HetNet to support scaling heterogeneous robot teams. Building on heterogeneous graph-attention networks, we show that HetNet not only facilitates learning heterogeneous collaborative policies, but also enables end-to-end training for learning highly efficient binarized messaging. Our empirical evaluation shows that HetNet sets a new state of the art in learning coordination and communication strategies for heterogeneous multiagent teams by achieving a 5.84% to 707.65% performance improvement over the next-best baseline across multiple domains while simultaneously achieving a 200x reduction in the required communication bandwidth.
Pages: 3833-3849
Page count: 17