Balancing Efficiency and Unpredictability in Multi-robot Patrolling: A MARL-Based Approach

被引:2
|
作者
Guo, Lingxiao [1 ]
Pan, Haoxuan [2 ]
Duan, Xiaoming [2 ]
He, Jianping [2 ]
机构
[1] Shanghai Jiao Tong Univ, Dept Civil Engn, Shanghai 200240, Peoples R China
[2] Shanghai Jiao Tong Univ, Dept Automat, Shanghai 200240, Peoples R China
关键词
D O I
10.1109/ICRA48891.2023.10160923
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Patrolling with multiple robots is a challenging task. While the robots collaboratively and repeatedly cover the regions of interest in the environment, their routes should satisfy two often conflicting properties: i) (efficiency) the time intervals between two consecutive visits to the regions are small; ii) (unpredictability) the patrolling trajectories are random and unpredictable. We manage to strike a balance between the two goals by i) recasting the original patrolling problem as a Graph Deep Learning problem; ii) directly solving this problem on the graph in the framework of cooperative multi-agent reinforcement learning. Treating the decisions of a team of agents as a sequence input, our model outputs the agents' actions in order by an autoregressive mechanism. Extensive simulation studies show that our approach has comparable performance with existing algorithms in terms of efficiency and outperforms them in terms of unpredictability. To our knowledge, this is the first work that successfully solves the patrolling problem with reinforcement learning on a graph.
引用
收藏
页码:3504 / 3509
页数:6
相关论文
共 50 条
  • [41] Bayesian Reinforcement Learning for Multi-Robot Decentralized Patrolling in Uncertain Environments
    Zhou, Xin
    Wang, Weiping
    Wang, Tao
    Lei, Yonglin
    Zhong, Fangcheng
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2019, 68 (12) : 11691 - 11703
  • [42] Distributed on-line dynamic task assignment for multi-robot patrolling
    Alessandro Farinelli
    Luca Iocchi
    Daniele Nardi
    Autonomous Robots, 2017, 41 : 1321 - 1345
  • [43] Distributed on-line dynamic task assignment for multi-robot patrolling
    Farinelli, Alessandro
    Iocchi, Luca
    Nardi, Daniele
    AUTONOMOUS ROBOTS, 2017, 41 (06) : 1321 - 1345
  • [44] Entrapment/Escorting and Patrolling Missions in Multi-Robot Cluster Space Control
    Mas, Ignacio
    Li, Steven
    Acain, Jose
    Kitts, Christopher
    2009 IEEE-RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS, 2009, : 5855 - 5861
  • [45] EVPRT: A MARL-Based Approach for Efficient Passage of Emergency Vehicles in Urban Vehicular Networks
    Liu, Bingyi
    Peng, Wei
    Liu, Jipeng
    Han, Weizhen
    Wang, Enshu
    IEEE CONFERENCE ON GLOBAL COMMUNICATIONS, GLOBECOM, 2023, : 4357 - 4362
  • [46] A Rule-based Approach for a Multi-robot Application
    Panescu, Doru
    Pascal, Carlos
    Olaeru, Razvan Marian
    2015 19TH INTERNATIONAL CONFERENCE ON SYSTEM THEORY, CONTROL AND COMPUTING (ICSTCC), 2015, : 75 - 80
  • [47] A potential field based approach to multi-robot manipulation
    Song, P
    Kumar, V
    2002 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, VOLS I-IV, PROCEEDINGS, 2002, : 1217 - 1222
  • [48] Time-switching patrolling controller for holonomic and nonholonomic multi-robot systems
    Sakata, Hirokazu
    Sakurama, Kazunori
    Yamazumi, Mitsuhiro
    Wada, Toshihiro
    ADVANCED ROBOTICS, 2024, 38 (9-10) : 647 - 658
  • [49] Multi-robot Patrolling in Wireless Sensor Networks using Bounded Cycle Coverage
    Popescu, Mihai-Ioan
    Rivano, Herve
    Simonin, Olivier
    2016 IEEE 28TH INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2016), 2016, : 169 - 176
  • [50] Multi-Robot Boundary Tracking with Phase and Workload Balancing
    Boardman, Michael
    Edmonds, Jeremy
    Francis, Kyle
    Clark, Christopher M.
    IEEE/RSJ 2010 INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS 2010), 2010,