Balancing Efficiency and Unpredictability in Multi-robot Patrolling: A MARL-Based Approach

被引:2
|
作者
Guo, Lingxiao [1 ]
Pan, Haoxuan [2 ]
Duan, Xiaoming [2 ]
He, Jianping [2 ]
机构
[1] Shanghai Jiao Tong Univ, Dept Civil Engn, Shanghai 200240, Peoples R China
[2] Shanghai Jiao Tong Univ, Dept Automat, Shanghai 200240, Peoples R China
关键词
D O I
10.1109/ICRA48891.2023.10160923
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Patrolling with multiple robots is a challenging task. While the robots collaboratively and repeatedly cover the regions of interest in the environment, their routes should satisfy two often conflicting properties: i) (efficiency) the time intervals between two consecutive visits to the regions are small; ii) (unpredictability) the patrolling trajectories are random and unpredictable. We manage to strike a balance between the two goals by i) recasting the original patrolling problem as a Graph Deep Learning problem; ii) directly solving this problem on the graph in the framework of cooperative multi-agent reinforcement learning. Treating the decisions of a team of agents as a sequence input, our model outputs the agents' actions in order by an autoregressive mechanism. Extensive simulation studies show that our approach has comparable performance with existing algorithms in terms of efficiency and outperforms them in terms of unpredictability. To our knowledge, this is the first work that successfully solves the patrolling problem with reinforcement learning on a graph.
引用
收藏
页码:3504 / 3509
页数:6
相关论文
共 50 条
  • [1] Leveraging CAVs to Improve Traffic Efficiency: An MARL-based Approach
    Han, Weizhen
    Wang, Enshu
    Li, Bingyi
    Liu, Zhi
    Li, Xun
    Wu, Libing
    Wang, Jianping
    2024 IEEE 44TH INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING SYSTEMS, ICDCS 2024, 2024, : 1143 - 1153
  • [2] A new approach to multi-robot harbour patrolling: theory and experiments
    Marino, Alessandro
    Antonelli, Gianluca
    Aguiar, A. Pedro
    Pascoal, Antonio
    2012 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2012, : 1760 - 1765
  • [3] A Survey on Multi-robot Patrolling Algorithms
    Portugal, David
    Rocha, Rui
    TECHNOLOGICAL INNOVATION FOR SUSTAINABILITY, 2011, 349 : 139 - 146
  • [4] Trust Modeling in Multi-Robot Patrolling
    Pippin, Charles
    Christensen, Henrik
    2014 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2014, : 59 - 66
  • [5] Homogeneous Multi-robot Patrolling Based on Humanoid Formation Configuration
    Wang, Fei
    Wang, Hongrun
    Zhou, Dianle
    Wang, Tao
    INTELLIGENT ROBOTICS AND APPLICATIONS, ICIRA 2024, PT II, 2025, 15202 : 403 - 417
  • [6] A Survey of Multi-robot Regular and Adversarial Patrolling
    Huang, Li
    Zhou, MengChu
    Hao, Kuangrong
    Hou, Edwin
    IEEE-CAA JOURNAL OF AUTOMATICA SINICA, 2019, 6 (04) : 894 - 903
  • [7] A Survey of Multi-robot Regular and Adversarial Patrolling
    Li Huang
    MengChu Zhou
    Kuangrong Hao
    Edwin Hou
    IEEE/CAA Journal of Automatica Sinica, 2019, 6 (04) : 894 - 903
  • [8] Stochastic Multi-Robot Patrolling with Limited Visibility
    Alam, Tauhidul
    Rahman, Md. Mahbubur
    Carrillo, Pedro
    Bobadilla, Leonardo
    Rapp, Brian
    JOURNAL OF INTELLIGENT & ROBOTIC SYSTEMS, 2020, 97 (02) : 411 - 429
  • [9] Stochastic Multi-Robot Patrolling with Limited Visibility
    Tauhidul Alam
    Md. Mahbubur Rahman
    Pedro Carrillo
    Leonardo Bobadilla
    Brian Rapp
    Journal of Intelligent & Robotic Systems, 2020, 97 : 411 - 429
  • [10] Randomized Multi-Robot Patrolling with Unidirectional Visibility
    Echefu, Louis
    Alam, Tauhidul
    Newaz, Abdullah Al Redwan
    2024 21ST INTERNATIONAL CONFERENCE ON UBIQUITOUS ROBOTS, UR 2024, 2024, : 324 - 329