Balancing Efficiency and Unpredictability in Multi-robot Patrolling: A MARL-Based Approach

Cited by: 2

Authors:

Guo, Lingxiao [1]

Pan, Haoxuan [2]

Duan, Xiaoming [2]

He, Jianping [2]

Affiliations:

[1] Shanghai Jiao Tong Univ, Dept Civil Engn, Shanghai 200240, Peoples R China

[2] Shanghai Jiao Tong Univ, Dept Automat, Shanghai 200240, Peoples R China

Source:

2023 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, ICRA | 2023

DOI:

10.1109/ICRA48891.2023.10160923

Chinese Library Classification (CLC) number:

TP [Automation Technology, Computer Technology]

Discipline classification code:

0812

Abstract:

Patrolling with multiple robots is a challenging task. While the robots collaboratively and repeatedly cover the regions of interest in the environment, their routes should satisfy two often conflicting properties: i) (efficiency) the time intervals between two consecutive visits to the regions are small; ii) (unpredictability) the patrolling trajectories are random and unpredictable. We manage to strike a balance between the two goals by i) recasting the original patrolling problem as a Graph Deep Learning problem; ii) directly solving this problem on the graph in the framework of cooperative multi-agent reinforcement learning. Treating the decisions of a team of agents as a sequence input, our model outputs the agents' actions in order by an autoregressive mechanism. Extensive simulation studies show that our approach has comparable performance with existing algorithms in terms of efficiency and outperforms them in terms of unpredictability. To our knowledge, this is the first work that successfully solves the patrolling problem with reinforcement learning on a graph.

Pages: 3504-3509

Page count: 6
