Balancing Efficiency and Unpredictability in Multi-robot Patrolling: A MARL-Based Approach

被引：2

作者：

Guo, Lingxiao ^{[1
]}

Pan, Haoxuan ^{[2
]}

Duan, Xiaoming ^{[2
]}

He, Jianping ^{[2
]}

机构：

[1] Shanghai Jiao Tong Univ, Dept Civil Engn, Shanghai 200240, Peoples R China

[2] Shanghai Jiao Tong Univ, Dept Automat, Shanghai 200240, Peoples R China

来源：

2023 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, ICRA | 2023年

关键词：

D O I：

10.1109/ICRA48891.2023.10160923

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Patrolling with multiple robots is a challenging task. While the robots collaboratively and repeatedly cover the regions of interest in the environment, their routes should satisfy two often conflicting properties: i) (efficiency) the time intervals between two consecutive visits to the regions are small; ii) (unpredictability) the patrolling trajectories are random and unpredictable. We manage to strike a balance between the two goals by i) recasting the original patrolling problem as a Graph Deep Learning problem; ii) directly solving this problem on the graph in the framework of cooperative multi-agent reinforcement learning. Treating the decisions of a team of agents as a sequence input, our model outputs the agents' actions in order by an autoregressive mechanism. Extensive simulation studies show that our approach has comparable performance with existing algorithms in terms of efficiency and outperforms them in terms of unpredictability. To our knowledge, this is the first work that successfully solves the patrolling problem with reinforcement learning on a graph.

引用

页码：3504 / 3509

页数：6

共 50 条

[41] Bayesian Reinforcement Learning for Multi-Robot Decentralized Patrolling in Uncertain Environments
Zhou, Xin
Wang, Weiping
Wang, Tao
Lei, Yonglin
Zhong, Fangcheng
IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2019, 68 (12) : 11691 - 11703
[42] Distributed on-line dynamic task assignment for multi-robot patrolling
Alessandro Farinelli
Luca Iocchi
Daniele Nardi
Autonomous Robots, 2017, 41 : 1321 - 1345
[43] Distributed on-line dynamic task assignment for multi-robot patrolling
Farinelli, Alessandro
Iocchi, Luca
Nardi, Daniele
AUTONOMOUS ROBOTS, 2017, 41 (06) : 1321 - 1345
[44] Entrapment/Escorting and Patrolling Missions in Multi-Robot Cluster Space Control
Mas, Ignacio
Li, Steven
Acain, Jose
Kitts, Christopher
2009 IEEE-RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS, 2009, : 5855 - 5861
[45] EVPRT: A MARL-Based Approach for Efficient Passage of Emergency Vehicles in Urban Vehicular Networks
Liu, Bingyi
Peng, Wei
Liu, Jipeng
Han, Weizhen
Wang, Enshu
IEEE CONFERENCE ON GLOBAL COMMUNICATIONS, GLOBECOM, 2023, : 4357 - 4362
[46] A Rule-based Approach for a Multi-robot Application
Panescu, Doru
Pascal, Carlos
Olaeru, Razvan Marian
2015 19TH INTERNATIONAL CONFERENCE ON SYSTEM THEORY, CONTROL AND COMPUTING (ICSTCC), 2015, : 75 - 80
[47] A potential field based approach to multi-robot manipulation
Song, P
Kumar, V
2002 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, VOLS I-IV, PROCEEDINGS, 2002, : 1217 - 1222
[48] Time-switching patrolling controller for holonomic and nonholonomic multi-robot systems
Sakata, Hirokazu
Sakurama, Kazunori
Yamazumi, Mitsuhiro
Wada, Toshihiro
ADVANCED ROBOTICS, 2024, 38 (9-10) : 647 - 658
[49] Multi-robot Patrolling in Wireless Sensor Networks using Bounded Cycle Coverage
Popescu, Mihai-Ioan
Rivano, Herve
Simonin, Olivier
2016 IEEE 28TH INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2016), 2016, : 169 - 176
[50] Multi-Robot Boundary Tracking with Phase and Workload Balancing
Boardman, Michael
Edmonds, Jeremy
Francis, Kyle
Clark, Christopher M.
IEEE/RSJ 2010 INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS 2010), 2010,

← 1 2 3 4 5 →