Learning to Send Reinforcements: Coordinating Multi-Agent Dynamic Police Patrol Dispatching and Rescheduling via Reinforcement Learning

被引：0

作者：

Joe, Waldy ^{[1
]}

Lau, Hoong Chuin ^{[1
]}

机构：

[1] Singapore Management Univ, Sch Comp & Informat Syst, Singapore, Singapore

来源：

PROCEEDINGS OF THE THIRTY-SECOND INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2023 | 2023年

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

We address the problem of coordinating multiple agents in a dynamic police patrol scheduling via a Reinforcement Learning (RL) approach. Our approach utilizes Multi-Agent Value Function Approximation (MAVFA) with a rescheduling heuristic to learn dispatching and rescheduling policies jointly. Often, police operations are divided into multiple sectors for more effective and efficient operations. In a dynamic setting, incidents occur throughout the day across different sectors, disrupting initially-planned patrol schedules. To maximize policing effectiveness, police agents from different sectors cooperate by sending reinforcements to support one another in their incident response and even routine patrol. This poses an interesting research challenge on how to make such complex decision of dispatching and rescheduling involving multiple agents in a coordinated fashion within an operationally reasonable time. Unlike existing MultiAgent RL (MARL) approaches which solve similar problems by either decomposing the problem or action into multiple components, our approach learns the dispatching and rescheduling policies jointly without any decomposition step. In addition, instead of directly searching over the joint action space, we incorporate an iterative best response procedure as a decentralized optimization heuristic and an explicit coordination mechanism for a scalable and coordinated decision-making. We evaluate our approach against the commonly adopted two-stage approach and conduct a series of ablation studies to ascertain the effectiveness of our proposed learning and coordination mechanisms.

引用

页码：153 / 161

页数：9

共 50 条

[31] Skill matters: Dynamic skill learning for multi-agent cooperative reinforcement learning
Li, Tong
Bai, Chenjia
Xu, Kang
Chu, Chen
Zhu, Peican
Wang, Zhen
NEURAL NETWORKS, 2025, 181
[32] LDSA: Learning Dynamic Subtask Assignment in Cooperative Multi-Agent Reinforcement Learning
Yang, Mingyu
Zhao, Jian
Hu, Xunhan
Zhou, Wengang
Zhu, Jiangcheng
Li, Houqiang
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
[33] Cooperative Capture by Multi-Agent using Reinforcement Learning Application for Security Patrol Systems
Shimada, Yasuyuki
Ohtsuka, Hirofumi
Matsumoto, Tadashi
Harada, Maya
2015 10TH ASIAN CONTROL CONFERENCE (ASCC), 2015,
[34] Multi-Agent Motion Planning for Dense and Dynamic Environments via Deep Reinforcement Learning
Semnani, Samaneh Hosseini
Liu, Hugh
Everett, Michael
de Ruiter, Anton
How, Jonathan P.
IEEE ROBOTICS AND AUTOMATION LETTERS, 2020, 5 (02): : 3221 - 3226
[35] Dynamic Multichannel Access via Multi-agent Reinforcement Learning: Throughput and Fairness Guarantees
Sohaib, Muhammad
Jeong, Jongjin
Jeon, Sang-Woon
IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC 2021), 2021,
[36] Dynamic Multichannel Access via Multi-Agent Reinforcement Learning: Throughput and Fairness Guarantees
Sohaib, Muhammad
Jeong, Jongjin
Jeon, Sang-Woon
IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2022, 21 (06) : 3994 - 4008
[37] Dynamic Safe Interruptibility for Decentralized Multi-Agent Reinforcement Learning
El Mhamdi, El Mandi
Guerraoui, Rachid
Hendrikx, Hadrien
Maurer, Alexandre
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 30 (NIPS 2017), 2017, 30
[38] A multi-agent reinforcement learning approach to dynamic service composition
Wang, Hongbing
Wang, Xiaojun
Hu, Xingguo
Zhang, Xingzhi
Gu, Mingzhu
INFORMATION SCIENCES, 2016, 363 : 96 - 119
[39] MARRGM: Learning Framework for Multi-Agent Reinforcement Learning via Reinforcement Recommendation and Group Modification
Wu, Peiliang
Tian, Liqiang
Zhang, Qian
Mao, Bingyi
Chen, Wenbai
IEEE ROBOTICS AND AUTOMATION LETTERS, 2024, 9 (06) : 5385 - 5392
[40] Output synchronization of multi-agent systems via reinforcement learning
Liu, Yingying
Wang, Zhanshan
NEUROCOMPUTING, 2022, 508 : 110 - 119

← 1 2 3 4 5 →