Learning to Send Reinforcements: Coordinating Multi-Agent Dynamic Police Patrol Dispatching and Rescheduling via Reinforcement Learning

被引:0
|
作者
Joe, Waldy [1 ]
Lau, Hoong Chuin [1 ]
机构
[1] Singapore Management Univ, Sch Comp & Informat Syst, Singapore, Singapore
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We address the problem of coordinating multiple agents in a dynamic police patrol scheduling via a Reinforcement Learning (RL) approach. Our approach utilizes Multi-Agent Value Function Approximation (MAVFA) with a rescheduling heuristic to learn dispatching and rescheduling policies jointly. Often, police operations are divided into multiple sectors for more effective and efficient operations. In a dynamic setting, incidents occur throughout the day across different sectors, disrupting initially-planned patrol schedules. To maximize policing effectiveness, police agents from different sectors cooperate by sending reinforcements to support one another in their incident response and even routine patrol. This poses an interesting research challenge on how to make such complex decision of dispatching and rescheduling involving multiple agents in a coordinated fashion within an operationally reasonable time. Unlike existing MultiAgent RL (MARL) approaches which solve similar problems by either decomposing the problem or action into multiple components, our approach learns the dispatching and rescheduling policies jointly without any decomposition step. In addition, instead of directly searching over the joint action space, we incorporate an iterative best response procedure as a decentralized optimization heuristic and an explicit coordination mechanism for a scalable and coordinated decision-making. We evaluate our approach against the commonly adopted two-stage approach and conduct a series of ablation studies to ascertain the effectiveness of our proposed learning and coordination mechanisms.
引用
收藏
页码:153 / 161
页数:9
相关论文
共 50 条
  • [31] Skill matters: Dynamic skill learning for multi-agent cooperative reinforcement learning
    Li, Tong
    Bai, Chenjia
    Xu, Kang
    Chu, Chen
    Zhu, Peican
    Wang, Zhen
    NEURAL NETWORKS, 2025, 181
  • [32] LDSA: Learning Dynamic Subtask Assignment in Cooperative Multi-Agent Reinforcement Learning
    Yang, Mingyu
    Zhao, Jian
    Hu, Xunhan
    Zhou, Wengang
    Zhu, Jiangcheng
    Li, Houqiang
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [33] Cooperative Capture by Multi-Agent using Reinforcement Learning Application for Security Patrol Systems
    Shimada, Yasuyuki
    Ohtsuka, Hirofumi
    Matsumoto, Tadashi
    Harada, Maya
    2015 10TH ASIAN CONTROL CONFERENCE (ASCC), 2015,
  • [34] Multi-Agent Motion Planning for Dense and Dynamic Environments via Deep Reinforcement Learning
    Semnani, Samaneh Hosseini
    Liu, Hugh
    Everett, Michael
    de Ruiter, Anton
    How, Jonathan P.
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2020, 5 (02): : 3221 - 3226
  • [35] Dynamic Multichannel Access via Multi-agent Reinforcement Learning: Throughput and Fairness Guarantees
    Sohaib, Muhammad
    Jeong, Jongjin
    Jeon, Sang-Woon
    IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC 2021), 2021,
  • [36] Dynamic Multichannel Access via Multi-Agent Reinforcement Learning: Throughput and Fairness Guarantees
    Sohaib, Muhammad
    Jeong, Jongjin
    Jeon, Sang-Woon
    IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2022, 21 (06) : 3994 - 4008
  • [37] Dynamic Safe Interruptibility for Decentralized Multi-Agent Reinforcement Learning
    El Mhamdi, El Mandi
    Guerraoui, Rachid
    Hendrikx, Hadrien
    Maurer, Alexandre
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 30 (NIPS 2017), 2017, 30
  • [38] A multi-agent reinforcement learning approach to dynamic service composition
    Wang, Hongbing
    Wang, Xiaojun
    Hu, Xingguo
    Zhang, Xingzhi
    Gu, Mingzhu
    INFORMATION SCIENCES, 2016, 363 : 96 - 119
  • [39] MARRGM: Learning Framework for Multi-Agent Reinforcement Learning via Reinforcement Recommendation and Group Modification
    Wu, Peiliang
    Tian, Liqiang
    Zhang, Qian
    Mao, Bingyi
    Chen, Wenbai
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2024, 9 (06) : 5385 - 5392
  • [40] Output synchronization of multi-agent systems via reinforcement learning
    Liu, Yingying
    Wang, Zhanshan
    NEUROCOMPUTING, 2022, 508 : 110 - 119