Learning to Send Reinforcements: Coordinating Multi-Agent Dynamic Police Patrol Dispatching and Rescheduling via Reinforcement Learning

被引:0
|
作者
Joe, Waldy [1 ]
Lau, Hoong Chuin [1 ]
机构
[1] Singapore Management Univ, Sch Comp & Informat Syst, Singapore, Singapore
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We address the problem of coordinating multiple agents in a dynamic police patrol scheduling via a Reinforcement Learning (RL) approach. Our approach utilizes Multi-Agent Value Function Approximation (MAVFA) with a rescheduling heuristic to learn dispatching and rescheduling policies jointly. Often, police operations are divided into multiple sectors for more effective and efficient operations. In a dynamic setting, incidents occur throughout the day across different sectors, disrupting initially-planned patrol schedules. To maximize policing effectiveness, police agents from different sectors cooperate by sending reinforcements to support one another in their incident response and even routine patrol. This poses an interesting research challenge on how to make such complex decision of dispatching and rescheduling involving multiple agents in a coordinated fashion within an operationally reasonable time. Unlike existing MultiAgent RL (MARL) approaches which solve similar problems by either decomposing the problem or action into multiple components, our approach learns the dispatching and rescheduling policies jointly without any decomposition step. In addition, instead of directly searching over the joint action space, we incorporate an iterative best response procedure as a decentralized optimization heuristic and an explicit coordination mechanism for a scalable and coordinated decision-making. We evaluate our approach against the commonly adopted two-stage approach and conduct a series of ablation studies to ascertain the effectiveness of our proposed learning and coordination mechanisms.
引用
收藏
页码:153 / 161
页数:9
相关论文
共 50 条
  • [21] Hierarchical reinforcement learning via dynamic subspace search for multi-agent planning
    Aaron Ma
    Michael Ouimet
    Jorge Cortés
    Autonomous Robots, 2020, 44 : 485 - 503
  • [22] Dynamic Scholarly Collaborator Recommendation via Competitive Multi-Agent Reinforcement Learning
    Zhang, Yang
    Zhang, Chenwei
    Liu, Xiaozhong
    PROCEEDINGS OF THE ELEVENTH ACM CONFERENCE ON RECOMMENDER SYSTEMS (RECSYS'17), 2017, : 331 - 335
  • [23] Multi-Agent Image Classification via Reinforcement Learning
    Mousavi, Hossein K.
    Nazari, Mohammadreza
    Takac, Martin
    Motee, Nader
    2019 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2019, : 5020 - 5027
  • [24] Learning to Share in Multi-Agent Reinforcement Learning
    Yi, Yuxuan
    Li, Ge
    Wang, Yaowei
    Lu, Zongqing
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [25] Efficient Ridesharing Order Dispatching with Mean Field Multi-Agent Reinforcement Learning
    Li, Minne
    Qin, Zhiwei
    Jiao, Yan
    Yang, Yaodong
    Gong, Zhichen
    Wang, Jun
    Wang, Chenxi
    Wu, Guobin
    Ye, Jieping
    WEB CONFERENCE 2019: PROCEEDINGS OF THE WORLD WIDE WEB CONFERENCE (WWW 2019), 2019, : 983 - 994
  • [26] Multi-Agent Reinforcement Learning for Order-dispatching via Order-Vehicle Distribution Matching
    Zhou, Ming
    Jin, Jiarui
    Zhang, Weinan
    Qin, Zhiwei
    Jiao, Yan
    Wang, Chenxi
    Wu, Guobin
    Yu, Yong
    Ye, Jieping
    PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT (CIKM '19), 2019, : 2645 - 2653
  • [27] Scalable order dispatching through Federated Multi-Agent Deep Reinforcement Learning
    Jing, Yao
    Guo, Bin
    Li, Nuo
    Ding, Yasan
    Liu, Yan
    Yu, Zhiwen
    EXPERT SYSTEMS WITH APPLICATIONS, 2025, 264
  • [28] Learning to Collaborate: Multi-Scenario Ranking via Multi-Agent Reinforcement Learning
    Feng, Jun
    Li, Heng
    Huang, Minlie
    Liu, Shichen
    Ou, Wenwu
    Wang, Zhirong
    Zhu, Xiaoyan
    WEB CONFERENCE 2018: PROCEEDINGS OF THE WORLD WIDE WEB CONFERENCE (WWW2018), 2018, : 1939 - 1948
  • [29] High-efficiency Freight Train Rescheduling Enabled by Multi-agent Reinforcement Learning
    Jiang L.
    Ni S.
    Tiedao Xuebao/Journal of the China Railway Society, 2023, 45 (08): : 27 - 35
  • [30] Coordinating Multi-Agent Navigation by Learning Communication
    Hildreth, Dalto N.
    Guy, Stephen J.
    PROCEEDINGS OF THE ACM ON COMPUTER GRAPHICS AND INTERACTIVE TECHNIQUES, 2019, 2 (02)