A Multi-Objective Optimization Method for Shelter Site Selection Based on Deep Reinforcement Learning

被引:0
|
作者
Zhang, Di [1 ,2 ]
Meng, Huan [1 ,2 ]
Wang, Moyang [1 ,2 ]
Xu, Xianrui [3 ]
Yan, Jianhai [4 ]
Li, Xiang [1 ,2 ,5 ,6 ,7 ]
机构
[1] East China Normal Univ, Key Lab Geog Informat Sci, Minist Educ, Shanghai, Peoples R China
[2] East China Normal Univ, Sch Geog Sci, Shanghai, Peoples R China
[3] Shanghai Univ Sport, Sch Econ & Management, Shanghai, Peoples R China
[4] Univ Shanghai Sci & Technol, Business Sch, Shanghai, Peoples R China
[5] East China Normal Univ, Low Altitude Econ Spatial Intelligence Technol Res, Shanghai, Peoples R China
[6] East China Normal Univ, Inst Cartog, Shanghai, Peoples R China
[7] East China Normal Univ, Chongqing Key Lab Precis Opt, Chongqing Inst, Chongqing, Peoples R China
关键词
deep reinforcement learning; emergency shelter; multi-objective optimization; site selection; FACILITY LOCATION; SPATIAL-ANALYSIS; MODEL; EARTHQUAKE; ALGORITHM; GIS; FRAMEWORK;
D O I
10.1111/tgis.13252
中图分类号
P9 [自然地理学]; K9 [地理];
学科分类号
0705 ; 070501 ;
摘要
Urban emergency shelters are a special type of public service facility, and their planning and construction are directly related to the safety of urban residents' lives and property, as well as to the sustainable development of the city. Existing research on the site selection of shelters is not ideal when dealing with large-scale scenarios and fails to accurately reflect the actual situation. To address this issue, this study proposes an emergency shelter site selection model based on deep reinforcement learning, IAM-PPO. This model constructs the site selection problem as a Markov Decision Process and uses deep learning to extract information from the site selection scenario. It finds the final solution for shelter locations through continuous exploration and learning. To improve the training efficiency of the model, the action masking process is innovatively applied to the model. The research results and ablation experiments using Shanghai as a case study prove that, owing to the diversity of shelter service ranges and action masking mechanism, the model proposed in this study can provide efficient and accurate shelter location services. Moreover, the customizability of this model provides meaningful reference value for other public facility location problems.
引用
收藏
页码:2722 / 2741
页数:20
相关论文
共 50 条
  • [31] Dynamic Weights in Multi-Objective Deep Reinforcement Learning
    Abels, Axel
    Roijers, Diederik M.
    Lenaerts, Tom
    Nowe, Ann
    Steckelmacher, Denis
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 97, 2019, 97
  • [32] Multi-Objective Deep Reinforcement Learning for Personalized Dose Optimization Based on Multi-Indicator Experience Replay
    Huo, Lin
    Tang, Yuepeng
    APPLIED SCIENCES-BASEL, 2023, 13 (01):
  • [33] Multi-Agent Deep Reinforcement Learning based Multi-Objective Resource Optimization in a Distributed Manufacturing System
    Shen, Xinchang
    Tham, Chen-Khong
    2024 IEEE 99TH VEHICULAR TECHNOLOGY CONFERENCE, VTC2024-SPRING, 2024,
  • [34] A reinforcement learning approach for dynamic multi-objective optimization
    Zou, Fei
    Yen, Gary G.
    Tang, Lixin
    Wang, Chunfeng
    INFORMATION SCIENCES, 2021, 546 : 815 - 834
  • [35] Multi-Objective Optimization in Disaster Backup with Reinforcement Learning
    Yi, Shanwen
    Qin, Yao
    Wang, Hua
    MATHEMATICS, 2025, 13 (03)
  • [36] Adaptive Objective Selection for Correlated Objectives in Multi-Objective Reinforcement Learning
    Brys, Tim
    Van Moffaert, Kristof
    Nowe, Ann
    Taylor, Matthew E.
    AAMAS'14: PROCEEDINGS OF THE 2014 INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS & MULTIAGENT SYSTEMS, 2014, : 1349 - 1350
  • [37] Multi-objective optimization enabling CFRP energy-efficient milling based on deep reinforcement learning
    Zhang, Meihang
    Zhang, Hua
    Yan, Wei
    Zhang, Lin
    Jiang, Zhigang
    APPLIED INTELLIGENCE, 2024, 54 (23) : 12531 - 12557
  • [38] Reinforcement learning method for the multi-objective speed trajectory optimization of a freight train
    Lin, Xuan
    Liang, Zhicheng
    Shen, Lijuan
    Zhao, Fengyuan
    Liu, Xinyu
    Sun, Pengfei
    Cao, Taiqiang
    CONTROL ENGINEERING PRACTICE, 2023, 138
  • [39] Multi-objective safe reinforcement learning: the relationship between multi-objective reinforcement learning and safe reinforcement learning
    Horie, Naoto
    Matsui, Tohgoroh
    Moriyama, Koichi
    Mutoh, Atsuko
    Inuzuka, Nobuhiro
    ARTIFICIAL LIFE AND ROBOTICS, 2019, 24 (03) : 352 - 359
  • [40] Multi-objective safe reinforcement learning: the relationship between multi-objective reinforcement learning and safe reinforcement learning
    Naoto Horie
    Tohgoroh Matsui
    Koichi Moriyama
    Atsuko Mutoh
    Nobuhiro Inuzuka
    Artificial Life and Robotics, 2019, 24 : 352 - 359