Multi-vehicle routing problems with soft time windows: A multi-agent reinforcement learning approach

被引:98
|
作者
Zhang, Ke [1 ]
He, Fang [2 ]
Zhang, Zhengchao [1 ]
Lin, Xi [1 ]
Li, Meng [1 ]
机构
[1] Tsinghua Univ, Dept Civil Engn, Beijing 100084, Peoples R China
[2] Tsinghua Univ, Dept Ind Engn, Beijing 100084, Peoples R China
基金
中国国家自然科学基金;
关键词
Reinforcement learning; Vehicle routing problem; Attention mechanism; Computational efficiency; Multi-agent; ALGORITHMS;
D O I
10.1016/j.trc.2020.102861
中图分类号
U [交通运输];
学科分类号
08 ; 0823 ;
摘要
Multi-vehicle routing problem with soft time windows (MVRPSTW) is an indispensable constituent in urban logistics distribution systems. Over the past decade, numerous methods for MVRPSTW have been proposed, but most are based on heuristic rules that require a large amount of computation time. With the current rapid increase of logistics demands, traditional methods incur the dilemma between computational efficiency and solution quality. To efficiently solve the problem, we propose a novel reinforcement learning algorithm called the Multi-Agent Attention Model that can solve routing problem instantly benefit from lengthy offline training. Specifically, the vehicle routing problem is regarded as a vehicle tour generation process, and an encoder decoder framework with attention layers is proposed to generate tours of multiple vehicles iteratively. Furthermore, a multi-agent reinforcement learning method with an unsupervised auxiliary network is developed for the model training. By evaluated on four synthetic networks with different scales, the results demonstrate that the proposed method consistently outperforms Google OR-Tools and traditional methods with little computation time. In addition, we validate the robustness of the well-trained model by varying the number of customers and the capacities of vehicles.
引用
收藏
页数:14
相关论文
共 50 条
  • [32] A goal programming approach to vehicle routing problems with soft time windows
    Calvete, Herminia I.
    Gale, Carmen
    Oliveros, Maria-Jose
    Sanchez-Valverde, Belen
    EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2007, 177 (03) : 1720 - 1733
  • [33] Multi-Task Multi-Objective Evolutionary Search Based on Deep Reinforcement Learning for Multi-Objective Vehicle Routing Problems with Time Windows
    Deng, Jianjun
    Wang, Junjie
    Wang, Xiaojun
    Cai, Yiqiao
    Liu, Peizhong
    SYMMETRY-BASEL, 2024, 16 (08):
  • [34] A Soft Graph Attention Reinforcement Learning for Multi-Agent Cooperation
    Wang, Huimu
    Pu, Zhiqiang
    Liu, Zhen
    Yi, Jianqiang
    Qiu, Tenghai
    2020 IEEE 16TH INTERNATIONAL CONFERENCE ON AUTOMATION SCIENCE AND ENGINEERING (CASE), 2020, : 1257 - 1262
  • [35] A multi-agent reinforcement learning approach to robot soccer
    Yong Duan
    Bao Xia Cui
    Xin He Xu
    Artificial Intelligence Review, 2012, 38 : 193 - 211
  • [36] A Data-Driven Packet Routing Algorithm for an Unmanned Aerial Vehicle Swarm: A Multi-Agent Reinforcement Learning Approach
    Qiu, Xiulin
    Xu, Lei
    Wang, Ping
    Yang, Yuwang
    Liao, Zhenqiang
    IEEE WIRELESS COMMUNICATIONS LETTERS, 2022, 11 (10) : 2160 - 2164
  • [37] Multi-Agent Active Search: A Reinforcement Learning Approach
    Igoe, Conor
    Ghods, Ramina
    Schneider, Jeff
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2022, 7 (02): : 754 - 761
  • [38] A multi-agent reinforcement learning approach to robot soccer
    Duan, Yong
    Cui, Bao Xia
    Xu, Xin He
    ARTIFICIAL INTELLIGENCE REVIEW, 2012, 38 (03) : 193 - 211
  • [39] Gossip algorithms for heterogeneous multi-vehicle routing problems
    Franceschelli, Mauro
    Rosa, Daniele
    Seatzu, Carla
    Bullo, Francesco
    NONLINEAR ANALYSIS-HYBRID SYSTEMS, 2013, 10 : 156 - 174
  • [40] A reinforcement learning-based multi-agent framework applied for solving routing and scheduling problems
    Lopes Silva, Maria Amelia
    de Souza, Sergio Ricardo
    Freitas Souza, Marcone Jamilson
    Bazzan, Ana Lucia C.
    EXPERT SYSTEMS WITH APPLICATIONS, 2019, 131 : 148 - 171