An improved transformer model with multi-head attention and attention to attention for low-carbon multi-depot vehicle routing problem

被引:13
|
作者
Zou, Yang [1 ]
Wu, Hecheng [1 ]
Yin, Yunqiang [2 ]
Dhamotharan, Lalitha [3 ]
Chen, Daqiang [4 ]
Tiwari, Aviral Kumar [5 ]
机构
[1] Nanjing Univ Aeronaut & Astronaut, Coll Econ & Management, Nanjing 210016, Peoples R China
[2] Univ Elect Sci & Technol China, Sch Econ & Management, Chengdu 611731, Peoples R China
[3] Univ Exeter, Business Sch, Exeter EX4 4PU, Devon, England
[4] Zhejiang Gongshang Univ, Sch Management & E Business, Hangzhou 310018, Peoples R China
[5] Rajagiri Business Sch RBS, Kochi 682039, Kerala, India
基金
中国国家自然科学基金;
关键词
End-to-end deep reinforcement learning; Transformer model; Multi-head attention mechanism; Low-carbon multi-depot vehicle routing problem; GA algorithm; VARIABLE NEIGHBORHOOD SEARCH;
D O I
10.1007/s10479-022-04788-z
中图分类号
C93 [管理学]; O22 [运筹学];
学科分类号
070105 ; 12 ; 1201 ; 1202 ; 120202 ;
摘要
Low-carbon logistics is an emerging and sustainable development industry in the era of a low-carbon economy. The end-to-end deep reinforcement learning (DRL) method with an encoder-decoder framework has been proven effective for solving logistics problems. However, in most cases, the recurrent neural networks (RNN) and attention mechanisms are used in encoders and decoders, which may result in the long-distance dependence problem and the neglect of the correlation between query vectors. To surround this problem, we propose an improved transformer model (TAOA) with both multi-head attention mechanism (MHA) and attention to attention mechanism (AOA), and apply it to solve the low-carbon multi-depot vehicle routing problem (MDVRP). In this model, the MHA and AOA are implemented to solve the probability of route nodes in the encoder and decoder. The MHA is used to process different parts of the input sequence, which can be calculated in parallel, and the AOA is used to deal with the deficiency problem of correlation between query results and query vectors in the MHA. The actor-critic framework based on strategy gradient is constructed to train model parameters. The 2opt operator is further used to optimize the resulting routes. Finally, extensive numerical studies are carried out to verify the effectiveness and operation efficiency of the proposed TAOA, and the results show that the proposed TAOA performs better in solving the MDVRP than the traditional transformer model (Kools), genetic algorithm (GA), and Google OR-Tools (Ortools).
引用
收藏
页码:517 / 536
页数:20
相关论文
共 50 条
  • [11] Combining Multi-Head Attention and Sparse Multi-Head Attention Networks for Session-Based Recommendation
    Zhao, Zhiwei
    Wang, Xiaoye
    Xiao, Yingyuan
    2023 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN, 2023,
  • [12] Information Aggregation for Multi-Head Attention with Routing-by-Agreement
    Li, Jian
    Yang, Baosong
    Dou, Zi-Yi
    Wang, Xing
    Lyu, Michael R.
    Tu, Zhaopeng
    2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, 2019, : 3566 - 3575
  • [13] An improved optimization algorithm for a multi-depot vehicle routing problem considering carbon emissions
    Xujin Pu
    Xulong Lu
    Guanghua Han
    Environmental Science and Pollution Research, 2022, 29 : 54940 - 54955
  • [14] An improved optimization algorithm for a multi-depot vehicle routing problem considering carbon emissions
    Pu, Xujin
    Lu, Xulong
    Han, Guanghua
    ENVIRONMENTAL SCIENCE AND POLLUTION RESEARCH, 2022, 29 (36) : 54940 - 54955
  • [15] An Improved ACO for the Multi-depot Vehicle Routing Problem with Time Windows
    Ma, Yanfang
    Han, Jie
    Kang, Kai
    Yan, Fang
    PROCEEDINGS OF THE TENTH INTERNATIONAL CONFERENCE ON MANAGEMENT SCIENCE AND ENGINEERING MANAGEMENT, 2017, 502 : 1181 - 1189
  • [16] Repulsive Attention: Rethinking Multi-head Attention as Bayesian Inference
    An, Bang
    Lyu, Jie
    Wang, Zhenyi
    Li, Chunyuan
    Hu, Changwei
    Tan, Fei
    Zhang, Ruiyi
    Hu, Yifan
    Chen, Changyou
    PROCEEDINGS OF THE 2020 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP), 2020, : 236 - 255
  • [17] Multi-Head Attention based Probabilistic Vehicle Trajectory Prediction
    Kim, Hayoung
    Kim, Dongchan
    Kim, Gihoon
    Cho, Jeongmin
    Huh, Kunsoo
    2020 IEEE INTELLIGENT VEHICLES SYMPOSIUM (IV), 2020, : 1720 - 1725
  • [18] An Improved PSO for the Multi-Depot Vehicle Routing Problem with Time Windows
    Wen, Lei
    Meng, Fanhua
    PACIIA: 2008 PACIFIC-ASIA WORKSHOP ON COMPUTATIONAL INTELLIGENCE AND INDUSTRIAL APPLICATION, VOLS 1-3, PROCEEDINGS, 2008, : 820 - 824
  • [19] VEHICLE ROUTING PROBLEM WITH MULTI-DEPOT AND MULTI-TASK
    Yang, Haoxiong
    Jing, Li
    Zhou, Yongsheng
    He, Mingke
    ICEIS 2011: PROCEEDINGS OF THE 13TH INTERNATIONAL CONFERENCE ON ENTERPRISE INFORMATION SYSTEMS, VOL 4, 2011, : 650 - 655
  • [20] Multi-Head Attention with Disagreement Regularization
    Li, Jian
    Tu, Zhaopeng
    Yang, Baosong
    Lyu, Michael R.
    Zhang, Tong
    2018 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2018), 2018, : 2897 - 2903