Combining deep reinforcement learning with heuristics to solve the traveling salesman problem

被引:0
|
作者
Hong, Li [1 ]
Liu, Yu [1 ]
Xu, Mengqiao [2 ]
Deng, Wenhui [2 ]
机构
[1] Dalian Univ Technol, Sch Software Technol, Dalian1 16620, Peoples R China
[2] Dalian Univ Technol, Sch Econ & Management, Dalian 116024, Peoples R China
基金
中国国家自然科学基金;
关键词
traveling salesman problem; deep reinforcement learning; simulated annealing algorithm; transformer model; whale optimization algorithm; 87.55.kd; 87.55.de; 07.05.Mh; ALGORITHM;
D O I
10.1088/1674-1056/ad95f1
中图分类号
O4 [物理学];
学科分类号
0702 ;
摘要
Recent studies employing deep learning to solve the traveling salesman problem (TSP) have mainly focused on learning construction heuristics. Such methods can improve TSP solutions, but still depend on additional programs. However, methods that focus on learning improvement heuristics to iteratively refine solutions remain insufficient. Traditional improvement heuristics are guided by a manually designed search strategy and may only achieve limited improvements. This paper proposes a novel framework for learning improvement heuristics, which automatically discovers better improvement policies for heuristics to iteratively solve the TSP. Our framework first designs a new architecture based on a transformer model to make the policy network parameterized, which introduces an action-dropout layer to prevent action selection from overfitting. It then proposes a deep reinforcement learning approach integrating a simulated annealing mechanism (named RL-SA) to learn the pairwise selected policy, aiming to improve the 2-opt algorithm's performance. The RL-SA leverages the whale optimization algorithm to generate initial solutions for better sampling efficiency and uses the Gaussian perturbation strategy to tackle the sparse reward problem of reinforcement learning. The experiment results show that the proposed approach is significantly superior to the state-of-the-art learning-based methods, and further reduces the gap between learning-based methods and highly optimized solvers in the benchmark datasets. Moreover, our pre-trained model M can be applied to guide the SA algorithm (named M-SA (ours)), which performs better than existing deep models in small-, medium-, and large-scale TSPLIB datasets. Additionally, the M-SA (ours) achieves excellent generalization performance in a real-world dataset on global liner shipping routes, with the optimization percentages in distance reduction ranging from 3.52% to 17.99%.
引用
收藏
页数:11
相关论文
共 50 条
  • [31] Perturbation heuristics for the pickup and delivery traveling salesman problem
    Renaud, J
    Boctor, FF
    Laporte, G
    COMPUTERS & OPERATIONS RESEARCH, 2002, 29 (09) : 1129 - 1141
  • [32] A computational study of smoothing heuristics for the traveling salesman problem
    Coy, SP
    Golden, BL
    Wasil, EA
    EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2000, 124 (01) : 15 - 27
  • [33] A Genetic Algorithm with the Mixed Heuristics for Traveling Salesman Problem
    Wang, Yong
    INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE AND APPLICATIONS, 2015, 14 (01)
  • [34] Prize-Collecting Traveling Salesman Problem: a Reinforcement Learning Approach
    Ruiz, Justin
    Gonzalez, Christopher
    Chen, Yutian
    Tang, Bin
    ICC 2023-IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS, 2023, : 4416 - 4421
  • [35] GUARANTEED PERFORMANCE HEURISTICS FOR THE BOTTLENECK TRAVELING SALESMAN PROBLEM
    PARKER, RG
    RARDIN, RL
    OPERATIONS RESEARCH LETTERS, 1984, 2 (06) : 269 - 272
  • [36] Reinforcement learning for the traveling salesman problem: Performance comparison of three algorithms
    Wang, Jiaying
    Xiao, Chenglong
    Wang, Shanshan
    Ruan, Yaqi
    JOURNAL OF ENGINEERING-JOE, 2023, 2023 (09):
  • [37] The Performance of Different Algorithms to Solve Traveling Salesman Problem
    Bi, Hanqing
    Yang, Zhuoyuan
    Wang, Mengxi
    2021 2ND INTERNATIONAL CONFERENCE ON BIG DATA & ARTIFICIAL INTELLIGENCE & SOFTWARE ENGINEERING (ICBASE 2021), 2021, : 153 - 156
  • [38] SailFish Optimizer Algorithm to Solve the Traveling Salesman Problem
    Khaoula, Cherrat
    Morad, Bouzidi
    Essaid, Riffi Mohammed
    HYBRID INTELLIGENT SYSTEMS, HIS 2021, 2022, 420 : 225 - 232
  • [39] Hard to solve instances of the Euclidean Traveling Salesman Problem
    Hougardy, Stefan
    Zhong, Xianghui
    MATHEMATICAL PROGRAMMING COMPUTATION, 2021, 13 (01) : 51 - 74
  • [40] Group Search Optimization to solve Traveling Salesman Problem
    Akhand, M. A. H.
    Junaed, A. B. M.
    Hossain, Md. Forhad
    Murase, K.
    2012 15TH INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION TECHNOLOGY (ICCIT), 2012, : 72 - 77