共 50 条
A Graph Reinforcement Learning Framework for Neural Adaptive Large Neighbourhood Search
被引:0
|作者:
Johnn, Syu-Ning
[1
]
Darvariu, Victor-Alexandru
[2
]
Handl, Julia
[3
]
Kalcsics, Jorg
[1
]
机构:
[1] Univ Edinburgh, Sch Math, Edinburgh, Scotland
[2] UCL, Dept Comp Sci, London, England
[3] Univ Manchester, Alliance Manchester Business Sch, Manchester, England
基金:
英国工程与自然科学研究理事会;
关键词:
Machine Learning;
Adaptive Large Neighbourhood Search;
Markov Decision Process;
Deep Reinforcement Learning;
Graph Neural Networks;
VEHICLE-ROUTING PROBLEM;
COMBINATORIAL OPTIMIZATION;
HYPER-HEURISTICS;
DEPOT;
NETWORKS;
PICKUP;
D O I:
10.1016/j.cor.2024.106791
中图分类号:
TP39 [计算机的应用];
学科分类号:
081203 ;
0835 ;
摘要:
Adaptive Large Neighbourhood Search (ALNS) is a popular metaheuristic with renowned efficiency in solving combinatorial optimisation problems. However, despite 18 years of intensive research into ALNS, the design of an effective adaptive layer for selecting operators to improve the solution remains an open question. In this work, we isolate this problem by formulating it as a Markov Decision Process, in which an agent is rewarded proportionally to the improvement of the incumbent. We propose Graph Reinforcement Learning for Operator Selection (GRLOS), a method based on Deep Reinforcement Learning and Graph Neural Networks, as well as Learned Roulette Wheel (LRW), a lightweight approach inspired by the classic Roulette Wheel adaptive layer. The methods, which are broadly applicable to optimisation problems that can be represented as graphs, are comprehensively evaluated on 5 routing problems using a large portfolio of 28 destroy and 7 repair operators. Results show that both GRLOS and LRW outperform the classic selection mechanism in ALNS, owing to the operator choices being learned in a prior training phase. GRLOS is also shown to consistently achieve better performance than a recent Deep Reinforcement Learning method due to its substantially more flexible state representation. The evaluation further examines the impact of the operator budget and type of initial solution, and is applied to problem instances with up to 1000 customers. The findings arising from our extensive benchmarking bear relevance to the wider literature of hybrid methods combining metaheuristics and machine learning.
引用
收藏
页数:18
相关论文