A Matching Algorithm with Reinforcement Learning and Decoupling Strategy for Order Dispatching in On-Demand Food Delivery

被引：3

作者：

Chen, Jingfang ^{[1
]}

Wang, Ling ^{[1
]}

Pan, Zixiao ^{[1
]}

Wu, Yuting ^{[1
]}

Zheng, Jie ^{[1
]}

Ding, Xuetao ^{[2
]}

机构：

[1] Tsinghua Univ, Dept Automat, Beijing 100084, Peoples R China

[2] Meituan, Dept Delivery Technol, Beijing 100102, Peoples R China

来源：

TSINGHUA SCIENCE AND TECHNOLOGY | 2024年 / 29卷 / 02期

基金：

中国国家自然科学基金;

关键词：

order dispatching; on-demand delivery; reinforcement learning; decoupling strategy; sequence-to-sequence neural network; MEAL-DELIVERY;

D O I：

10.26599/TST.2023.9010069

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

The on-demand food delivery (OFD) service has gained rapid development in the past decades but meanwhile encounters challenges for further improving operation quality. The order dispatching problem is one of the most concerning issues for the OFD platforms, which refer to dynamically dispatching a large number of orders to riders reasonably in very limited decision time. To solve such a challenging combinatorial optimization problem, an effective matching algorithm is proposed by fusing the reinforcement learning technique and the optimization method. First, to deal with the large-scale complexity, a decoupling method is designed by reducing the matching space between new orders and riders. Second, to overcome the high dynamism and satisfy the stringent requirements on decision time, a reinforcement learning based dispatching heuristic is presented. To be specific, a sequence-tosequence neural network is constructed based on the problem characteristic to generate an order priority sequence. Besides, a training approach is specially designed to improve learning performance. Furthermore, a greedy heuristic is employed to effectively dispatch new orders according to the order priority sequence. On real-world datasets, numerical experiments are conducted to validate the effectiveness of the proposed algorithm. Statistical results show that the proposed algorithm can effectively solve the problem by improving delivery efficiency and maintaining customer satisfaction.

引用

页码：386 / 399

页数：14

共 50 条

[41] Checklist as an Effective Means of Information Delivery in On-Demand Learning
Konjengbam, Anand
Nagayoshi, Sanetake
2020 IEEE INTERNATIONAL CONFERENCE ON INDUSTRIAL ENGINEERING AND ENGINEERING MANAGEMENT (IEEE IEEM), 2020, : 1276 - 1280
[42] Online Car-Hailing Order Matching Method Based on Demand Clustering and Reinforcement Learning
Gao, Huifei
Chen, Tian
Hao, Jingxiang
NEURAL COMPUTING FOR ADVANCED APPLICATIONS, NCAA 2024, PT I, 2025, 2181 : 30 - 45
[43] Deep Reinforcement Learning for On-demand Intelligent Routing in Deterministic Networks
Liu, Ying
Yin, Jianhui
Zhang, Weiting
Xie, Shanghan
IEEE CONFERENCE ON GLOBAL COMMUNICATIONS, GLOBECOM, 2023, : 1932 - 1937
[44] Delivery Scope: A New Way of Restaurant Retrieval For On-demand Food Delivery Service
Ding, Xuetao
Zhang, Runfeng
Mao, Zhen
Xing, Ke
Du, Fangxiao
Liu, Xingyu
Wei, Guoxing
Yin, Feifan
He, Renqing
Sun, Zhizhao
KDD '20: PROCEEDINGS OF THE 26TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2020, : 3026 - 3034
[45] Reinforcement learning based scheme for on-demand vehicular fog formation
Nsouli, Ahmad
El-Hajj, Wassim
Mourad, Azzam
VEHICULAR COMMUNICATIONS, 2023, 40
[46] Multi-Stove Scheduling for Sustainable On-Demand Food Delivery
Dai, Tao
Fan, Xiangqi
SUSTAINABILITY, 2021, 13 (23)
[47] Uncovering merchants' willingness to wait in on-demand food delivery markets
Liang, Jian
Zhao, Ya
Wang, Hai
Xiao, Zuopeng
Ke, Jintao
TRANSPORT POLICY, 2024, 158 : 14 - 28
[48] Modeling stochastic service time for complex on-demand food delivery
Jie Zheng
Ling Wang
Shengyao Wang
Jing-fang Chen
Xing Wang
Haining Duan
Yile Liang
Xuetao Ding
Complex & Intelligent Systems, 2022, 8 : 4939 - 4953
[49] Modeling stochastic service time for complex on-demand food delivery
Zheng, Jie
Wang, Ling
Wang, Shengyao
Chen, Jing-fang
Wang, Xing
Duan, Haining
Liang, Yile
Ding, Xuetao
COMPLEX & INTELLIGENT SYSTEMS, 2022, 8 (06) : 4939 - 4953
[50] Modeling and managing an on-demand meal delivery system with order bundling
Ye, Anke
Zhang, Kenan
Chen, Xiqun
Bell, Michael G. H.
Lee, Der-Horng
Hu, Simon
TRANSPORTATION RESEARCH PART E-LOGISTICS AND TRANSPORTATION REVIEW, 2024, 187

← 1 2 3 4 5 →