A Matching Algorithm with Reinforcement Learning and Decoupling Strategy for Order Dispatching in On-Demand Food Delivery

被引：3

作者：

Chen, Jingfang ^{[1
]}

Wang, Ling ^{[1
]}

Pan, Zixiao ^{[1
]}

Wu, Yuting ^{[1
]}

Zheng, Jie ^{[1
]}

Ding, Xuetao ^{[2
]}

机构：

[1] Tsinghua Univ, Dept Automat, Beijing 100084, Peoples R China

[2] Meituan, Dept Delivery Technol, Beijing 100102, Peoples R China

来源：

TSINGHUA SCIENCE AND TECHNOLOGY | 2024年 / 29卷 / 02期

基金：

中国国家自然科学基金;

关键词：

order dispatching; on-demand delivery; reinforcement learning; decoupling strategy; sequence-to-sequence neural network; MEAL-DELIVERY;

D O I：

10.26599/TST.2023.9010069

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

The on-demand food delivery (OFD) service has gained rapid development in the past decades but meanwhile encounters challenges for further improving operation quality. The order dispatching problem is one of the most concerning issues for the OFD platforms, which refer to dynamically dispatching a large number of orders to riders reasonably in very limited decision time. To solve such a challenging combinatorial optimization problem, an effective matching algorithm is proposed by fusing the reinforcement learning technique and the optimization method. First, to deal with the large-scale complexity, a decoupling method is designed by reducing the matching space between new orders and riders. Second, to overcome the high dynamism and satisfy the stringent requirements on decision time, a reinforcement learning based dispatching heuristic is presented. To be specific, a sequence-tosequence neural network is constructed based on the problem characteristic to generate an order priority sequence. Besides, a training approach is specially designed to improve learning performance. Furthermore, a greedy heuristic is employed to effectively dispatch new orders according to the order priority sequence. On real-world datasets, numerical experiments are conducted to validate the effectiveness of the proposed algorithm. Statistical results show that the proposed algorithm can effectively solve the problem by improving delivery efficiency and maintaining customer satisfaction.

引用

页码：386 / 399

页数：14

共 50 条

[31] Recommending-and-Grabbing: A Crowdsourcing-Based Order Allocation Pattern for On-Demand Food Delivery
Wang, Xing
Wang, Ling
Wang, Shengyao
Pan, Jize
Ren, Hao
Zheng, Jie
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2023, 24 (01) : 838 - 853
[32] Impacts of food accessibility and built environment on on-demand food delivery usage
Wang, Zhenzhen
He, Sylvia Y.
TRANSPORTATION RESEARCH PART D-TRANSPORT AND ENVIRONMENT, 2021, 100
[33] A deep reinforcement learning-based on-demand charging algorithm for wireless rechargeable sensor networks
Cao, Xianbo
Xu, Wenzheng
Liu, Xuxun
Peng, Jian
Liu, Tang
AD HOC NETWORKS, 2021, 110
[34] Autonomous order dispatching in the semiconductor industry using reinforcement learning
Kuhnle, Andreas
Roehrig, Nicole
Lanza, Gisela
12TH CIRP CONFERENCE ON INTELLIGENT COMPUTATION IN MANUFACTURING ENGINEERING, 2019, 79 : 391 - 396
[35] Reinforcement Learning Based Demand-Responsive Public Transit Dispatching
Wu, Mian
Yu, Chunhui
Ma, Wanjing
Wang, Ling
Ma, Xiaolong
CICTP 2021: ADVANCED TRANSPORTATION, ENHANCED CONNECTION, 2021, : 387 - 398
[36] An intelligent open trading system for on-demand delivery facilitated by deep Q network based reinforcement learning
Guo, Chaojie
Zhang, Lele
Thompson, Russell G.
Foliente, Greg
Peng, Xiaoshuai
INTERNATIONAL JOURNAL OF PRODUCTION RESEARCH, 2025, 63 (03) : 904 - 926
[37] An advanced real-time dispatching strategy for a distributed energy system based on the reinforcement learning algorithm
Meng, Fanyi
Bai, Yang
Jin, Jingliang
RENEWABLE ENERGY, 2021, 178 : 13 - 24
[38] Preference-Aware Task Assignment in On-Demand Taxi Dispatching: An Online Stable Matching Approach
Zhao, Boming
Xu, Pan
Shi, Yexuan
Tong, Yongxin
Zhou, Zimu
Zeng, Yuxiang
THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, : 2245 - 2252
[39] An advanced real-time dispatching strategy for a distributed energy system based on the reinforcement learning algorithm
Meng, Fanyi
Bai, Yang
Jin, Jingliang
Renewable Energy, 2021, 178 : 13 - 24
[40] Applying Deep Learning Based Probabilistic Forecasting to Food Preparation Time for On-Demand Delivery Service
Gao, Chengliang
Zhang, Fan
Zhou, Yue
Feng, Ronggen
Ru, Qiang
Bian, Kaigui
He, Renqing
Sun, Zhizhao
PROCEEDINGS OF THE 28TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, KDD 2022, 2022, : 2924 - 2934

← 1 2 3 4 5 →