A Matching Algorithm with Reinforcement Learning and Decoupling Strategy for Order Dispatching in On-Demand Food Delivery

被引:3
|
作者
Chen, Jingfang [1 ]
Wang, Ling [1 ]
Pan, Zixiao [1 ]
Wu, Yuting [1 ]
Zheng, Jie [1 ]
Ding, Xuetao [2 ]
机构
[1] Tsinghua Univ, Dept Automat, Beijing 100084, Peoples R China
[2] Meituan, Dept Delivery Technol, Beijing 100102, Peoples R China
来源
TSINGHUA SCIENCE AND TECHNOLOGY | 2024年 / 29卷 / 02期
基金
中国国家自然科学基金;
关键词
order dispatching; on-demand delivery; reinforcement learning; decoupling strategy; sequence-to-sequence neural network; MEAL-DELIVERY;
D O I
10.26599/TST.2023.9010069
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The on-demand food delivery (OFD) service has gained rapid development in the past decades but meanwhile encounters challenges for further improving operation quality. The order dispatching problem is one of the most concerning issues for the OFD platforms, which refer to dynamically dispatching a large number of orders to riders reasonably in very limited decision time. To solve such a challenging combinatorial optimization problem, an effective matching algorithm is proposed by fusing the reinforcement learning technique and the optimization method. First, to deal with the large-scale complexity, a decoupling method is designed by reducing the matching space between new orders and riders. Second, to overcome the high dynamism and satisfy the stringent requirements on decision time, a reinforcement learning based dispatching heuristic is presented. To be specific, a sequence-tosequence neural network is constructed based on the problem characteristic to generate an order priority sequence. Besides, a training approach is specially designed to improve learning performance. Furthermore, a greedy heuristic is employed to effectively dispatch new orders according to the order priority sequence. On real-world datasets, numerical experiments are conducted to validate the effectiveness of the proposed algorithm. Statistical results show that the proposed algorithm can effectively solve the problem by improving delivery efficiency and maintaining customer satisfaction.
引用
收藏
页码:386 / 399
页数:14
相关论文
共 50 条
  • [31] Recommending-and-Grabbing: A Crowdsourcing-Based Order Allocation Pattern for On-Demand Food Delivery
    Wang, Xing
    Wang, Ling
    Wang, Shengyao
    Pan, Jize
    Ren, Hao
    Zheng, Jie
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2023, 24 (01) : 838 - 853
  • [32] Impacts of food accessibility and built environment on on-demand food delivery usage
    Wang, Zhenzhen
    He, Sylvia Y.
    TRANSPORTATION RESEARCH PART D-TRANSPORT AND ENVIRONMENT, 2021, 100
  • [33] A deep reinforcement learning-based on-demand charging algorithm for wireless rechargeable sensor networks
    Cao, Xianbo
    Xu, Wenzheng
    Liu, Xuxun
    Peng, Jian
    Liu, Tang
    AD HOC NETWORKS, 2021, 110
  • [34] Autonomous order dispatching in the semiconductor industry using reinforcement learning
    Kuhnle, Andreas
    Roehrig, Nicole
    Lanza, Gisela
    12TH CIRP CONFERENCE ON INTELLIGENT COMPUTATION IN MANUFACTURING ENGINEERING, 2019, 79 : 391 - 396
  • [35] Reinforcement Learning Based Demand-Responsive Public Transit Dispatching
    Wu, Mian
    Yu, Chunhui
    Ma, Wanjing
    Wang, Ling
    Ma, Xiaolong
    CICTP 2021: ADVANCED TRANSPORTATION, ENHANCED CONNECTION, 2021, : 387 - 398
  • [36] An intelligent open trading system for on-demand delivery facilitated by deep Q network based reinforcement learning
    Guo, Chaojie
    Zhang, Lele
    Thompson, Russell G.
    Foliente, Greg
    Peng, Xiaoshuai
    INTERNATIONAL JOURNAL OF PRODUCTION RESEARCH, 2025, 63 (03) : 904 - 926
  • [37] An advanced real-time dispatching strategy for a distributed energy system based on the reinforcement learning algorithm
    Meng, Fanyi
    Bai, Yang
    Jin, Jingliang
    RENEWABLE ENERGY, 2021, 178 : 13 - 24
  • [38] Preference-Aware Task Assignment in On-Demand Taxi Dispatching: An Online Stable Matching Approach
    Zhao, Boming
    Xu, Pan
    Shi, Yexuan
    Tong, Yongxin
    Zhou, Zimu
    Zeng, Yuxiang
    THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, : 2245 - 2252
  • [39] An advanced real-time dispatching strategy for a distributed energy system based on the reinforcement learning algorithm
    Meng, Fanyi
    Bai, Yang
    Jin, Jingliang
    Renewable Energy, 2021, 178 : 13 - 24
  • [40] Applying Deep Learning Based Probabilistic Forecasting to Food Preparation Time for On-Demand Delivery Service
    Gao, Chengliang
    Zhang, Fan
    Zhou, Yue
    Feng, Ronggen
    Ru, Qiang
    Bian, Kaigui
    He, Renqing
    Sun, Zhizhao
    PROCEEDINGS OF THE 28TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, KDD 2022, 2022, : 2924 - 2934