A Matching Algorithm with Reinforcement Learning and Decoupling Strategy for Order Dispatching in On-Demand Food Delivery

被引:3
|
作者
Chen, Jingfang [1 ]
Wang, Ling [1 ]
Pan, Zixiao [1 ]
Wu, Yuting [1 ]
Zheng, Jie [1 ]
Ding, Xuetao [2 ]
机构
[1] Tsinghua Univ, Dept Automat, Beijing 100084, Peoples R China
[2] Meituan, Dept Delivery Technol, Beijing 100102, Peoples R China
来源
TSINGHUA SCIENCE AND TECHNOLOGY | 2024年 / 29卷 / 02期
基金
中国国家自然科学基金;
关键词
order dispatching; on-demand delivery; reinforcement learning; decoupling strategy; sequence-to-sequence neural network; MEAL-DELIVERY;
D O I
10.26599/TST.2023.9010069
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The on-demand food delivery (OFD) service has gained rapid development in the past decades but meanwhile encounters challenges for further improving operation quality. The order dispatching problem is one of the most concerning issues for the OFD platforms, which refer to dynamically dispatching a large number of orders to riders reasonably in very limited decision time. To solve such a challenging combinatorial optimization problem, an effective matching algorithm is proposed by fusing the reinforcement learning technique and the optimization method. First, to deal with the large-scale complexity, a decoupling method is designed by reducing the matching space between new orders and riders. Second, to overcome the high dynamism and satisfy the stringent requirements on decision time, a reinforcement learning based dispatching heuristic is presented. To be specific, a sequence-tosequence neural network is constructed based on the problem characteristic to generate an order priority sequence. Besides, a training approach is specially designed to improve learning performance. Furthermore, a greedy heuristic is employed to effectively dispatch new orders according to the order priority sequence. On real-world datasets, numerical experiments are conducted to validate the effectiveness of the proposed algorithm. Statistical results show that the proposed algorithm can effectively solve the problem by improving delivery efficiency and maintaining customer satisfaction.
引用
收藏
页码:386 / 399
页数:14
相关论文
共 50 条
  • [41] Checklist as an Effective Means of Information Delivery in On-Demand Learning
    Konjengbam, Anand
    Nagayoshi, Sanetake
    2020 IEEE INTERNATIONAL CONFERENCE ON INDUSTRIAL ENGINEERING AND ENGINEERING MANAGEMENT (IEEE IEEM), 2020, : 1276 - 1280
  • [42] Online Car-Hailing Order Matching Method Based on Demand Clustering and Reinforcement Learning
    Gao, Huifei
    Chen, Tian
    Hao, Jingxiang
    NEURAL COMPUTING FOR ADVANCED APPLICATIONS, NCAA 2024, PT I, 2025, 2181 : 30 - 45
  • [43] Deep Reinforcement Learning for On-demand Intelligent Routing in Deterministic Networks
    Liu, Ying
    Yin, Jianhui
    Zhang, Weiting
    Xie, Shanghan
    IEEE CONFERENCE ON GLOBAL COMMUNICATIONS, GLOBECOM, 2023, : 1932 - 1937
  • [44] Delivery Scope: A New Way of Restaurant Retrieval For On-demand Food Delivery Service
    Ding, Xuetao
    Zhang, Runfeng
    Mao, Zhen
    Xing, Ke
    Du, Fangxiao
    Liu, Xingyu
    Wei, Guoxing
    Yin, Feifan
    He, Renqing
    Sun, Zhizhao
    KDD '20: PROCEEDINGS OF THE 26TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2020, : 3026 - 3034
  • [45] Reinforcement learning based scheme for on-demand vehicular fog formation
    Nsouli, Ahmad
    El-Hajj, Wassim
    Mourad, Azzam
    VEHICULAR COMMUNICATIONS, 2023, 40
  • [46] Multi-Stove Scheduling for Sustainable On-Demand Food Delivery
    Dai, Tao
    Fan, Xiangqi
    SUSTAINABILITY, 2021, 13 (23)
  • [47] Uncovering merchants' willingness to wait in on-demand food delivery markets
    Liang, Jian
    Zhao, Ya
    Wang, Hai
    Xiao, Zuopeng
    Ke, Jintao
    TRANSPORT POLICY, 2024, 158 : 14 - 28
  • [48] Modeling stochastic service time for complex on-demand food delivery
    Jie Zheng
    Ling Wang
    Shengyao Wang
    Jing-fang Chen
    Xing Wang
    Haining Duan
    Yile Liang
    Xuetao Ding
    Complex & Intelligent Systems, 2022, 8 : 4939 - 4953
  • [49] Modeling stochastic service time for complex on-demand food delivery
    Zheng, Jie
    Wang, Ling
    Wang, Shengyao
    Chen, Jing-fang
    Wang, Xing
    Duan, Haining
    Liang, Yile
    Ding, Xuetao
    COMPLEX & INTELLIGENT SYSTEMS, 2022, 8 (06) : 4939 - 4953
  • [50] Modeling and managing an on-demand meal delivery system with order bundling
    Ye, Anke
    Zhang, Kenan
    Chen, Xiqun
    Bell, Michael G. H.
    Lee, Der-Horng
    Hu, Simon
    TRANSPORTATION RESEARCH PART E-LOGISTICS AND TRANSPORTATION REVIEW, 2024, 187