Online model-based reinforcement learning for decision-making in long distance routes

被引:2
|
作者
Alcaraz, Juan J. [1 ]
Losilla, Fernando [1 ]
Caballero-Arnaldos, Luis [1 ]
机构
[1] Tech Univ Cartagena UPCT, Dept Informat & Commun Technol, Cartagena, Spain
关键词
Route scheduling; Reinforcement learning; Model predictive control; Monte Carlo tree search; VEHICLE-ROUTING PROBLEM; TIME WINDOWS; STOCHASTIC TRAVEL; OPTIMIZATION; FRAMEWORK; SERVICE;
D O I
10.1016/j.tre.2022.102790
中图分类号
F [经济];
学科分类号
02 ;
摘要
In road transportation, long-distance routes require scheduled driving times, breaks, and restperiods, in compliance with the regulations on working conditions for truck drivers, whileensuring goods are delivered within the time windows of each customer. However, routes aresubject to uncertain travel and service times, and incidents may cause additional delays, makingpredefined schedules ineffective in many real-life situations. This paper presents a reinforcementlearning (RL) algorithm capable of making en-route decisions regarding driving times, breaks,and rest periods, under uncertain conditions. Our proposal aims at maximizing the likelihood ofon-time delivery while complying with drivers' work regulations. We use an online model-basedRL strategy that needs no prior training and is more flexible than model-free RL approaches,where the agent must be trained offline before making online decisions. Our proposal combinesmodel predictive control with a rollout strategy and Monte Carlo tree search. At each decisionstage, our algorithm anticipates the consequences of all the possible decisions in a number offuture stages (the lookahead horizon), and then uses a base policy to generate a sequence ofdecisions beyond the lookahead horizon. This base policy could be, for example, a set of decisionrules based on the experience and expertise of the transportation company covering the routes.Our numerical results show that the policy obtained using our algorithm outperforms not onlythe base policy (up to 83%), but also a policy obtained offline using deep Q networks (DQN),a state-of-the-art, model-free RL algorithm.
引用
收藏
页数:21
相关论文
共 50 条
  • [21] Decision-making approaches for a model-based FDI method.
    de Miguel, LJ
    Mediavilla, M
    Perán, JR
    (SAFEPROCESS'97): FAULT DETECTION, SUPERVISION AND SAFETY FOR TECHNICAL PROCESSES 1997, VOLS 1-3, 1998, : 707 - 713
  • [22] Model-Based Wisdom of the Crowd for Sequential Decision-Making Tasks
    Thomas, Bobby
    Coon, Jeff
    Westfall, Holly A.
    Lee, Michael D.
    COGNITIVE SCIENCE, 2021, 45 (07)
  • [23] An Integrated Model for Autonomous Speed and Lane Change Decision-Making Based on Deep Reinforcement Learning
    Peng, Jiankun
    Zhang, Siyu
    Zhou, Yang
    Li, Zhibin
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (11) : 21848 - 21860
  • [24] A DECISION-MAKING METHOD FOR AUTONOMOUS VEHICLES BASED ON SIMULATION AND REINFORCEMENT LEARNING
    Zheng, Rui
    Liu, Chunming
    Guo, Qi
    PROCEEDINGS OF 2013 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS (ICMLC), VOLS 1-4, 2013, : 362 - 369
  • [25] UAVs Maneuver Decision-Making Method Based on Transfer Reinforcement Learning
    Zhu, Jindong
    Fu, Xiaowei
    Qiao, Zhe
    COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2022, 2022 : 2399796
  • [26] An Integrated Lateral and Longitudinal Decision-Making Model for Autonomous Driving Based on Deep Reinforcement Learning
    Cui, Jianxun
    Zhao, Boyuan
    Qu, Mingcheng
    JOURNAL OF ADVANCED TRANSPORTATION, 2023, 2023
  • [27] Collaborative decision-making for UAV swarm confrontation based on reinforcement learning
    Jiao, Yongkang
    Fu, Wenxing
    Cao, Xinying
    Su, Qiangqing
    Wang, Yusheng
    Shen, Zixiang
    Yu, Lanlin
    IET CONTROL THEORY AND APPLICATIONS, 2025, 19 (01):
  • [28] An integrated model for coordinating adaptive platoons and parking decision-making based on deep reinforcement learning
    Li, Jia
    Guo, Zijian
    Jiang, Ying
    Wang, Wenyuan
    Li, Xin
    COMPUTERS & INDUSTRIAL ENGINEERING, 2025, 203
  • [29] Reinforcement Learning-Based Intelligent Decision-Making for Communication Parameters
    Xie, Xia
    Dou, Zheng
    Zhang, Yabin
    KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS, 2022, 16 (09): : 2942 - 2960
  • [30] Reinforcement Learning Decision-Making for Autonomous Vehicles Based on Semantic Segmentation
    Gao, Jianping
    Liu, Ningbo
    Li, Haotian
    Li, Zhe
    Xie, Chengwei
    Gou, Yangyang
    APPLIED SCIENCES-BASEL, 2025, 15 (03):