Towards a Deep Reinforcement Learning Model of Master Bay Stowage Planning

Cited by: 1
Authors
Van Twiller, Jaike [1]
Grbic, Djordje [1]
Jensen, Rune Moller [1]
Affiliations
[1] IT Univ Copenhagen, Rued Langgaards Vej 7, DK-2300 Copenhagen, Denmark
Keywords
Maritime logistics; Liner shipping; Stowage planning; Deep reinforcement learning; Markov decision processes; CONTAINER SHIPS; METHODOLOGY; ALGORITHM; NUMBER; REDUCE
DOI
10.1007/978-3-031-43612-3_6
Chinese Library Classification (CLC) number
TP39 [Computer applications]
Discipline classification code
081203; 0835
Abstract
Major liner shipping companies aim to solve the stowage planning problem by optimally allocating containers to vessel locations during a multi-port voyage. Due to a large variety of combinatorial aspects, a scalable algorithm to solve a representative problem is yet to be found. This paper will show that deep reinforcement learning can optimize a non-trivial master bay planning problem. Our experiments show that proximal policy optimization efficiently finds reasonable solutions, serving as preliminary evidence of the potential value of deep reinforcement learning in stowage planning. In future work, we will extend our architecture to address a full-featured master bay planning problem.
Pages: 105-121
Number of pages: 17