Towards a Deep Reinforcement Learning Model of Master Bay Stowage Planning

被引:1
|
作者
Van Twiller, Jaike [1 ]
Grbic, Djordje [1 ]
Jensen, Rune Moller [1 ]
机构
[1] IT Univ Copenhagen, Rued Langgaards Vej 7, DK-2300 Copenhagen, Denmark
来源
关键词
Maritime logistics; Liner shipping; Stowage planning; Deep reinforcement learning; Markov decision processes; CONTAINER SHIPS; METHODOLOGY; ALGORITHM; NUMBER; REDUCE;
D O I
10.1007/978-3-031-43612-3_6
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Major liner shipping companies aim to solve the stowage planning problem by optimally allocating containers to vessel locations during a multi-port voyage. Due to a large variety of combinatorial aspects, a scalable algorithm to solve a representative problem is yet to be found. This paper will show that deep reinforcement learning can optimize a non-trivial master bay planning problem. Our experiments show that proximal policy optimization efficiently finds reasonable solutions, serving as preliminary evidence of the potential value of deep reinforcement learning in stowage planning. In future work, we will extend our architecture to address a full-featured master bay planning problem.
引用
收藏
页码:105 / 121
页数:17
相关论文
共 50 条
  • [41] Towards Real-Time Path Planning through Deep Reinforcement Learning for a UAV in Dynamic Environments
    Chao Yan
    Xiaojia Xiang
    Chang Wang
    Journal of Intelligent & Robotic Systems, 2020, 98 : 297 - 309
  • [42] Practice Makes Perfect: Towards Learned Path Planning for Robotic Musicians using Deep Reinforcement Learning
    Hantrakul, Lamtharn
    Kondak, Zachary
    Weinberg, Gil
    PROCEEDINGS OF THE 5TH INTERNATIONAL CONFERENCE ON MOVEMENT AND COMPUTING (MOCO'18), 2018,
  • [43] Learning to navigate a crystallization model with Deep Reinforcement Learning
    Manee, Vidhyadhar
    Baratti, Roberto
    Romagnoli, Jose A.
    CHEMICAL ENGINEERING RESEARCH & DESIGN, 2022, 178 : 111 - 123
  • [44] Towards Modeling the Learning Process of Aviators using Deep Reinforcement Learning
    van Oijen, Joost
    Poppinga, Gerald
    Brouwer, Olaf
    Aliko, Andi
    Roessingh, Jan Joris
    2017 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2017, : 3439 - 3444
  • [45] Adaptive Nonlinear Model Predictive Horizon Using Deep Reinforcement Learning for Optimal Trajectory Planning
    Al Younes, Younes
    Barczyk, Martin
    DRONES, 2022, 6 (11)
  • [46] Deep Reinforcement Learning Applied to Airport Surface Movement Planning
    Tien, Shin-Lai
    Tang, Huang
    Kirk, Daniel
    Vargo, Erik
    Liu, Sheng
    2019 IEEE/AIAA 38TH DIGITAL AVIONICS SYSTEMS CONFERENCE (DASC), 2019,
  • [47] A mission planning method for deep space detectors using deep reinforcement learning
    Qi, Yuheng
    Gu, Defeng
    Liu, Yuan
    Zhu, Jubo
    Wang, Jian
    Liu, Daoping
    AEROSPACE SCIENCE AND TECHNOLOGY, 2024, 153
  • [48] Optimizing smart city planning: A deep reinforcement learning framework
    Park, Junyoung
    Baek, Jiwoo
    Song, Yujae
    ICT EXPRESS, 2025, 11 (01): : 129 - 134
  • [49] Grid Path Planning with Deep Reinforcement Learning: Preliminary Results
    Panov, Aleksandr, I
    Yakovlev, Konstantin S.
    Suvorov, Roman
    8TH ANNUAL INTERNATIONAL CONFERENCE ON BIOLOGICALLY INSPIRED COGNITIVE ARCHITECTURES, BICA 2017 (EIGHTH ANNUAL MEETING OF THE BICA SOCIETY), 2018, 123 : 347 - 353
  • [50] Application of Deep Reinforcement Learning in Mobile Robot Path Planning
    Xin, Jing
    Zhao, Huan
    Liu, Ding
    Li, Minqi
    2017 CHINESE AUTOMATION CONGRESS (CAC), 2017, : 7112 - 7116