Optimization of 2D Irregular Packing: Deep Reinforcement Learning with Dense Reward

被引:0
|
作者
Crescitelli, Viviana [1 ]
Oshima, Takashi [1 ]
机构
[1] Hitachi Ltd, Res & Dev Grp, Tokyo, Japan
关键词
Irregular packing; reinforcement learning; factory automation; machine learning; reward; ALGORITHM;
D O I
10.1142/S1793351X24430025
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper introduces a method to solve the 2D irregular packing problem using Deep Reinforcement Learning (Deep RL) for logistics. Our method employs a Q agent trained to predict the best placement within a container, maximizing available space. Unlike previous Deep RL algorithms, our method introduces a dense reward function at each packing step, providing immediate feedback and accelerating learning. To our knowledge, this is the first approach to use a dense reward to address the 2D irregular packing problem. Building on our earlier work, we improve the deep neural network by incorporating the Double Deep Q-Network (DDQN) framework to enhance our deep Q-learning approach, reducing overestimation biases and improving decision-making reliability. Simulation results show the method's effectiveness in completing the online 2D irregular packing tasks, achieving promising volume efficiency and packed piece metrics. This research extends our initial findings, highlighting the practical importance of DDQN and dense reward in advancing 2D irregular packing problem-solving. These advancements not only broaden the applications of deep learning but also hold practical importance for real-world logistics challenges.
引用
收藏
页码:405 / 416
页数:12
相关论文
共 50 条
  • [41] Deep Reinforcement Learning for Mapping Quantum Circuits to 2D Nearest-Neighbor Architectures
    Li, Yangzhi
    Liu, Wen
    Li, Maoduo
    ADVANCED QUANTUM TECHNOLOGIES, 2024, 7 (02)
  • [42] Efficient 2D Simulators for Deep-Reinforcement-Learning-based Training of Navigation Approaches
    Zeng, Huajian
    Kastner, Linh
    Lambrecht, Jens
    2023 20TH INTERNATIONAL CONFERENCE ON UBIQUITOUS ROBOTS, UR, 2023, : 275 - 280
  • [43] Learning Global Optimization by Deep Reinforcement Learning
    da Silva Filho, Moesio Wenceslau
    Barbosa, Gabriel A.
    Miranda, Pericles B. C.
    INTELLIGENT SYSTEMS, PT II, 2022, 13654 : 417 - 433
  • [44] Deep Reinforcement Learning for Multiobjective Optimization
    Li, Kaiwen
    Zhang, Tao
    Wang, Rui
    IEEE TRANSACTIONS ON CYBERNETICS, 2021, 51 (06) : 3103 - 3114
  • [45] Reinforcement learning for deep portfolio optimization
    Yan, Ruyu
    Jin, Jiafei
    Han, Kun
    ELECTRONIC RESEARCH ARCHIVE, 2024, 32 (09): : 5176 - 5200
  • [46] A Modified Particle Swarm Optimization for the 2D Rectangular Packing Problem
    Shao, Libing
    Wang, Shuzong
    Li, Biruo
    Song, Huanhuan
    2010 INTERNATIONAL CONFERENCE ON INFORMATION, ELECTRONIC AND COMPUTER SCIENCE, VOLS 1-3, 2010, : 195 - 198
  • [47] Improved Sliding Algorithm for Generating No-Fit Polygon in the 2D Irregular Packing Problem
    Luo, Qiang
    Rao, Yunqing
    MATHEMATICS, 2022, 10 (16)
  • [48] Generalization in Deep Reinforcement Learning for Robotic Navigation by Reward Shaping
    Miranda, Victor R. F.
    Neto, Armando A.
    Freitas, Gustavo M.
    Mozelli, Leonardo A.
    IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 2024, 71 (06) : 6013 - 6020
  • [49] An autonomous ore packing system through deep reinforcement learning
    Ren, He
    Zhong, Rui
    ADVANCES IN SPACE RESEARCH, 2024, 74 (12) : 6366 - 6383
  • [50] DeepPack3D: A Python']Python package for online 3D bin packing optimization by deep reinforcement learning and constructive heuristics
    Tsang, Y. P.
    Mo, D. Y.
    Chung, K. T.
    Lee, C. K. M.
    SOFTWARE IMPACTS, 2025, 23