Optimization of 2D Irregular Packing: Deep Reinforcement Learning with Dense Reward

被引:0
|
作者
Crescitelli, Viviana [1 ]
Oshima, Takashi [1 ]
机构
[1] Hitachi Ltd, Res & Dev Grp, Tokyo, Japan
关键词
Irregular packing; reinforcement learning; factory automation; machine learning; reward; ALGORITHM;
D O I
10.1142/S1793351X24430025
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper introduces a method to solve the 2D irregular packing problem using Deep Reinforcement Learning (Deep RL) for logistics. Our method employs a Q agent trained to predict the best placement within a container, maximizing available space. Unlike previous Deep RL algorithms, our method introduces a dense reward function at each packing step, providing immediate feedback and accelerating learning. To our knowledge, this is the first approach to use a dense reward to address the 2D irregular packing problem. Building on our earlier work, we improve the deep neural network by incorporating the Double Deep Q-Network (DDQN) framework to enhance our deep Q-learning approach, reducing overestimation biases and improving decision-making reliability. Simulation results show the method's effectiveness in completing the online 2D irregular packing tasks, achieving promising volume efficiency and packed piece metrics. This research extends our initial findings, highlighting the practical importance of DDQN and dense reward in advancing 2D irregular packing problem-solving. These advancements not only broaden the applications of deep learning but also hold practical importance for real-world logistics challenges.
引用
收藏
页码:405 / 416
页数:12
相关论文
共 50 条
  • [21] Optimizing 2D irregular packing via image processing and computational intelligence
    Longhui Meng
    Liang Ding
    Yunkai Pu
    Xiaomeng Wang
    Ray Tahir Mushtaq
    Mohammed Alkahtani
    Muhammad Shakeel
    Scientific Reports, 15 (1)
  • [22] 2D shape reconstruction of irregular particles with deep learning based on interferometric particle imaging
    Fan, Wenbo
    Sun, Jinlu
    Qiu, Yue
    Wu, Yuhang
    Chen, Shengyong
    APPLIED OPTICS, 2022, 61 (32) : 9595 - 9602
  • [23] D2D Resource Allocation Optimization Algorithm Based on Deep Reinforcement Learning
    Zheng, Kaijin
    Wang, Yijun
    Zhuang, Dawei
    Zhou, Jialu
    2024 4TH INTERNATIONAL CONFERENCE ON ELECTRONIC MATERIALS AND INFORMATION ENGINEERING, EMIE 2024, 2024, : 108 - 111
  • [24] Online VNF Placement using Deep Reinforcement Learning and Reward Constrained Policy Optimization
    Mohamed, Ramy
    Avgeris, Marios
    Leivadeas, Aris
    Lambadaris, Ioannis
    2024 IEEE INTERNATIONAL MEDITERRANEAN CONFERENCE ON COMMUNICATIONS AND NETWORKING, MEDITCOM 2024, 2024, : 269 - 274
  • [25] Dense Robotic Packing of Irregular and Novel 3-D Objects
    Wang, Fan
    Hauser, Kris
    IEEE TRANSACTIONS ON ROBOTICS, 2022, 38 (02) : 1160 - 1173
  • [26] Variance aware reward smoothing for deep reinforcement learning
    Dong, Yunlong
    Zhang, Shengjun
    Liu, Xing
    Zhang, Yu
    Shen, Tan
    NEUROCOMPUTING, 2021, 458 : 327 - 335
  • [27] Deep reinforcement learning with reward design for quantum control
    Yu H.
    Zhao X.
    IEEE Transactions on Artificial Intelligence, 2024, 5 (03): : 1087 - 1101
  • [28] Reward Space Noise for Exploration in Deep Reinforcement Learning
    Sun, Chuxiong
    Wang, Rui
    Li, Qian
    Hu, Xiaohui
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2021, 35 (10)
  • [29] Deep Reinforcement Learning for Video Summarization with Semantic Reward
    Sun, Haoran
    Zhu, Xiaolong
    Zhou, Conghua
    2022 IEEE 22ND INTERNATIONAL CONFERENCE ON SOFTWARE QUALITY, RELIABILITY, AND SECURITY COMPANION, QRS-C, 2022, : 754 - 755
  • [30] Heuristics Integrated Deep Reinforcement Learning for Online 3D Bin Packing
    Yang, Shuo
    Song, Shuai
    Chu, Shilei
    Song, Ran
    Cheng, Jiyu
    Li, Yibin
    Zhang, Wei
    IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2024, 21 (01) : 939 - 950