Optimization of 2D Irregular Packing: Deep Reinforcement Learning with Dense Reward

Cited by: 0

Authors:

Crescitelli, Viviana [1]

Oshima, Takashi [1]

Affiliations:

[1] Hitachi Ltd, Res & Dev Grp, Tokyo, Japan

Source:

INTERNATIONAL JOURNAL OF SEMANTIC COMPUTING | 2024 / Vol. 18 / No. 03

Keywords:

Irregular packing; reinforcement learning; factory automation; machine learning; reward; ALGORITHM;

DOI:

10.1142/S1793351X24430025

Chinese Library Classification (CLC) number:

TP18 [Artificial Intelligence Theory];

Subject classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

This paper introduces a method to solve the 2D irregular packing problem using Deep Reinforcement Learning (Deep RL) for logistics. Our method employs a Q agent trained to predict the best placement within a container, maximizing available space. Unlike previous Deep RL algorithms, our method introduces a dense reward function at each packing step, providing immediate feedback and accelerating learning. To our knowledge, this is the first approach to use a dense reward to address the 2D irregular packing problem. Building on our earlier work, we improve the deep neural network by incorporating the Double Deep Q-Network (DDQN) framework to enhance our deep Q-learning approach, reducing overestimation biases and improving decision-making reliability. Simulation results show the method's effectiveness in completing the online 2D irregular packing tasks, achieving promising volume efficiency and packed piece metrics. This research extends our initial findings, highlighting the practical importance of DDQN and dense reward in advancing 2D irregular packing problem-solving. These advancements not only broaden the applications of deep learning but also hold practical importance for real-world logistics challenges.
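
The two ingredients named in the abstract, a per-step (dense) reward and a Double DQN target, can be illustrated with a minimal sketch. This is not the authors' implementation: the record does not give their reward definition or network, so `dense_reward` (the fraction of the container newly covered by the placed piece) and all parameter names are hypothetical, and plain Python lists stand in for network outputs. Only the Double DQN target rule itself is standard: the online network selects the next action and the target network evaluates it.

```python
from typing import Sequence


def dense_reward(area_before: float, area_after: float,
                 container_area: float) -> float:
    """Hypothetical dense reward (an assumption, not from the paper):
    fraction of the container newly covered by the piece just placed."""
    return (area_after - area_before) / container_area


def double_dqn_target(reward: float,
                      next_q_online: Sequence[float],
                      next_q_target: Sequence[float],
                      gamma: float = 0.99,
                      done: bool = False) -> float:
    """Standard Double DQN target for a single transition.

    next_q_online: Q-values of the next state from the online network.
    next_q_target: Q-values of the next state from the target network.
    """
    if done:
        # No bootstrapping from a terminal state.
        return reward
    # Action selection uses the online network...
    best_action = max(range(len(next_q_online)),
                      key=lambda a: next_q_online[a])
    # ...while its value comes from the target network.
    return reward + gamma * next_q_target[best_action]


# Illustrative numbers only (not results from the paper).
r = dense_reward(area_before=20.0, area_after=30.0, container_area=100.0)
y = double_dqn_target(r, next_q_online=[1.0, 3.0, 2.0],
                      next_q_target=[5.0, 0.5, 4.0], gamma=0.9)
print(r, y)
```

In this toy case a plain DQN target would bootstrap from the largest target-network value (5.0), whereas the Double DQN target uses the value of the action preferred by the online network (0.5). Decoupling selection from evaluation is the mechanism usually credited with reducing overestimation.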

Pages: 405-416

Number of pages: 12

Related literature (50 items in total)

[21] Optimizing 2D irregular packing via image processing and computational intelligence
Longhui Meng
Liang Ding
Yunkai Pu
Xiaomeng Wang
Ray Tahir Mushtaq
Mohammed Alkahtani
Muhammad Shakeel
Scientific Reports, 15 (1)
[22] 2D shape reconstruction of irregular particles with deep learning based on interferometric particle imaging
Fan, Wenbo
Sun, Jinlu
Qiu, Yue
Wu, Yuhang
Chen, Shengyong
APPLIED OPTICS, 2022, 61 (32) : 9595 - 9602
[23] D2D Resource Allocation Optimization Algorithm Based on Deep Reinforcement Learning
Zheng, Kaijin
Wang, Yijun
Zhuang, Dawei
Zhou, Jialu
2024 4TH INTERNATIONAL CONFERENCE ON ELECTRONIC MATERIALS AND INFORMATION ENGINEERING, EMIE 2024, 2024 : 108 - 111
[24] Online VNF Placement using Deep Reinforcement Learning and Reward Constrained Policy Optimization
Mohamed, Ramy
Avgeris, Marios
Leivadeas, Aris
Lambadaris, Ioannis
2024 IEEE INTERNATIONAL MEDITERRANEAN CONFERENCE ON COMMUNICATIONS AND NETWORKING, MEDITCOM 2024, 2024 : 269 - 274
[25] Dense Robotic Packing of Irregular and Novel 3-D Objects
Wang, Fan
Hauser, Kris
IEEE TRANSACTIONS ON ROBOTICS, 2022, 38 (02) : 1160 - 1173
[26] Variance aware reward smoothing for deep reinforcement learning
Dong, Yunlong
Zhang, Shengjun
Liu, Xing
Zhang, Yu
Shen, Tan
NEUROCOMPUTING, 2021, 458 : 327 - 335
[27] Deep reinforcement learning with reward design for quantum control
Yu H.
Zhao X.
IEEE Transactions on Artificial Intelligence, 2024, 5 (03) : 1087 - 1101
[28] Reward Space Noise for Exploration in Deep Reinforcement Learning
Sun, Chuxiong
Wang, Rui
Li, Qian
Hu, Xiaohui
INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2021, 35 (10)
[29] Deep Reinforcement Learning for Video Summarization with Semantic Reward
Sun, Haoran
Zhu, Xiaolong
Zhou, Conghua
2022 IEEE 22ND INTERNATIONAL CONFERENCE ON SOFTWARE QUALITY, RELIABILITY, AND SECURITY COMPANION, QRS-C, 2022 : 754 - 755
[30] Heuristics Integrated Deep Reinforcement Learning for Online 3D Bin Packing
Yang, Shuo
Song, Shuai
Chu, Shilei
Song, Ran
Cheng, Jiyu
Li, Yibin
Zhang, Wei
IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2024, 21 (01) : 939 - 950
