Bin Packing Optimization via Deep Reinforcement Learning

被引:0
|
作者
Wang, Baoying [1 ]
Lin, Zhaohui [1 ]
Kong, Weijie [1 ]
Dong, Huixu [1 ,2 ]
机构
[1] Zhejiang Univ, Mech Engn Dept, Grasp Lab, Hangzhou 310027, Peoples R China
[2] Zhejiang Key Lab Ind Big Data & Robot Intelligent, Hangzhou 310058, Peoples R China
来源
IEEE ROBOTICS AND AUTOMATION LETTERS | 2025年 / 10卷 / 03期
基金
中国国家自然科学基金;
关键词
Three-dimensional displays; Robots; Genetic algorithms; Costs; Deep reinforcement learning; Decoding; Accuracy; Search problems; Logistics; Convolution; Reinforcement learning; manipulation planning; bin packing problem (BPP); robot packing; ROBOTIC MANIPULATION; ALGORITHM;
D O I
10.1109/LRA.2025.3534070
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
The bin packing problem (BPP) has attracted enthusiastic research interest recently, owing to its widespread applications in logistics and warehousing environments. It is truly essential to optimize the bin packing to enable more objects to be packed into bins, in which the object packing order and placement position are the two crucial optimization goals. However, existing optimization methods for BPP, such as the genetic algorithm (GA), emerge as the primary issues in highly time cost and relatively low accuracy, making it difficult to implement in realistic scenarios. To well relieve related research gaps, we present a novel optimization method of 2D and 3D BPP for objects with regular shapes via deep reinforcement learning (DRL), maximizing the space utilization and minimizing the usage number of bins. First, an end-to-end DRL neural network constructed by a modified Pointer Network consisting of an encoder, a decoder and an attention module is proposed to achieve the optimal object packing order. Second, conforming to the top-down operation mode, the placement strategy based on a height map is used to determine the placement positions of the ordered objects in the bins, preventing the objects from colliding with bins and other objects in bins. Third, the reward and loss functions are defined as the indicators of the compactness, pyramid, and usage number of bins to conduct the DRL neural network training based on an on-policy actor-critic framework. Finally, we conduct extensive experiments to evaluate the performance of the proposed method, and demonstrate that our method achieves a 3% improvement and more than 50x time saving over the GA. Further, an experiment on robotic packing is implemented to validate its generalization capacity in the realistic environment.
引用
收藏
页码:2542 / 2549
页数:8
相关论文
共 50 条
  • [21] DeepPack3D: A Python']Python package for online 3D bin packing optimization by deep reinforcement learning and constructive heuristics
    Tsang, Y. P.
    Mo, D. Y.
    Chung, K. T.
    Lee, C. K. M.
    SOFTWARE IMPACTS, 2025, 23
  • [22] Optimization of 2D Irregular Packing: Deep Reinforcement Learning with Dense Reward
    Crescitelli, Viviana
    Oshima, Takashi
    INTERNATIONAL JOURNAL OF SEMANTIC COMPUTING, 2024, 18 (03) : 405 - 416
  • [23] Online 3D Bin Packing Reinforcement Learning Solution with Buffer
    Puche, Aaron Valero
    Lee, Sukhan
    2022 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2022, : 8902 - 8909
  • [24] Composite temperature profile and tooling optimization via Deep Reinforcement Learning
    Szarski, Martin
    Chauhan, Sunita
    COMPOSITES PART A-APPLIED SCIENCE AND MANUFACTURING, 2021, 142
  • [25] Adaptive Optimization of Traffic Signal Timing via Deep Reinforcement Learning
    Ma, Zibo
    Cui, Tongchao
    Deng, Wenxing
    Jiang, Fengyao
    Zhang, Liguo
    JOURNAL OF ADVANCED TRANSPORTATION, 2021, 2021
  • [26] Distributed Optimization of Regional Traffic Signals via Deep Reinforcement Learning
    Cui, Tongchao
    Liu, Xudong
    Zhang, Liguo
    2021 PROCEEDINGS OF THE 40TH CHINESE CONTROL CONFERENCE (CCC), 2021, : 6130 - 6135
  • [27] Experience Replay Optimization via ESMM for Stable Deep Reinforcement Learning
    Osei, Richard Sakyi
    Lopez, Daphne
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2024, 15 (01) : 715 - 723
  • [28] Fault Detection and Optimization for Flight Vehicles via Deep Reinforcement Learning
    Cheng, Haoyu
    Chen, Ping
    Shi, Linan
    He, Yanfeng
    Hu, Renyi
    Zhang, Jie
    2020 IEEE 16TH INTERNATIONAL CONFERENCE ON CONTROL & AUTOMATION (ICCA), 2020, : 1384 - 1389
  • [29] Learning Global Optimization by Deep Reinforcement Learning
    da Silva Filho, Moesio Wenceslau
    Barbosa, Gabriel A.
    Miranda, Pericles B. C.
    INTELLIGENT SYSTEMS, PT II, 2022, 13654 : 417 - 433
  • [30] Deep Reinforcement Learning for Multiobjective Optimization
    Li, Kaiwen
    Zhang, Tao
    Wang, Rui
    IEEE TRANSACTIONS ON CYBERNETICS, 2021, 51 (06) : 3103 - 3114