Bin Packing Optimization via Deep Reinforcement Learning

被引:0
|
作者
Wang, Baoying [1 ]
Lin, Zhaohui [1 ]
Kong, Weijie [1 ]
Dong, Huixu [1 ,2 ]
机构
[1] Zhejiang Univ, Mech Engn Dept, Grasp Lab, Hangzhou 310027, Peoples R China
[2] Zhejiang Key Lab Ind Big Data & Robot Intelligent, Hangzhou 310058, Peoples R China
来源
IEEE ROBOTICS AND AUTOMATION LETTERS | 2025年 / 10卷 / 03期
基金
中国国家自然科学基金;
关键词
Three-dimensional displays; Robots; Genetic algorithms; Costs; Deep reinforcement learning; Decoding; Accuracy; Search problems; Logistics; Convolution; Reinforcement learning; manipulation planning; bin packing problem (BPP); robot packing; ROBOTIC MANIPULATION; ALGORITHM;
D O I
10.1109/LRA.2025.3534070
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
The bin packing problem (BPP) has attracted enthusiastic research interest recently, owing to its widespread applications in logistics and warehousing environments. It is truly essential to optimize the bin packing to enable more objects to be packed into bins, in which the object packing order and placement position are the two crucial optimization goals. However, existing optimization methods for BPP, such as the genetic algorithm (GA), emerge as the primary issues in highly time cost and relatively low accuracy, making it difficult to implement in realistic scenarios. To well relieve related research gaps, we present a novel optimization method of 2D and 3D BPP for objects with regular shapes via deep reinforcement learning (DRL), maximizing the space utilization and minimizing the usage number of bins. First, an end-to-end DRL neural network constructed by a modified Pointer Network consisting of an encoder, a decoder and an attention module is proposed to achieve the optimal object packing order. Second, conforming to the top-down operation mode, the placement strategy based on a height map is used to determine the placement positions of the ordered objects in the bins, preventing the objects from colliding with bins and other objects in bins. Third, the reward and loss functions are defined as the indicators of the compactness, pyramid, and usage number of bins to conduct the DRL neural network training based on an on-policy actor-critic framework. Finally, we conduct extensive experiments to evaluate the performance of the proposed method, and demonstrate that our method achieves a 3% improvement and more than 50x time saving over the GA. Further, an experiment on robotic packing is implemented to validate its generalization capacity in the realistic environment.
引用
收藏
页码:2542 / 2549
页数:8
相关论文
共 50 条
  • [41] Joint Optimization Via Deep Reinforcement Learning in Wireless Networked Controlled Systems
    Ashraf, Kanwal
    Le Moullec, Yannick
    Pardy, Tamas
    Rang, Toomas
    IEEE ACCESS, 2022, 10 : 67152 - 67167
  • [42] Optimization of Neuroprosthetic Vision via End-to-End Deep Reinforcement Learning
    Kucukoglu, Burcu
    Rueckauer, Bodo
    Ahmad, Nasir
    van Steveninck, Jaap de Ruyter
    Guclu, Umut
    van Gerven, Marcel
    INTERNATIONAL JOURNAL OF NEURAL SYSTEMS, 2022, 32 (11)
  • [43] Transforming Cooling Optimization for Green Data Center via Deep Reinforcement Learning
    Li, Yuanlong
    Wen, Yonggang
    Tao, Dacheng
    Guan, Kyle
    IEEE TRANSACTIONS ON CYBERNETICS, 2020, 50 (05) : 2002 - 2013
  • [44] Route Optimization via Environment-Aware Deep Network and Reinforcement Learning
    Guo, Pengzhan
    Xiao, Keli
    Ye, Zeyang
    Zhu, Wei
    ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2021, 12 (06)
  • [45] Performance Optimization in Mobile-Edge Computing via Deep Reinforcement Learning
    Chen, Xianfu
    Zhang, Honggang
    Wu, Celimuge
    Mao, Shiwen
    Ji, Yusheng
    Bennis, Mehdi
    2018 IEEE 88TH VEHICULAR TECHNOLOGY CONFERENCE (VTC-FALL), 2018,
  • [46] Planning Irregular Object Packing via Hierarchical Reinforcement Learning
    Huang, Sichao
    Wang, Ziwei
    Zhou, Jie
    Lu, Jiwen
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2023, 8 (01) : 81 - 88
  • [47] Learning Battles in ViZDoom via Deep Reinforcement Learning
    Shao, Kun
    Zhao, Dongbin
    Li, Nannan
    Zhu, Yuanheng
    PROCEEDINGS OF THE 2018 IEEE CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND GAMES (CIG'18), 2018, : 389 - 392
  • [48] Deep sparse representation via deep dictionary learning for reinforcement learning
    Tang, Jianhao
    Li, Zhenni
    Xie, Shengli
    Ding, Shuxue
    Zheng, Shaolong
    Chen, Xueni
    2022 41ST CHINESE CONTROL CONFERENCE (CCC), 2022, : 2398 - 2403
  • [49] Model Parallelism optimization with deep reinforcement learning
    Mirhoseini, Azalia
    2018 IEEE INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS (IPDPSW 2018), 2018, : 855 - 855
  • [50] FPGA Placement Optimization with Deep Reinforcement Learning
    Zhang, Junpeng
    Deng, Fang
    Yang, Xudong
    2021 2ND INTERNATIONAL CONFERENCE ON COMPUTER ENGINEERING AND INTELLIGENT CONTROL (ICCEIC 2021), 2021, : 73 - 76