Deep reinforcement learning with domain randomization for overhead crane control with payload mass variations

被引:0
|
作者
Zhang, Jianfeng [1 ]
Zhao, Chunhui [1 ,2 ]
Ding, Jinliang [3 ]
机构
[1] Zhejiang Univ, Coll Control Sci & Engn, Hangzhou 310027, Peoples R China
[2] Shenzhen Polytech Universtiy, Inst Intelligence Sci & Engn, Shenzhen 518055, Peoples R China
[3] Northeastern Univ, State Key Lab Synthet Automation Proc Ind, Shenyang 110819, Peoples R China
基金
中国国家自然科学基金;
关键词
Overhead cranes; Deep reinforcement learning; Domain randomization; Memory-augmented policy; Payload mass variations; DESIGN;
D O I
10.1016/j.conengprac.2023.105689
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Overhead cranes, as an important tool for loading and transporting, play an important role in modern industry. A key challenge in overhead crane control is payload mass variation: a policy learned to solve the overhead crane control in the fixed payload scenario often fails to solve the control task in the payload variation scenario. Therefore, from a practical perspective, this paper designs a novel deep reinforcement learning (DRL) control algorithm, domain randomization memory-augmented Beta proximal policy optimization (DR-MABPPO), which leverages the memory-augmented policy and incorporates the domain randomization (DR) training strategy to address the control problem of the overhead crane with payload masses variations. With the help of the DR training strategy and the memory-augmented policy, DR-MABPPO can learn a universal policy that is robust to the wide range of payload mass variations. As far as we know, this is the first time that the DRL technique is applied to solve the overhead crane control with payload mass variations. Simulation studies are conducted to demonstrate the effectiveness of the proposed method in the presence of payload mass variations, exhibiting satisfactory control performance when compared to PID and LQR.
引用
收藏
页数:13
相关论文
共 50 条
  • [21] Passivity-Based Online Reinforcement Learning for Real Time Model-Free Overhead Crane System Control
    Zhang, Haoran
    Zhao, Chunhui
    Ding, Jinliang
    2022 34TH CHINESE CONTROL AND DECISION CONFERENCE, CCDC, 2022, : 4116 - 4121
  • [22] High-Speed Collision Avoidance using Deep Reinforcement Learning and Domain Randomization for Autonomous Vehicles
    Kontes, Georgios D.
    Scherer, Daniel D.
    Nisslbeck, Tim
    Fischer, Janina
    Mutschler, Christopher
    2020 IEEE 23RD INTERNATIONAL CONFERENCE ON INTELLIGENT TRANSPORTATION SYSTEMS (ITSC), 2020,
  • [23] A Domain Data Pattern Randomization based Deep Reinforcement Learning method for Sim-to-Real transfer
    Gong, Peng
    Shi, Dianxi
    Xue, Chao
    Chen, Xucan
    2021 5TH INTERNATIONAL CONFERENCE ON INNOVATION IN ARTIFICIAL INTELLIGENCE (ICIAI 2021), 2021, : 1 - 7
  • [24] Deep Reinforcement Learning-Based Control for Asynchronous Motor-Actuated Triple Pendulum Crane Systems With Distributed Mass Payloads
    Wu, Qingxiang
    Sun, Ning
    Yang, Tong
    Fang, Yongchun
    IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 2024, 71 (02) : 1853 - 1862
  • [25] Variance Reduced Domain Randomization for Reinforcement Learning With Policy Gradient
    Jiang, Yuankun
    Li, Chenglin
    Dai, Wenrui
    Zou, Junni
    Xiong, Hongkai
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, 46 (02) : 1031 - 1048
  • [26] Bridging the Reality Gap of Reinforcement Learning based Traffic Signal Control using Domain Randomization and Meta Learning
    Mueller, Arthur
    Sabatelli, Matthia
    2023 IEEE 26TH INTERNATIONAL CONFERENCE ON INTELLIGENT TRANSPORTATION SYSTEMS, ITSC, 2023, : 5271 - 5278
  • [27] Learning and H∞ control of an overhead crane for obstacle avoidance and disturbance rejection
    Gao, JB
    Chen, DG
    PROCEEDINGS OF THE 36TH IEEE CONFERENCE ON DECISION AND CONTROL, VOLS 1-5, 1997, : 275 - 280
  • [28] Intelligent Optimal Control With Critic Learning for a Nonlinear Overhead Crane System
    Wang, Ding
    He, Haibo
    Liu, Derong
    IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2018, 14 (07) : 2932 - 2940
  • [29] Domain Randomization on Deep Learning Models for Image Dehazing
    Shamsuddin, Abdul Fathaah
    Abhijith, P.
    Ragunathan, Krupasankari
    Deepak, Raja Sekar P. M.
    Sankaran, Praveen
    2021 NATIONAL CONFERENCE ON COMMUNICATIONS (NCC), 2021, : 182 - 187
  • [30] Effect of the payload mass on forces acting from the overhead crane drives during movement in the mode of suppressing uncontrolled oscillations
    Korytov, Mikhail S.
    Shcherbakov, Vitaly S.
    Titenko, Vladimir V.
    IV INTERNATIONAL SCIENTIFIC AND TECHNICAL CONFERENCE MECHANICAL SCIENCE AND TECHNOLOGY UPDATE (MSTU-2020), 2020, 1546