Deep reinforcement learning with domain randomization for overhead crane control with payload mass variations

被引:0
|
作者
Zhang, Jianfeng [1 ]
Zhao, Chunhui [1 ,2 ]
Ding, Jinliang [3 ]
机构
[1] Zhejiang Univ, Coll Control Sci & Engn, Hangzhou 310027, Peoples R China
[2] Shenzhen Polytech Universtiy, Inst Intelligence Sci & Engn, Shenzhen 518055, Peoples R China
[3] Northeastern Univ, State Key Lab Synthet Automation Proc Ind, Shenyang 110819, Peoples R China
基金
中国国家自然科学基金;
关键词
Overhead cranes; Deep reinforcement learning; Domain randomization; Memory-augmented policy; Payload mass variations; DESIGN;
D O I
10.1016/j.conengprac.2023.105689
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Overhead cranes, as an important tool for loading and transporting, play an important role in modern industry. A key challenge in overhead crane control is payload mass variation: a policy learned to solve the overhead crane control in the fixed payload scenario often fails to solve the control task in the payload variation scenario. Therefore, from a practical perspective, this paper designs a novel deep reinforcement learning (DRL) control algorithm, domain randomization memory-augmented Beta proximal policy optimization (DR-MABPPO), which leverages the memory-augmented policy and incorporates the domain randomization (DR) training strategy to address the control problem of the overhead crane with payload masses variations. With the help of the DR training strategy and the memory-augmented policy, DR-MABPPO can learn a universal policy that is robust to the wide range of payload mass variations. As far as we know, this is the first time that the DRL technique is applied to solve the overhead crane control with payload mass variations. Simulation studies are conducted to demonstrate the effectiveness of the proposed method in the presence of payload mass variations, exhibiting satisfactory control performance when compared to PID and LQR.
引用
收藏
页数:13
相关论文
共 50 条
  • [1] Robust Reinforcement Learning For Overhead Crane Control With Variable Load Conditions
    Balazy, Patryk
    Pieprzycki, Kamil
    Knap, Pawel
    2024 25TH INTERNATIONAL CARPATHIAN CONTROL CONFERENCE, ICCC 2024, 2024,
  • [2] Heave reduction of payload through crane control based on deep reinforcement learning using dual offshore cranes
    Bae, Jun-Hyeok
    Cha, Ju-Hwan
    Ha, Sol
    JOURNAL OF COMPUTATIONAL DESIGN AND ENGINEERING, 2023, 10 (01) : 414 - 424
  • [3] Uniaxial attitude control of uncrewed aerial vehicle with thrust vectoring under model variations by deep reinforcement learning and domain randomization
    Atsushi Osedo
    Daichi Wada
    Shinsaku Hisada
    ROBOMECH Journal, 10
  • [4] Uniaxial attitude control of uncrewed aerial vehicle with thrust vectoring under model variations by deep reinforcement learning and domain randomization
    Osedo, Atsushi
    Wada, Daichi
    Hisada, Shinsaku
    ROBOMECH JOURNAL, 2023, 10 (01):
  • [5] Evaluating Domain Randomization in Deep Reinforcement Learning Locomotion Tasks
    Ajani, Oladayo S.
    Hur, Sung-ho
    Mallipeddi, Rammohan
    MATHEMATICS, 2023, 11 (23)
  • [6] A neural network-based input shaping for swing suppression of an overhead crane under payload hoisting and mass variations
    Ramli, Liyana
    Mohamed, Z.
    Jaafar, H. I.
    MECHANICAL SYSTEMS AND SIGNAL PROCESSING, 2018, 107 : 484 - 501
  • [7] Automation of crane control for block lifting based on deep reinforcement learning
    Chun, Do-Hyun
    Roh, Myung-Il
    Lee, Hye-Won
    JOURNAL OF COMPUTATIONAL DESIGN AND ENGINEERING, 2022, 9 (04) : 1430 - 1448
  • [8] Learning control of an overhead crane for obstacle avoidance
    Gao, J
    Chen, D
    PROCEEDINGS OF THE 1997 AMERICAN CONTROL CONFERENCE, VOLS 1-6, 1997, : 3103 - 3104
  • [9] Iterative learning control for overhead crane systems
    Bu Xuhui
    Wang Fuzhong
    Li Sanyi
    Yu Fashan
    PROCEEDINGS OF INTERNATIONAL CONFERENCE ON ARTIFICIAL LIFE AND ROBOTICS (ICAROB 2014), 2014, : 182 - 185
  • [10] Efficient swing control of an overhead crane with simultaneous payload hoisting and external disturbances
    Ramli L.
    Mohamed Z.
    Efe M.Ö.
    Lazim I.M.
    Jaafar H.I.
    Mechanical Systems and Signal Processing, 2020, 135