Deep reinforcement learning with domain randomization for overhead crane control with payload mass variations

被引:0
|
作者
Zhang, Jianfeng [1 ]
Zhao, Chunhui [1 ,2 ]
Ding, Jinliang [3 ]
机构
[1] Zhejiang Univ, Coll Control Sci & Engn, Hangzhou 310027, Peoples R China
[2] Shenzhen Polytech Universtiy, Inst Intelligence Sci & Engn, Shenzhen 518055, Peoples R China
[3] Northeastern Univ, State Key Lab Synthet Automation Proc Ind, Shenyang 110819, Peoples R China
基金
中国国家自然科学基金;
关键词
Overhead cranes; Deep reinforcement learning; Domain randomization; Memory-augmented policy; Payload mass variations; DESIGN;
D O I
10.1016/j.conengprac.2023.105689
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Overhead cranes, as an important tool for loading and transporting, play an important role in modern industry. A key challenge in overhead crane control is payload mass variation: a policy learned to solve the overhead crane control in the fixed payload scenario often fails to solve the control task in the payload variation scenario. Therefore, from a practical perspective, this paper designs a novel deep reinforcement learning (DRL) control algorithm, domain randomization memory-augmented Beta proximal policy optimization (DR-MABPPO), which leverages the memory-augmented policy and incorporates the domain randomization (DR) training strategy to address the control problem of the overhead crane with payload masses variations. With the help of the DR training strategy and the memory-augmented policy, DR-MABPPO can learn a universal policy that is robust to the wide range of payload mass variations. As far as we know, this is the first time that the DRL technique is applied to solve the overhead crane control with payload mass variations. Simulation studies are conducted to demonstrate the effectiveness of the proposed method in the presence of payload mass variations, exhibiting satisfactory control performance when compared to PID and LQR.
引用
收藏
页数:13
相关论文
共 50 条
  • [31] Adaptive fuzzy tracking control for vibration suppression of tower crane with distributed payload mass
    Sun, Zheng
    Ouyang, Huimin
    AUTOMATION IN CONSTRUCTION, 2022, 142
  • [32] Comparison of Different Domain Randomization Methods for Policy Transfer in Reinforcement Learning
    Ma, Mingjun
    Li, Haoran
    Hu, Guangzheng
    Liu, Shasha
    Zhao, Dongbin
    2023 IEEE 12TH DATA DRIVEN CONTROL AND LEARNING SYSTEMS CONFERENCE, DDCLS, 2023, : 1818 - 1823
  • [33] A Deep Reinforcement Learning Motion Control Strategy of a Multi-rotor UAV for Payload Transportation with Minimum Swing
    Panetsos, Fotis
    Karras, George C.
    Kyriakopoulos, Kostas J.
    2022 30TH MEDITERRANEAN CONFERENCE ON CONTROL AND AUTOMATION (MED), 2022, : 368 - 374
  • [34] Deep Reinforcement Q-Learning for Intelligent Traffic Control in Mass Transit
    Khozam, Shurok
    Farhi, Nadir
    SUSTAINABILITY, 2023, 15 (14)
  • [35] Deep Reinforcement Learning for Formation Control
    Aykin, Can
    Knopp, Martin
    Dieopold, Klaus
    2018 27TH IEEE INTERNATIONAL SYMPOSIUM ON ROBOT AND HUMAN INTERACTIVE COMMUNICATION (IEEE RO-MAN 2018), 2018, : 1124 - 1128
  • [36] Deep Reinforcement Learning for Contagion Control
    Benalcazar, Diego R.
    Enyioha, Chinwendu
    5TH IEEE CONFERENCE ON CONTROL TECHNOLOGY AND APPLICATIONS (IEEE CCTA 2021), 2021, : 162 - 167
  • [37] Online reinforcement learning with passivity-based stabilizing term for real time overhead crane control without knowledge of the system model
    Zhang, Haoran
    Zhao, Chunhui
    Ding, Jinliang
    CONTROL ENGINEERING PRACTICE, 2022, 127
  • [38] Document Domain Randomization for Deep Learning Document Layout Extraction
    Ling, Meng
    Chen, Jian
    Moeller, Torsten
    Isenberg, Petra
    Isenberg, Tobias
    Sedlmair, Michael
    Laramee, Robert S.
    Shen, Han-Wei
    Wu, Jian
    Giles, C. Lee
    DOCUMENT ANALYSIS AND RECOGNITION - ICDAR 2021, PT I, 2021, 12821 : 497 - 513
  • [39] A PID-SMC control method with payload anti -s ing for 3D overhead crane systems
    Wang, Shourui
    Jim, Wuyin
    2022 34TH CHINESE CONTROL AND DECISION CONFERENCE, CCDC, 2022, : 4583 - 4588
  • [40] Efficient control of a 3D overhead crane with simultaneous payload hoisting and wind disturbance: design, simulation and experiment
    Abdullahi, A. M.
    Mohamed, Z.
    Selamat, H.
    Pota, H. R.
    Abidin, M. S. Zainal
    Fasih, S. M.
    MECHANICAL SYSTEMS AND SIGNAL PROCESSING, 2020, 145