Payload Transporting With Two Quadrotors by Centralized Reinforcement Learning Method

被引:4
|
作者
Lin, Dasheng [1 ]
Han, Jianda [1 ]
Li, Kun [2 ]
Zhang, Jianlei [1 ]
Zhang, Chunyan [1 ]
机构
[1] Nankai Univ, Coll Artificial Intelligence, Tianjin 300071, Peoples R China
[2] Hebei Univ Technol, Sch Civil & Transportat Engn, Tianjin 300401, Peoples R China
基金
中国国家自然科学基金;
关键词
Quadrotors; Payloads; Reinforcement learning; Mathematical models; Angular velocity; Torque; Symbols; Continuous action space; continuous state space; reinforcement learning (RL); twin delayed deep deterministic policy gradient (TD3); two-quadrotor transporting payload system; TRACKING; FLIGHT;
D O I
10.1109/TAES.2023.3321260
中图分类号
V [航空、航天];
学科分类号
08 ; 0825 ;
摘要
Nowadays, quadrotors find applications in automation and artificial intelligence. Among diverse quadrotor studies, payload transport stands out, posing implementation challenges. Using multiple quadrotors reduces per-quadrotor load while increasing system complexity. Inspired by model-free reinforcement learning, we apply it to position control in a nonlinear two-quadrotor payload system. Our approach employs a reinforcement learning agent guided by the twin delay deep deterministic policy gradient (TD3) algorithm. Its goal is accurate cable-suspended payload delivery and system stabilization. We test the method's robustness by adding noise. Simulation results show that TD3 excels in ideal conditions and handles noise during training and testing, highlighting its effectiveness. This article's scope can be expanded to encompass scenarios involving three or more quadrotors, providing valuable insights for future endeavors.
引用
收藏
页码:239 / 251
页数:13
相关论文
共 50 条
  • [1] Using Constrained Model Predictive Control to Control Two Quadrotors Transporting a Cable-Suspended Payload
    Alothman, Yaser
    Gu, Dongbing
    2018 13TH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION (WCICA), 2018, : 228 - 233
  • [2] Mixed H2/H∞ control for two quadrotors transporting a cable-suspended payload
    Guo, Minhuan
    Gu, Dongbing
    INTERNATIONAL JOURNAL OF MODELLING IDENTIFICATION AND CONTROL, 2021, 38 (01) : 15 - 31
  • [3] A New Nonlinear Control Strategy Embedded With Reinforcement Learning for a Multirotor Transporting a Suspended Payload
    Hua, Hean
    Fang, Yongchun
    Zhang, Xuetao
    Qian, Chen
    IEEE-ASME TRANSACTIONS ON MECHATRONICS, 2022, 27 (02) : 1174 - 1184
  • [4] Cooperative Transportation of a Flexible Payload Using Two Quadrotors
    Chen, Ti
    Shan, Jinjun
    Liu, Hugh H. T.
    JOURNAL OF GUIDANCE CONTROL AND DYNAMICS, 2021, 44 (11) : 2099 - 2107
  • [5] A Velocity Controller for Quadrotors Based on Reinforcement Learning
    Hu, Yu
    Luo, Jie
    Dong, Zhiyan
    Zhang, Lihua
    ARTIFICIAL INTELLIGENCE, CICAI 2023, PT II, 2024, 14474 : 411 - 421
  • [6] Multi-Task Reinforcement Learning for Quadrotors
    Xing, Jiaxu
    Geles, Ismail
    Song, Yunlong
    Aljalbout, Elie
    Scaramuzza, Davide
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2025, 10 (03): : 2112 - 2119
  • [7] Centralized payload control method for planetary rover exploration
    Wang L.
    Zhu Y.
    Ma M.
    Rao J.
    Liang Y.
    Wang W.
    Guofang Keji Daxue Xuebao/Journal of National University of Defense Technology, 2019, 41 (02): : 8 - 16
  • [8] Attitude Synchronization for Multiple Quadrotors using Reinforcement Learning
    Liu, Hao
    Zhao, Wanbing
    Lewis, Frank L.
    Jiang, Zhong-Ping
    Modares, Hamidreza
    PROCEEDINGS OF THE 38TH CHINESE CONTROL CONFERENCE (CCC), 2019, : 2480 - 2483
  • [9] Using Constrained NMPC to Control a Cable-Suspended Payload With Two Quadrotors
    Alothman, Yaser
    Gu, Dongbing
    2018 24TH IEEE INTERNATIONAL CONFERENCE ON AUTOMATION AND COMPUTING (ICAC' 18), 2018, : 417 - 422
  • [10] A centralized reinforcement learning method for multi-agent job scheduling in Grid
    Moradi, Milad
    2016 6TH INTERNATIONAL CONFERENCE ON COMPUTER AND KNOWLEDGE ENGINEERING (ICCKE), 2016, : 171 - 176