Payload Transporting With Two Quadrotors by Centralized Reinforcement Learning Method

被引：4

作者：

Lin, Dasheng ^{[1
]}

Han, Jianda ^{[1
]}

Li, Kun ^{[2
]}

Zhang, Jianlei ^{[1
]}

Zhang, Chunyan ^{[1
]}

机构：

[1] Nankai Univ, Coll Artificial Intelligence, Tianjin 300071, Peoples R China

[2] Hebei Univ Technol, Sch Civil & Transportat Engn, Tianjin 300401, Peoples R China

来源：

IEEE TRANSACTIONS ON AEROSPACE AND ELECTRONIC SYSTEMS | 2024年 / 60卷 / 01期

基金：

中国国家自然科学基金;

关键词：

Quadrotors; Payloads; Reinforcement learning; Mathematical models; Angular velocity; Torque; Symbols; Continuous action space; continuous state space; reinforcement learning (RL); twin delayed deep deterministic policy gradient (TD3); two-quadrotor transporting payload system; TRACKING; FLIGHT;

D O I：

10.1109/TAES.2023.3321260

中图分类号：

V [航空、航天];

学科分类号：

08 ; 0825 ;

摘要：

Nowadays, quadrotors find applications in automation and artificial intelligence. Among diverse quadrotor studies, payload transport stands out, posing implementation challenges. Using multiple quadrotors reduces per-quadrotor load while increasing system complexity. Inspired by model-free reinforcement learning, we apply it to position control in a nonlinear two-quadrotor payload system. Our approach employs a reinforcement learning agent guided by the twin delay deep deterministic policy gradient (TD3) algorithm. Its goal is accurate cable-suspended payload delivery and system stabilization. We test the method's robustness by adding noise. Simulation results show that TD3 excels in ideal conditions and handles noise during training and testing, highlighting its effectiveness. This article's scope can be expanded to encompass scenarios involving three or more quadrotors, providing valuable insights for future endeavors.

引用

页码：239 / 251

页数：13

共 50 条

[1] Using Constrained Model Predictive Control to Control Two Quadrotors Transporting a Cable-Suspended Payload
Alothman, Yaser
Gu, Dongbing
2018 13TH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION (WCICA), 2018, : 228 - 233
[2] Mixed H2/H∞ control for two quadrotors transporting a cable-suspended payload
Guo, Minhuan
Gu, Dongbing
INTERNATIONAL JOURNAL OF MODELLING IDENTIFICATION AND CONTROL, 2021, 38 (01) : 15 - 31
[3] A New Nonlinear Control Strategy Embedded With Reinforcement Learning for a Multirotor Transporting a Suspended Payload
Hua, Hean
Fang, Yongchun
Zhang, Xuetao
Qian, Chen
IEEE-ASME TRANSACTIONS ON MECHATRONICS, 2022, 27 (02) : 1174 - 1184
[4] Cooperative Transportation of a Flexible Payload Using Two Quadrotors
Chen, Ti
Shan, Jinjun
Liu, Hugh H. T.
JOURNAL OF GUIDANCE CONTROL AND DYNAMICS, 2021, 44 (11) : 2099 - 2107
[5] A Velocity Controller for Quadrotors Based on Reinforcement Learning
Hu, Yu
Luo, Jie
Dong, Zhiyan
Zhang, Lihua
ARTIFICIAL INTELLIGENCE, CICAI 2023, PT II, 2024, 14474 : 411 - 421
[6] Multi-Task Reinforcement Learning for Quadrotors
Xing, Jiaxu
Geles, Ismail
Song, Yunlong
Aljalbout, Elie
Scaramuzza, Davide
IEEE ROBOTICS AND AUTOMATION LETTERS, 2025, 10 (03): : 2112 - 2119
[7] Centralized payload control method for planetary rover exploration
Wang L.
Zhu Y.
Ma M.
Rao J.
Liang Y.
Wang W.
Guofang Keji Daxue Xuebao/Journal of National University of Defense Technology, 2019, 41 (02): : 8 - 16
[8] Attitude Synchronization for Multiple Quadrotors using Reinforcement Learning
Liu, Hao
Zhao, Wanbing
Lewis, Frank L.
Jiang, Zhong-Ping
Modares, Hamidreza
PROCEEDINGS OF THE 38TH CHINESE CONTROL CONFERENCE (CCC), 2019, : 2480 - 2483
[9] Using Constrained NMPC to Control a Cable-Suspended Payload With Two Quadrotors
Alothman, Yaser
Gu, Dongbing
2018 24TH IEEE INTERNATIONAL CONFERENCE ON AUTOMATION AND COMPUTING (ICAC' 18), 2018, : 417 - 422
[10] A centralized reinforcement learning method for multi-agent job scheduling in Grid
Moradi, Milad
2016 6TH INTERNATIONAL CONFERENCE ON COMPUTER AND KNOWLEDGE ENGINEERING (ICCKE), 2016, : 171 - 176

← 1 2 3 4 5 →