QUADCOPTER CONTROL USING SINGLE NETWORK ADAPTIVE CRITICS

被引:0
|
作者
Velazquez, Alberto [1 ]
Xu, Lei [2 ]
Sardarmehni, Tohid [3 ]
机构
[1] Univ Texas Rio Grande Valley, Edinburg, TX USA
[2] Kent State Univ, Kent, OH 44242 USA
[3] Calif State Univ Northridge, Northridge, CA 91330 USA
基金
美国国家科学基金会;
关键词
Quadcopter; Optimal Control; Adaptive Dynamic Programming; Reinforcement Learning;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, optimal tracking control is found for an input-affine nonlinear quadcopter using Single Network Adaptive Critics (SNAC). The quadcopter dynamics consists of twelve states and four controls. The states are defined using two related reference frames: the earth frame, which describes the position and angles, and the body frame, which describes the linear and angular velocities. The quadcopter has six outputs and four controls, so it is an underactuated nonlinear system. The optimal control for the system is derived by solving a discrete-time recursive Hamilton-Jacobi-Bellman equation using a linear in-parameter neural network. The neural network is trained to find a mapping between a target costate vector and the current states. The network's weights are iteratively trained using the least-squares approximation method until the maximum number of iterations or convergence is reached, and training begins at the final time and proceeds backward to the initial time. The trained neural controller applies online optimal feedback control that tracks a trajectory, minimizes control effort, and satisfies the optimality condition. The SNAC method provides a controller that can handle all initial conditions within the domain of training and all times less than the training's final time.
引用
收藏
页数:7
相关论文
共 50 条
  • [1] Optimal control synthesis of a class of nonlinear systems using single network adaptive critics
    Padhi, R
    Unnikrishnan, N
    Balakrishnan, SN
    PROCEEDINGS OF THE 2004 AMERICAN CONTROL CONFERENCE, VOLS 1-6, 2004, : 1592 - 1597
  • [2] Decentralized Control of Nonlinear Multi-Agent Systems Using Single Network Adaptive Critics
    Heydari, Ali
    Balakrishnan, S. N.
    2012 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2012,
  • [3] Finite-Horizon Control-Constrained Nonlinear Optimal Control Using Single Network Adaptive Critics
    Heydari, Ali
    Balakrishnan, Sivasubramanya N.
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2013, 24 (01) : 145 - 157
  • [4] Optimal Blood Glucose Regulation using Single Network Adaptive Critics
    Ali, Sk Faruque
    Padhi, Radhakant
    2009 IEEE CONTROL APPLICATIONS CCA & INTELLIGENT CONTROL (ISIC), VOLS 1-3, 2009, : 89 - 94
  • [5] Finite-Horizon Input-Constrained Nonlinear Optimal Control Using Single Network Adaptive Critics
    Heydari, Ali
    Balakrishnan, S. N.
    2011 AMERICAN CONTROL CONFERENCE, 2011, : 3047 - 3052
  • [6] Finite-horizon input-constrained nonlinear optimal control using single network adaptive critics
    Heydari, Ali
    Balakrishnan, S.N.
    Proceedings of the American Control Conference, 2011, : 3047 - 3052
  • [7] Direct adaptive control using single network adaptive critic
    Kumar, Swagat
    Padhi, Radhakant
    Behera, Laxmidhar
    2007 IEEE INTERNATIONAL CONFERENCE ON SYSTEM OF SYSTEMS ENGINEERING, VOLS 1 AND 2, 2007, : 396 - +
  • [8] Optimal blood glucose regulation of diabetic patients using single network adaptive critics
    Ali, S. K. Faruque
    Padhi, Radhakant
    OPTIMAL CONTROL APPLICATIONS & METHODS, 2011, 32 (02): : 196 - 214
  • [9] Adaptive Fault Tolerant Control of Quadcopter by Using Minimum Projection Method
    Tabata, Anan
    Satoh, Yasuyuki
    Nakamura, Hisakazu
    Kato, Kiyotaka
    IECON 2018 - 44TH ANNUAL CONFERENCE OF THE IEEE INDUSTRIAL ELECTRONICS SOCIETY, 2018, : 2201 - 2206
  • [10] Designing and modeling of quadcopter control system using L1 adaptive control
    Thu, Kyaw Myat
    Gavrilov, A. I.
    XII INTERNATIONAL SYMPOSIUM INTELLIGENT SYSTEMS 2016, (INTELS 2016), 2017, 103 : 528 - 535