Fuzzy learning controller design of 2-DOF flight attitude simulator

被引:0
|
作者
Ren L.-W. [1 ]
Ban X.-J. [1 ]
Wu F. [2 ]
Huang X.-L. [1 ]
机构
[1] Center for Control Theory and Guidance Technology, Harbin Institute of Technology, Harbin
[2] Department of Mechanical and Aerospace Engineering, North Carolina State University, Raleigh
关键词
Aircraft control; Attitude stabilization; Policy iteration algorithm; Polynomial T-S fuzzy system; Reinforcement learning;
D O I
10.15938/j.emc.2019.11.016
中图分类号
学科分类号
摘要
Aiming at the attitude stabilization problem of two-degrees-of-freedom flight attitude simulator, an attitude stabilization controller was designed based on the policy iteration algorithm in the reinforcement learning.The policyiteration learning algorithm and the polynomial T-S fuzzy systems were combined together, conducting parameters' adjustment of the controller, and achievingthe optimization of the attitude stability control performance of the two-degrees-of-freedom flight attitude simulator.By approximating the policy function of the actor and the value function of the critic with the polynomial T-S fuzzy models, the actor-critic structure based on the polynomial T-S fuzzy models was established. Through the policy iteration process, the optimal parameters of the controller were learned to minimize the value function.The simulation results show that the policy iteration algorithm based on polynomial T-S fuzzy models is effective in controlling aircraft attitude stabilization. © 2019, Harbin University of Science and Technology Publication. All right reserved.
引用
收藏
页码:127 / 134
页数:7
相关论文
共 21 条
  • [1] Yin H., Yang Y., Zhao Y., Et al., Self-tuning controller design for a 2-DOF flight attitude simulator, Electric Machines and Control, 22, 4, pp. 109-116, (2018)
  • [2] Sutton R.S., Barto A.G., Reinforcement Learning-An Introduction, (1998)
  • [3] Mendel J.M., Adaptive Learning and Pattern Recognition Systems: Theory and Applications, (1970)
  • [4] Wei Q., Lewis F.L., Liu D., Et al., Discrete-time local value iteration adaptive dynamic programming: Convergence analysis, IEEE Transactions on Systems Man and Cybernetics: Systems, 48, 6, pp. 875-891, (2018)
  • [5] Kiumarsi B., Lewis F.L., Jiang Z.P., H<sub>∞</sub> control of linear discrete-time systems: Off-policy reinforcement learning, Automatica, 78, 1, pp. 144-152, (2017)
  • [6] Lewis F.L., Vrabie D., Reinforcement learning and adaptive dynamic programming for feedback control, IEEE Circuits and Systems Magazine, 9, 3, pp. 32-50, (2009)
  • [7] Zhao D., Liu D., Lewis F.L., Et al., Special issue on deep reinforcement learning and adaptive dynamic programming, IEEE Transactions on Neural Networks and Learning Systems, 29, 6, pp. 2038-2041, (2018)
  • [8] Khateraa, El-Nagar A.M., El-Bardinim, Et al., Adaptive T-S fuzzy controller using reinforcement learning based on Lyapunov stability, Journal of the Franklin Institute, 355, 14, pp. 6390-6415, (2018)
  • [9] Xiong Y., Haibo H., Qinglai W., Et al., Reinforcement learning for robust adaptive control of partially unknown nonlinear systems subject to unmatched uncertainties, Information Sciences, 463, 1, pp. 307-322, (2018)
  • [10] Zhang C., Zou W., Cheng N., Et al., Trajectory tracking control for rotary steerable systems using interval type-2 fuzzy logic and reinforcement learning, Journal of the Franklin Institute, 355, 2, pp. 803-826, (2018)