A Novel Exploration-Exploitation-Based Adaptive Law for Intelligent Model-Free Control Approaches

被引:44
|
作者
Tutsoy, Onder [1 ]
Barkana, Duygun Erol [2 ]
Balikci, Kemal [3 ]
机构
[1] Adana Alparslan Turkes Sci & Technol Univ, Elect & Elect Engn Dept, TR-411000 Adana, Turkey
[2] Yeditepe Univ, Dept Elect & Elect Engn, TR-34755 Istanbul, Turkey
[3] Osmaniye Korkut Ata Univ, Dept Elect & Elect Engn, TR-80000 Osmaniye, Turkey
关键词
Adaptation models; Process control; Predictive models; Real-time systems; Noise measurement; Analytical models; Manipulator dynamics; Adaptive law; exploitation; exploration; intelligent control; Lyapunov stability; model free; uncertainty; vanishing gradient; PREDICTIVE CONTROL; FINITE CONTROL;
D O I
10.1109/TCYB.2021.3091680
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Model-free control approaches require advanced exploration-exploitation policies to achieve practical tasks such as learning to bipedal robot walk in unstructured environments. In this article, we first construct a comprehensive exploration-exploitation policy that carries quality knowledge about the long-term predictor and the control policy, and the control signal of the model-free algorithms. Therefore, the developed model-free algorithm continues exploration by adjusting its unknown parameters until the desired learning and control are accomplished. Second, we provide an utterly model-free adaptive law enriched with the exploration-exploitation policy and derived step-by-step using the exact analogy of the model-based solution. The obtained adaptive control law considers the control signal saturation and the control signal (input) delay. Performed Lyapunov stability analysis ensures the convergence of the adaptive law that can also be used for intelligent control approaches. Third, we implement the adaptive algorithm in real time on a challenging benchmark system: a fourth-order, coupled dynamics, input saturated, and time-delayed underactuated manipulator. The results show that the proposed adaptive algorithm explores larger state-action spaces and treats the vanishing gradient problem in both learning and control. Also, we notice from the results that the learning and control properties of the adaptive algorithm are optimized as required.
引用
收藏
页码:329 / 337
页数:9
相关论文
共 50 条
  • [1] An intelligent control system for building aquaculture based on model-free adaptive control
    Qin, Haohua
    Dong, Yuhua
    Guo, Lantian
    Man, Luxin
    Chai, Sheng
    FERROELECTRICS, 2023, 609 (01) : 87 - 96
  • [2] Model-free Optimization: The Exploration-Exploitation Paradigm
    Raphel, Mariya
    Gunjal, Revati
    Wagh, S. R.
    Singh, N. M.
    2022 EIGHTH INDIAN CONTROL CONFERENCE, ICC, 2022, : 422 - 427
  • [3] Exploration-exploitation-based trajectory tracking of mobile robots using Gaussian processes and model predictive control
    Eschmann, Hannes
    Ebel, Henrik
    Eberhard, Peter
    ROBOTICA, 2023, 41 (10) : 3040 - 3058
  • [4] Motion Control of Manipulators Based on Model-free Adaptive Control
    Zhang, Ziqiao
    Dian, Songyi
    3RD INTERNATIONAL CONFERENCE ON AUTOMATION, CONTROL AND ROBOTICS ENGINEERING (CACRE 2018), 2018, 428
  • [5] Model-free adaptive (MFA) control
    Cheng, GS
    COMPUTING & CONTROL ENGINEERING JOURNAL, 2004, 15 (03): : 28 - 33
  • [6] Model-free adaptive sliding mode control for intelligent vehicle longitudinal dynamics
    Feng Zhang-qi
    Jiang Hao-bin
    Wei Qi-zhi
    Hong Yang-ke
    Oluwaleke, Ojo Abiodun
    ADVANCES IN MECHANICAL ENGINEERING, 2022, 14 (07)
  • [7] Model-free adaptive control instrument
    不详
    HYDROCARBON PROCESSING, 1999, 78 (10): : 37 - 37
  • [8] 'Model-free' adaptive control technology
    不详
    HYDROCARBON PROCESSING, 1998, 77 (02): : 34 - +
  • [9] Model-free adaptive control of an evaporator
    Major, Michael
    Control (Chicago, Ill), 1998, 11 (09):
  • [10] A novel ADP based model-free predictive control
    Dong, Na
    Chen, Zengqiang
    NONLINEAR DYNAMICS, 2012, 69 (1-2) : 89 - 97