Wind turbine pitch reinforcement learning control improved by PID regulator and learning observer

Cited by: 39
Authors
Sierra-Garcia, J. Enrique [1]
Santos, Matilde [2]
Pandit, Ravi [3]
Affiliations
[1] Univ Burgos, Electromech Engn Dept, Burgos 09006, Spain
[2] Univ Complutense Madrid, Inst Knowledge Technol, Madrid 28040, Spain
[3] Anglia Ruskin Univ, Fac Sci & Engn, Chelmsford, Essex, England
Keywords
Intelligent control; Reinforcement learning; Learning observer; Pitch control; Wind turbines; DESIGN; SYSTEM;
DOI
10.1016/j.engappai.2022.104769
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology];
Discipline classification code
0812 ;
Abstract
Wind turbine (WT) pitch control is challenging because of the non-linearities and complex dynamics of the turbine, the coupling between variables, and the uncertainty of the environment. Reinforcement learning (RL) based control is a promising technique for addressing these problems, but its applicability is still limited by the slowness of the learning process. To help alleviate this drawback, this work presents a hybrid control scheme that combines an RL-based controller with a proportional-integral-derivative (PID) regulator and a learning observer. The PID is beneficial during the first training episodes, when the RL-based controller does not yet have any experience to learn from. The learning observer oversees the learning process, adjusting the exploration rate and the exploration window in order to reduce oscillations during training and improve convergence. Simulation experiments on a small real WT show that learning improves significantly with this control architecture, speeding up learning convergence by up to 37% and increasing the efficiency of the intelligent control strategy. The best hybrid controller reduces the output power error by around 41% compared with a PID regulator. Moreover, the proposed intelligent hybrid control configuration proved more efficient than a fuzzy controller and a neuro-control strategy.
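The architecture described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the additive combination of the PID and RL outputs, the observer's update rule, the pitch limits, and all names and numeric values (`PID`, `LearningObserver`, `hybrid_pitch_command`, `decay`, `growth`) are assumptions made here for illustration. Only the exploration rate is modeled; the exploration window mentioned in the abstract is omitted.

```python
from dataclasses import dataclass


@dataclass
class PID:
    """Textbook discrete PID regulator; gains are illustrative, not from the paper."""
    kp: float
    ki: float
    kd: float
    dt: float
    integral: float = 0.0
    prev_error: float = 0.0

    def step(self, error: float) -> float:
        # Accumulate the integral term and approximate the derivative by a finite difference.
        self.integral += error * self.dt
        derivative = (error - self.prev_error) / self.dt
        self.prev_error = error
        return self.kp * error + self.ki * self.integral + self.kd * derivative


@dataclass
class LearningObserver:
    """Hypothetical observer: lowers the exploration rate when the episode
    error improves on the best seen so far, and raises it otherwise."""
    epsilon: float = 0.5
    eps_min: float = 0.01
    eps_max: float = 0.9
    decay: float = 0.8    # assumed shrink factor
    growth: float = 1.1   # assumed growth factor
    best_error: float = float("inf")

    def update(self, episode_error: float) -> float:
        if episode_error < self.best_error:
            self.best_error = episode_error
            self.epsilon = max(self.eps_min, self.epsilon * self.decay)
        else:
            self.epsilon = min(self.eps_max, self.epsilon * self.growth)
        return self.epsilon


def hybrid_pitch_command(pid_out: float, rl_out: float,
                         pitch_min: float = 0.0, pitch_max: float = 90.0) -> float:
    """Assumed combination rule: sum both contributions, then clamp to the pitch range."""
    return min(pitch_max, max(pitch_min, pid_out + rl_out))
```

In a training loop of this shape, the PID term would supply a usable pitch reference in early episodes while the RL contribution is still uninformed, and `LearningObserver.update` would be called once per episode with an aggregate error measure.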
Pages: 13
Related papers
50 in total
  • [1] Combination of fuzzy control and reinforcement learning for wind turbine pitch control
    Sierra-Garcia, J. Enrique
    Santos, Matilde
    LOGIC JOURNAL OF THE IGPL, 2024,
  • [2] Exploring Reward Strategies for Wind Turbine Pitch Control by Reinforcement Learning
    Sierra-Garcia, Jesus Enrique
    Santos, Matilde
    APPLIED SCIENCES-BASEL, 2020, 10 (21): : 1 - 23
  • [3] Combination of Neural Networks and Reinforcement Learning for Wind Turbine Pitch Control
    Sierra-Garcia, Jesus Enrique
    Santos, Matilde
    HYBRID ARTIFICIAL INTELLIGENT SYSTEMS, HAIS 2022, 2022, 13469 : 385 - 392
  • [4] Adaptive PID Controller based on Reinforcement Learning for Wind Turbine Control
    Sedighizadeh, M.
    Rezazadeh, A.
    PROCEEDINGS OF WORLD ACADEMY OF SCIENCE, ENGINEERING AND TECHNOLOGY, VOL 27, 2008, 27 : 257 - 262
  • [5] Spatial Iterative Learning Control for Pitch of Wind Turbine
    Liu, Yan
    Ruan, Xiaoe
    PROCEEDINGS OF 2018 IEEE 7TH DATA DRIVEN CONTROL AND LEARNING SYSTEMS CONFERENCE (DDCLS), 2018, : 841 - 846
  • [6] Fuzzy-based collective pitch control for wind turbine via deep reinforcement learning
    Nabeel, Abdelhamid
    Lasheen, Ahmed
    Elshafei, Abdel Latif
    Zahab, Essam Aboul
    ISA TRANSACTIONS, 2024, 148 : 307 - 325
  • [7] Intelligent Control of a Wind Turbine based on Reinforcement Learning
    Tomin, Nikita
    Kurbatsky, Victor
    Guliyev, Huseyngulu
    2019 16TH CONFERENCE ON ELECTRICAL MACHINES, DRIVES AND POWER SYSTEMS (ELMA), 2019,
  • [8] Neural networks and reinforcement learning in wind turbine control
    Sierra-Garcia, J. E.
    Santos, M.
    REVISTA IBEROAMERICANA DE AUTOMATICA E INFORMATICA INDUSTRIAL, 2021, 18 (04): : 327 - 335
  • [9] The Application of Fuzzy PID Control in Pitch Wind Turbine
    Qi, Yishuang
    Meng, Qingjin
    2012 INTERNATIONAL CONFERENCE ON FUTURE ENERGY, ENVIRONMENT, AND MATERIALS, PT C, 2012, 16 : 1635 - 1641
  • [10] Control of Pitch Angle of Wind Turbine by Fuzzy Pid Controller
    Civelek, Zafer
    Luy, Murat
    Cam, Ertugrul
    Barisci, Necaattin
    INTELLIGENT AUTOMATION AND SOFT COMPUTING, 2016, 22 (03): : 463 - 471