Safe reinforcement learning-based control using deep deterministic policy gradient algorithm and slime mould algorithm with experimental tower crane system validation

被引:2
|
作者
Zamfirache, Iuliu Alexandru [1 ]
Precup, Radu-Emil [1 ,2 ]
Petriu, Emil M. [3 ]
机构
[1] Politehn Univ Timisoara, Dept Automat & Appl Informat, Bd V Parvan 2, Timisoara 300223, Romania
[2] Romanian Acad, Ctr Fundamental & Adv Tech Res, Timisoara Branch, Bd Mihai Viteazu 24, Timisoara 300223, Romania
[3] Univ Ottawa, Sch Elect Engn & Comp Sci, 800 King Edward, Ottawa, ON K1N 6N5, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
Deep deterministic policy gradient; Optimal reference tracking control; Safe reinforcement learning; Slime mould algorithm; Tower crane systems;
D O I
10.1016/j.ins.2024.121640
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper presents a novel optimal control approach resulting from the combination between the safe Reinforcement Learning (RL) framework represented by a Deep Deterministic Policy Gradient (DDPG) algorithm and a Slime Mould Algorithm (SMA) as a representative nature-inspired optimization algorithm. The main drawbacks of the traditional DDPG-based safe RL optimal control approach are the possible instability of the control system caused by randomly generated initial values of the controller parameters and the lack of state safety guarantees in the first iterations of the learning process due to (i) and (ii): (i) the safety constraints are considered only in the DDPG-based training process of the controller, which is usually implemented as a neural network (NN); (ii) the initial values of the weights and the biases of the NN-based controller are initialized with randomly generated values. The proposed approach mitigates these drawbacks by initializing the parameters of the NN-based controller using SMA. The fitness function of the SMA-based initialization process is designed to incorporate state safety constraints into the search process, resulting in an initial NN-based controller with embedded state safety constraints. The proposed approach is compared to the classical one using real-time experimental results and performance indices popular for optimal reference tracking control problems and based on a state safety score.
引用
收藏
页数:18
相关论文
共 50 条
  • [1] Reinforcement Learning Control with Deep Deterministic Policy Gradient Algorithm for Multivariable pH Process
    Panjapornpon, Chanin
    Chinchalongporn, Patcharapol
    Bardeeniz, Santi
    Makkayatorn, Ratthanita
    Wongpunnawat, Witchaya
    PROCESSES, 2022, 10 (12)
  • [2] Deep Ensemble Reinforcement Learning with Multiple Deep Deterministic Policy Gradient Algorithm
    Wu, Junta
    Li, Huiyun
    MATHEMATICAL PROBLEMS IN ENGINEERING, 2020, 2020 (2020)
  • [3] Reinforcement learning control method for real-time hybrid simulation based on deep deterministic policy gradient algorithm
    Li, Ning
    Tang, Jichuan
    Li, Zhong-Xian
    Gao, Xiuyu
    Structural Control and Health Monitoring, 2022, 29 (10)
  • [4] Reinforcement learning control method for real-time hybrid simulation based on deep deterministic policy gradient algorithm
    Li, Ning
    Tang, Jichuan
    Li, Zhong-Xian
    Gao, Xiuyu
    STRUCTURAL CONTROL & HEALTH MONITORING, 2022, 29 (10):
  • [5] Robot grasping method optimization using improved deep deterministic policy gradient algorithm of deep reinforcement learning
    Zhang, Hongxu
    Wang, Fei
    Wang, Jianhui
    Cui, Ben
    REVIEW OF SCIENTIFIC INSTRUMENTS, 2021, 92 (02):
  • [6] Reinforcement Learning-based control using Q-learning and gravitational search algorithm with experimental validation on a nonlinear servo system
    Zamfirache, Iuliu Alexandru
    Precup, Radu-Emil
    Roman, Raul-Cristian
    Petriu, Emil M.
    INFORMATION SCIENCES, 2022, 583 : 99 - 120
  • [7] Control Method for PEMFC Using Improved Deep Deterministic Policy Gradient Algorithm
    Li, Jiawen
    Li, Yaping
    Yu, Tao
    FRONTIERS IN ENERGY RESEARCH, 2021, 9
  • [8] Intelligent Thermal Control Algorithm Based on Deep Deterministic Policy Gradient for Spacecraft
    Xiong, Yan
    Guo, Liang
    Wang, Hongliang
    Huang, Yong
    Liu, Chunlong
    JOURNAL OF THERMOPHYSICS AND HEAT TRANSFER, 2020, 34 (04) : 683 - 695
  • [9] Deep Deterministic Policy Gradient to Regulate Feedback Control Systems Using Reinforcement Learning
    Arshad, Jehangir
    Khan, Ayesha
    Aftab, Mariam
    Hussain, Mujtaba
    Rehman, Ateeq Ur
    Ahmad, Shafiq
    Al-Shayea, Adel M.
    Shafiq, Muhammad
    CMC-COMPUTERS MATERIALS & CONTINUA, 2022, 71 (01): : 1153 - 1169
  • [10] Improvement of PMSM Control Using Reinforcement Learning Deep Deterministic Policy Gradient Agent
    Nicola, Marcel
    Nicola, Claudiu-Ionel
    2021 21ST INTERNATIONAL SYMPOSIUM ON POWER ELECTRONICS (EE 2021), 2021,