Explainable data-driven Q-learning control for a class of discrete-time linear autonomous systems

被引:0
|
作者
Perrusquia, Adolfo [1 ]
Zou, Mengbang [1 ]
Guo, Weisi [1 ]
机构
[1] Cranfield Univ, Sch Aerosp Transport & Mfg, Bedford MK43 0AL, England
关键词
Q-learning; State-transition function; Explainable Q-learning (XQL); Control policy; REINFORCEMENT; IDENTIFICATION;
D O I
10.1016/j.ins.2024.121283
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Explaining what a reinforcement learning (RL) control agent learns play a crucial role in the safety critical control domain. Most of the approaches in the state-of-the-art focused on imitation learning methods that uncover the hidden reward function of a given control policy. However, these approaches do not uncover what the RL agent learns effectively from the agent-environment interaction. The policy learned by the RL agent depends in how good the state transition mapping is inferred from the data. When the state transition mapping is wrongly inferred implies that the RL agent is not learning properly. This can compromise the safety of the surrounding environment and the agent itself. In this paper, we aim to uncover the elements learned by data-driven RL control agents in a special class of discrete-time linear autonomous systems. Here, the approach aims to add a new explainable dimension to data-driven control approaches to increase their trust and safe deployment. We focus on the classical data-driven Q-learning algorithm and propose an explainable Q-learning (XQL) algorithm that can be further expanded to other data-driven RL control agents. Simulation experiments are conducted to observe the effectiveness of the proposed approach under different scenarios using several discrete-time models of autonomous platforms.
引用
收藏
页数:15
相关论文
共 50 条
  • [41] Data-Driven Robust Control of Discrete-Time Uncertain Linear Systems via Off-Policy Reinforcement Learning
    Yang, Yongliang
    Guo, Zhishan
    Xiong, Haoyi
    Ding, Da-Wei
    Yin, Yixin
    Wunsch, Donald C.
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2019, 30 (12) : 3735 - 3747
  • [42] A Novel Data-Driven Filtering Algorithm for a Class of Discrete-Time Nonlinear Systems
    Fan, Lingling
    Hou, Zhongsheng
    Cao, Rongmin
    Ji, Honghai
    PROCEEDINGS OF 2018 IEEE 7TH DATA DRIVEN CONTROL AND LEARNING SYSTEMS CONFERENCE (DDCLS), 2018, : 763 - 769
  • [43] Iterative learning control for a class of linear discrete-time switched systems
    Bu, Xu-Hui
    Yu, Fa-Shan
    Hou, Zhong-Sheng
    Wang, Fu-Zhong
    Zidonghua Xuebao/Acta Automatica Sinica, 2013, 39 (09): : 1564 - 1569
  • [44] Stochastic linear quadratic optimal tracking control for discrete-time systems with delays based on Q-learning algorithm
    Tan, Xufeng
    Li, Yuan
    Liu, Yang
    AIMS MATHEMATICS, 2023, 8 (05): : 10249 - 10265
  • [45] Robust Data-Driven Moving Horizon Estimation for Linear Discrete-Time Systems
    Wolff, Tobias M.
    Lopez, Victor G.
    Mueller, Matthias A.
    IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2024, 69 (08) : 5598 - 5604
  • [46] Data-driven Optimal Preview Output Tracking of Linear Discrete-time Systems
    Liu, Zhou-Yang
    Wu, Huai-Ning
    PROCEEDINGS OF THE 38TH CHINESE CONTROL CONFERENCE (CCC), 2019, : 1973 - 1978
  • [47] Data-driven Finite-horizon Optimal Control for Linear Time-varying Discrete-time Systems
    Pang, Bo
    Bian, Tao
    Jiang, Zhong-Ping
    2018 IEEE CONFERENCE ON DECISION AND CONTROL (CDC), 2018, : 861 - 866
  • [48] Data-Driven Model-Free Adaptive Control for a Class of MIMO Nonlinear Discrete-Time Systems
    Hou, Zhongsheng
    Jin, Shangtai
    IEEE TRANSACTIONS ON NEURAL NETWORKS, 2011, 22 (12): : 2173 - 2188
  • [49] Comparisons of Continuous-time and Discrete-time Q-learning Schemes for Adaptive Linear Quadratic Control
    Chun, Tae Yoon
    Lee, Jae Young
    Park, Jin Bae
    Choi, Yoon Ho
    2012 PROCEEDINGS OF SICE ANNUAL CONFERENCE (SICE), 2012, : 1228 - 1233
  • [50] Direct Data-Driven Optimal Set-Point Tracking Control of Linear Discrete-Time Systems
    Xu, Yao
    Zhou, Linna
    Zhao, Jianguo
    Ma, Lei
    Yang, Chunyu
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS, 2024, 71 (08) : 3795 - 3799