Control-Tutored Reinforcement Learning: Towards the Integration of Data-Driven and Model-Based Control

被引:0
|
作者
DeLellis, Francesco [1 ]
Coraggio, Marco [2 ]
Russo, Giovanni [3 ]
Musolesi, Mirco [4 ,5 ]
di Bernardo, Mario [1 ,2 ]
机构
[1] Univ Naples Federico II, Naples, Italy
[2] Scuola Super Merid, Naples, Italy
[3] Univ Salerno, Salerno, Italy
[4] UCL, London, England
[5] Univ Bologna, Bologna, Italy
来源
LEARNING FOR DYNAMICS AND CONTROL CONFERENCE, VOL 168 | 2022年 / 168卷
关键词
Reinforcement learning based control; data-driven control; feedback control;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We present an architecture where a feedback controller derived on an approximate model of the environment assists the learning process to enhance its data efficiency. This architecture, which we term as Control-Tutored Q-Learning (CTQL), is presented in two alternative flavours. The former is based on defining the reward function so that a Boolean condition can be used to determine when the control tutor policy is adopted, while the latter, termed as probabilistic CTQL (pCTQL), is instead based on executing calls to the tutor with a certain probability during learning. Both approaches are validated, and thoroughly benchmarked against Q-Learning, by considering the stabilization of an inverted pendulum as defined in OpenAI Gym as a representative problem.
引用
收藏
页数:12
相关论文
共 50 条
  • [11] DATA-DRIVEN MODEL-FREE ITERATIVE LEARNING CONTROL USING REINFORCEMENT LEARNING
    Song, Bing
    Phan, Minh Q.
    Longman, Richard W.
    ASTRODYNAMICS 2018, PTS I-IV, 2019, 167 : 2579 - 2597
  • [12] Model-free Data-driven Predictive Control Using Reinforcement Learning
    Sawant, Shambhuraj
    Reinhardt, Dirk
    Kordabad, Arash Bahari
    Gros, Sebastien
    2023 62ND IEEE CONFERENCE ON DECISION AND CONTROL, CDC, 2023, : 4046 - 4052
  • [13] Synthesis of model predictive control based on data-driven learning
    Zhou, Yuanqiang
    Li, Dewei
    Xi, Yugeng
    Gan, Zhongxue
    SCIENCE CHINA-INFORMATION SCIENCES, 2020, 63 (08)
  • [14] Synthesis of model predictive control based on data-driven learning
    Yuanqiang Zhou
    Dewei Li
    Yugeng Xi
    Zhongxue Gan
    Science China Information Sciences, 2020, 63
  • [15] Model-based and data-driven model-reference control: a comparative analysis
    Formentin, Simone
    van Heusden, Klaske
    Karimi, Alireza
    2013 EUROPEAN CONTROL CONFERENCE (ECC), 2013, : 1410 - 1415
  • [16] Synthesis of model predictive control based on data-driven learning
    Yuanqiang ZHOU
    Dewei LI
    Yugeng XI
    Zhongxue GAN
    Science China(Information Sciences), 2020, 63 (08) : 251 - 253
  • [17] Underactuated MIMO Airship Control Based on Online Data-Driven Reinforcement Learning
    Boase, Derek
    Gueaieb, Wail
    Miah, Md Suruz
    2023 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2023, : 9464 - 9471
  • [18] Reinforcement Learning based Data-driven Optimal Control Strategy for Systems with Disturbance
    Fan, Zhong-Xin
    Li, Shihua
    Liu, Rongjie
    2023 IEEE 12TH DATA DRIVEN CONTROL AND LEARNING SYSTEMS CONFERENCE, DDCLS, 2023, : 567 - 572
  • [19] Model-Based Reinforcement Learning For Robot Control
    Li, Xiang
    Shang, Weiwei
    Cong, Shuang
    2020 5TH INTERNATIONAL CONFERENCE ON ADVANCED ROBOTICS AND MECHATRONICS (ICARM 2020), 2020, : 300 - 305
  • [20] Control Approach Combining Reinforcement Learning and Model-Based Control
    Okawa, Yoshihiro
    Sasaki, Tomotake
    Iwane, Hidenao
    2019 12TH ASIAN CONTROL CONFERENCE (ASCC), 2019, : 1419 - 1424