Control-Tutored Reinforcement Learning: Towards the Integration of Data-Driven and Model-Based Control

被引:0
|
作者
DeLellis, Francesco [1 ]
Coraggio, Marco [2 ]
Russo, Giovanni [3 ]
Musolesi, Mirco [4 ,5 ]
di Bernardo, Mario [1 ,2 ]
机构
[1] Univ Naples Federico II, Naples, Italy
[2] Scuola Super Merid, Naples, Italy
[3] Univ Salerno, Salerno, Italy
[4] UCL, London, England
[5] Univ Bologna, Bologna, Italy
关键词
Reinforcement learning based control; data-driven control; feedback control;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We present an architecture where a feedback controller derived on an approximate model of the environment assists the learning process to enhance its data efficiency. This architecture, which we term as Control-Tutored Q-Learning (CTQL), is presented in two alternative flavours. The former is based on defining the reward function so that a Boolean condition can be used to determine when the control tutor policy is adopted, while the latter, termed as probabilistic CTQL (pCTQL), is instead based on executing calls to the tutor with a certain probability during learning. Both approaches are validated, and thoroughly benchmarked against Q-Learning, by considering the stabilization of an inverted pendulum as defined in OpenAI Gym as a representative problem.
引用
收藏
页数:12
相关论文
共 50 条
  • [21] On the Performance of Data-Driven Reinforcement Learning for Commercial HVAC Control
    Faddel, Samy
    Tian, Guanyu
    Zhou, Qun
    Aburub, Haneen
    2020 IEEE INDUSTRY APPLICATIONS SOCIETY ANNUAL MEETING, 2020,
  • [22] DATA-EFFICIENT MODEL-BASED REINFORCEMENT LEARNING FOR ROBOT CONTROL
    Sun, Ming
    Gao, Yue
    Liu, Wei
    Li, Shaoyuan
    INTERNATIONAL JOURNAL OF ROBOTICS & AUTOMATION, 2021, 36 (04): : 211 - 218
  • [23] Safe Reinforcement Learning using Data-Driven Predictive Control
    Selim, Mahmoud
    Alanwar, Amr
    El-Kharashi, M. Watheq
    Abbas, Hazem M.
    Johansson, Karl H.
    2022 5TH INTERNATIONAL CONFERENCE ON COMMUNICATIONS, SIGNAL PROCESSING, AND THEIR APPLICATIONS (ICCSPA), 2022,
  • [24] Enhancing Model-Based Traffic Signal Control with Data-Driven Adaptive Optimization
    Zhang, Xuanyu
    Hu, Fuyu
    Huang, Wei
    CICTP 2022: INTELLIGENT, GREEN, AND CONNECTED TRANSPORTATION, 2022, : 346 - 356
  • [25] Hybrid Optimal Traffic Control: Combining Model-Based and Data-Driven Approaches
    Baumgart, Urs
    Burger, Michael
    PROCEEDINGS OF THE 9TH INTERNATIONAL CONFERENCE ON VEHICLE TECHNOLOGY AND INTELLIGENT TRANSPORT SYSTEMS, VEHITS 2023, 2023, : 85 - 94
  • [26] Model-Based Evaluation of a Data-Driven Control Strategy: Application to Ibuprofen Crystallization
    Montes, Frederico C. C.
    Oner, Merve
    Gernaey, Krist, V
    Sin, Gurkan
    PROCESSES, 2021, 9 (04)
  • [27] Data-driven Characterization of Human Interaction for Model-based Control of Powered Prostheses
    Gehlhar, Rachel
    Chen, Yuxiao
    Ames, Aaron D.
    2020 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2020, : 4126 - 4133
  • [28] Model-Based and Data-Driven HVAC Control Strategies for Residential Demand Response
    Kou, Xiao
    Du, Yan
    Li, Fangxing
    Pulgar-Painemal, Hector
    Zandi, Helia
    Dong, Jin
    Olama, Mohammed M.
    IEEE OPEN ACCESS JOURNAL OF POWER AND ENERGY, 2021, 8 : 186 - 197
  • [29] Data-driven adaptive model-based predictive control with application in wastewater systems
    Wahab, N. A.
    Katebi, R.
    Balderud, J.
    Rahmat, M. F.
    IET CONTROL THEORY AND APPLICATIONS, 2011, 5 (06): : 803 - 812
  • [30] Data-driven and Model-based Hybrid Reinforcement Learning to Reduce Stress on Power Systems Branches
    Kamel, Mariana
    Dai, Renchang
    Wang, Yawei
    Li, Fangxing
    Liu, Guangyi
    CSEE JOURNAL OF POWER AND ENERGY SYSTEMS, 2021, 7 (03): : 433 - 442