Control-Tutored Reinforcement Learning: Towards the Integration of Data-Driven and Model-Based Control

被引：0

作者：

DeLellis, Francesco ^{[1
]}

Coraggio, Marco ^{[2
]}

Russo, Giovanni ^{[3
]}

Musolesi, Mirco ^{[4
,5
]}

di Bernardo, Mario ^{[1
,2
]}

机构：

[1] Univ Naples Federico II, Naples, Italy

[2] Scuola Super Merid, Naples, Italy

[3] Univ Salerno, Salerno, Italy

[4] UCL, London, England

[5] Univ Bologna, Bologna, Italy

来源：

LEARNING FOR DYNAMICS AND CONTROL CONFERENCE, VOL 168 | 2022年 / 168卷

关键词：

Reinforcement learning based control; data-driven control; feedback control;

D O I：

暂无

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

We present an architecture where a feedback controller derived on an approximate model of the environment assists the learning process to enhance its data efficiency. This architecture, which we term as Control-Tutored Q-Learning (CTQL), is presented in two alternative flavours. The former is based on defining the reward function so that a Boolean condition can be used to determine when the control tutor policy is adopted, while the latter, termed as probabilistic CTQL (pCTQL), is instead based on executing calls to the tutor with a certain probability during learning. Both approaches are validated, and thoroughly benchmarked against Q-Learning, by considering the stabilization of an inverted pendulum as defined in OpenAI Gym as a representative problem.

引用

页数：12

共 50 条

[11] DATA-DRIVEN MODEL-FREE ITERATIVE LEARNING CONTROL USING REINFORCEMENT LEARNING
Song, Bing
Phan, Minh Q.
Longman, Richard W.
ASTRODYNAMICS 2018, PTS I-IV, 2019, 167 : 2579 - 2597
[12] Model-free Data-driven Predictive Control Using Reinforcement Learning
Sawant, Shambhuraj
Reinhardt, Dirk
Kordabad, Arash Bahari
Gros, Sebastien
2023 62ND IEEE CONFERENCE ON DECISION AND CONTROL, CDC, 2023, : 4046 - 4052
[13] Synthesis of model predictive control based on data-driven learning
Zhou, Yuanqiang
Li, Dewei
Xi, Yugeng
Gan, Zhongxue
SCIENCE CHINA-INFORMATION SCIENCES, 2020, 63 (08)
[14] Synthesis of model predictive control based on data-driven learning
Yuanqiang Zhou
Dewei Li
Yugeng Xi
Zhongxue Gan
Science China Information Sciences, 2020, 63
[15] Model-based and data-driven model-reference control: a comparative analysis
Formentin, Simone
van Heusden, Klaske
Karimi, Alireza
2013 EUROPEAN CONTROL CONFERENCE (ECC), 2013, : 1410 - 1415
[16] Synthesis of model predictive control based on data-driven learning
Yuanqiang ZHOU
Dewei LI
Yugeng XI
Zhongxue GAN
Science China(Information Sciences), 2020, 63 (08) : 251 - 253
[17] Underactuated MIMO Airship Control Based on Online Data-Driven Reinforcement Learning
Boase, Derek
Gueaieb, Wail
Miah, Md Suruz
2023 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2023, : 9464 - 9471
[18] Reinforcement Learning based Data-driven Optimal Control Strategy for Systems with Disturbance
Fan, Zhong-Xin
Li, Shihua
Liu, Rongjie
2023 IEEE 12TH DATA DRIVEN CONTROL AND LEARNING SYSTEMS CONFERENCE, DDCLS, 2023, : 567 - 572
[19] Model-Based Reinforcement Learning For Robot Control
Li, Xiang
Shang, Weiwei
Cong, Shuang
2020 5TH INTERNATIONAL CONFERENCE ON ADVANCED ROBOTICS AND MECHATRONICS (ICARM 2020), 2020, : 300 - 305
[20] Control Approach Combining Reinforcement Learning and Model-Based Control
Okawa, Yoshihiro
Sasaki, Tomotake
Iwane, Hidenao
2019 12TH ASIAN CONTROL CONFERENCE (ASCC), 2019, : 1419 - 1424

← 1 2 3 4 5 →