On the Sample Complexity of the Linear Quadratic Regulator

被引:196
|
作者
Dean, Sarah [1 ]
Mania, Horia [1 ]
Matni, Nikolai [2 ]
Recht, Benjamin [1 ]
Tu, Stephen [1 ]
机构
[1] Univ Calif Berkeley, Dept Elect Engn & Comp Sci, Berkeley, CA 94720 USA
[2] CALTECH, Dept Comp & Math Sci, Pasadena, CA 91125 USA
基金
美国国家科学基金会;
关键词
Optimal control; Robust control; System identification; Statistical learning theory; Reinforcement learning; System level synthesis; SYSTEM-IDENTIFICATION; BOUNDS; CONVERGENCE; RATES;
D O I
10.1007/s10208-019-09426-y
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
This paper addresses the optimal control problem known as the linear quadratic regulator in the case when the dynamics are unknown. We propose a multistage procedure, calledCoarse-ID control, that estimates a model from a few experimental trials, estimates the error in that model with respect to the truth, and then designs a controller using both the model and uncertainty estimate. Our technique uses contemporary tools from random matrix theory to bound the error in the estimation procedure. We also employ a recently developed approach to control synthesis calledSystem Level Synthesisthat enables robust control design by solving a quasi-convex optimization problem. We provide end-to-end bounds on the relative error in control cost that are optimal in the number of parameters and that highlight salient properties of the system to be controlled such as closed-loop sensitivity and optimal control magnitude. We show experimentally that the Coarse-ID approach enables efficient computation of a stabilizing controller in regimes where simple control schemes that do not take the model uncertainty into account fail to stabilize the true system.
引用
收藏
页码:633 / 679
页数:47
相关论文
共 50 条
  • [41] Synthesis of a Regulator for a Linear Quadratic Optimal Control Problem
    Gabasov, R.
    Lubochkin, A. V.
    Automation and Remote Control (English translation of Avtomatika i Telemekhanika), 58 (01):
  • [42] Computation of the constrained infinite time linear quadratic regulator
    Grieder, P
    Borrelli, F
    Torrisi, F
    Morari, M
    PROCEEDINGS OF THE 2003 AMERICAN CONTROL CONFERENCE, VOLS 1-6, 2003, : 4711 - 4716
  • [43] LINEAR QUADRATIC OPTIMAL REGULATOR FOR DISCRETE IMPLICIT SYSTEMS
    BERNHARD, P
    GRIMM, J
    WANG, XM
    RAIRO-AUTOMATIQUE-PRODUCTIQUE INFORMATIQUE INDUSTRIELLE-AUTOMATIC CONTROL PRODUCTION SYSTEMS, 1990, 24 (01): : 17 - 36
  • [44] Computcation of the constrained infinite time linear quadratic regulator
    Grieder, P
    Borrelli, F
    Torrisi, F
    Morari, M
    AUTOMATICA, 2004, 40 (04) : 701 - 708
  • [45] Implementation of Linear Quadratic Regulator in an Isolated Microgrid System
    Sanki, Prasun
    Basu, Mousumi
    Pal, Partha Sarathi
    Das, Debapriya
    PROCEEDINGS OF 3RD IEEE CONFERENCE ON VLSI DEVICE, CIRCUIT AND SYSTEM (IEEE VLSI DCS 2022), 2022, : 104 - 109
  • [46] Vehicle Dynamics Control Based on Linear Quadratic Regulator
    Zhang, Siqi
    Zhang, Tianxia
    Zhou, Shuwen
    E-ENGINEERING & DIGITAL ENTERPRISE TECHNOLOGY VII, PTS 1 AND 2, 2009, 16-19 : 876 - 880
  • [47] Maximum-Entropy Satisficing Linear Quadratic Regulator
    Esmzad, Ramin
    Modares, Hamidreza
    IEEE CONTROL SYSTEMS LETTERS, 2023, 7 : 3241 - 3246
  • [48] A linear quadratic regulator for nonlinear SIRC epidemic model
    Di Giamberardino, Paolo
    Iacoviello, Daniela
    2019 23RD INTERNATIONAL CONFERENCE ON SYSTEM THEORY, CONTROL AND COMPUTING (ICSTCC), 2019, : 733 - 738
  • [49] Linear Quadratic Regulator with Decentralized Event-Triggering
    Nakajima, Kyohei
    Kobayashi, Koichi
    Yamashita, Yuh
    IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 2017, E100A (02) : 414 - 420
  • [50] Effect of process nonlinearity on linear quadratic regulator performance
    Guay, M
    Dier, R
    Hahn, J
    McLellan, PJ
    JOURNAL OF PROCESS CONTROL, 2005, 15 (01) : 113 - 124