On the Sample Complexity of the Linear Quadratic Regulator

被引:196
|
作者
Dean, Sarah [1 ]
Mania, Horia [1 ]
Matni, Nikolai [2 ]
Recht, Benjamin [1 ]
Tu, Stephen [1 ]
机构
[1] Univ Calif Berkeley, Dept Elect Engn & Comp Sci, Berkeley, CA 94720 USA
[2] CALTECH, Dept Comp & Math Sci, Pasadena, CA 91125 USA
基金
美国国家科学基金会;
关键词
Optimal control; Robust control; System identification; Statistical learning theory; Reinforcement learning; System level synthesis; SYSTEM-IDENTIFICATION; BOUNDS; CONVERGENCE; RATES;
D O I
10.1007/s10208-019-09426-y
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
This paper addresses the optimal control problem known as the linear quadratic regulator in the case when the dynamics are unknown. We propose a multistage procedure, calledCoarse-ID control, that estimates a model from a few experimental trials, estimates the error in that model with respect to the truth, and then designs a controller using both the model and uncertainty estimate. Our technique uses contemporary tools from random matrix theory to bound the error in the estimation procedure. We also employ a recently developed approach to control synthesis calledSystem Level Synthesisthat enables robust control design by solving a quasi-convex optimization problem. We provide end-to-end bounds on the relative error in control cost that are optimal in the number of parameters and that highlight salient properties of the system to be controlled such as closed-loop sensitivity and optimal control magnitude. We show experimentally that the Coarse-ID approach enables efficient computation of a stabilizing controller in regimes where simple control schemes that do not take the model uncertainty into account fail to stabilize the true system.
引用
收藏
页码:633 / 679
页数:47
相关论文
共 50 条
  • [21] The explicit linear quadratic regulator for constrained systems
    Bemporad, A
    Morari, M
    Dua, V
    Pistikopoulos, EN
    AUTOMATICA, 2002, 38 (01) : 3 - 20
  • [22] LIMITS OF PROPRIETY FOR LINEAR QUADRATIC REGULATOR PROBLEMS
    JOHNSON, CD
    INTERNATIONAL JOURNAL OF CONTROL, 1987, 45 (05) : 1835 - 1846
  • [23] Random search for learning the linear quadratic regulator
    Mohammadi, Hesameddin
    Soltanolkotabi, Mandi
    Jovanovic, Mihailo R.
    2020 AMERICAN CONTROL CONFERENCE (ACC), 2020, : 4798 - 4803
  • [24] On the problem of a quadratic regulator for linear discrete systems
    Minyuk, SA
    DOKLADY AKADEMII NAUK BELARUSI, 1998, 42 (04): : 29 - 32
  • [25] Effect of nonlinearity on linear quadratic regulator performance
    Guay, M
    Forbes, JF
    2004 43RD IEEE CONFERENCE ON DECISION AND CONTROL (CDC), VOLS 1-5, 2004, : 2267 - 2272
  • [26] ISWEC linear quadratic regulator oscillating control
    Vissio, Giacomo
    Valerio, Duarte
    Bracco, Giovanni
    Beirao, Pedro
    Pozzi, Nicola
    Mattiazzo, Giuliana
    RENEWABLE ENERGY, 2017, 103 : 372 - 382
  • [27] On the Sample Complexity and Optimization Landscape for Quadratic Feasibility Problems
    Thaker, Parth K.
    Dasarathy, Gautam
    Nedic, Angelia
    2020 IEEE INTERNATIONAL SYMPOSIUM ON INFORMATION THEORY (ISIT), 2020, : 1438 - 1443
  • [28] Linear Quadratic Regulator of Discrete-Time Switched Linear Systems
    Wu, Guangyu
    Xiong, Lu
    Wang, Gang
    Sun, Jian
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS, 2020, 67 (12) : 3113 - 3117
  • [29] Evaluation of Input Redundancies on Linear Quadratic Regulator Problems
    Peng, Zhongxing
    Yang, Ying
    JOURNAL OF OPTIMIZATION THEORY AND APPLICATIONS, 2012, 155 (01) : 325 - 335
  • [30] DETERMINATION OF WEIGHTING MATRICES OF A LINEAR-QUADRATIC REGULATOR
    LUO, J
    LAN, CE
    JOURNAL OF GUIDANCE CONTROL AND DYNAMICS, 1995, 18 (06) : 1462 - 1463