Online Linear Quadratic Control

Cited by: 0
Authors
Cohen, Alon [1, 2]
Hassidim, Avinatan [1, 3]
Koren, Tomer [4]
Lazic, Nevena [4]
Mansour, Yishay [1, 5]
Talwar, Kunal [4]
Affiliations
[1] Google Res, Tel Aviv, Israel
[2] Technion Israel Inst Technol, Haifa, Israel
[3] Bar Ilan Univ, Ramat Gan, Israel
[4] Google Brain, Mountain View, CA 94043 USA
[5] Tel Aviv Univ, Tel Aviv, Israel
Keywords
DOI
Not available
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
We study the problem of controlling linear time-invariant systems with known noisy dynamics and adversarially chosen quadratic losses. We present the first efficient online learning algorithms in this setting that guarantee O(√T) regret under mild assumptions, where T is the time horizon. Our algorithms rely on a novel SDP relaxation for the steady-state distribution of the system. Crucially, and in contrast to previously proposed relaxations, the feasible solutions of our SDP all correspond to "strongly stable" policies that mix exponentially fast to a steady state.
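The setting in the abstract can be written down along the following lines. This is a sketch only: the symbols A, B, w_t, Q_t, R_t, the comparator class Π, and the superscript π marking the trajectory under a fixed policy are assumptions introduced for illustration, since this record does not define them.

```latex
% Assumed notation: state x_t, control u_t, noise w_t; A and B are known.
\begin{align*}
  x_{t+1} &= A x_t + B u_t + w_t
    && \text{(linear time-invariant dynamics with noise)} \\
  c_t(x, u) &= x^\top Q_t\, x + u^\top R_t\, u
    && \text{(quadratic loss, with $Q_t, R_t$ chosen adversarially)} \\
  \mathrm{Regret}_T &= \sum_{t=1}^{T} c_t(x_t, u_t)
    - \min_{\pi \in \Pi} \sum_{t=1}^{T} c_t\bigl(x_t^{\pi}, u_t^{\pi}\bigr)
    && \text{(comparison with the best fixed policy in $\Pi$)}
\end{align*}
% The abstract's guarantee then reads Regret_T = O(\sqrt{T}).
```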
Pages: 10