Online Linear Quadratic Control

被引:0
|
作者
Cohen, Alon [1 ,2 ]
Hassidim, Avinatan [1 ,3 ]
Koren, Tomer [4 ]
Lazic, Nevena [4 ]
Mansour, Yishay [1 ,5 ]
Talwar, Kunal [4 ]
机构
[1] Google Res, Tel Aviv, Israel
[2] Technion Israel Inst Technol, Haifa, Israel
[3] Bar Ilan Univ, Ramat Gan, Israel
[4] Google Brain, Mountain View, CA 94043 USA
[5] Tel Aviv Univ, Tel Aviv, Israel
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We study the problem of controlling linear time-invariant systems with known noisy dynamics and adversarially chosen quadratic losses. We present the first efficient online learning algorithms in this setting that guarantee O(root T) regret under mild assumptions, where T is the time horizon. Our algorithms rely on a novel SDP relaxation for the steady-state distribution of the system. Crucially, and in contrast to previously proposed relaxations, the feasible solutions of our SDP all correspond to "strongly stable" policies that mix exponentially fast to a steady state.
引用
收藏
页数:10
相关论文
共 50 条
  • [31] Linear quadratic optimal control of networked control system
    Wang, Zhi-Wen
    Gao, Hong-Hong
    2015 27TH CHINESE CONTROL AND DECISION CONFERENCE (CCDC), 2015, : 2177 - 2182
  • [32] Online Inverse Linear-Quadratic Differential Games Applied to Human Behavior Identification in Shared Control
    Inga, Jairo
    Creutz, Andreas
    Hohmann, Soeren
    2021 EUROPEAN CONTROL CONFERENCE (ECC), 2021, : 353 - 360
  • [33] An online value iteration method for linear-quadratic mean field social control with unknown dynamics
    Wang, Bing-Chang
    Li, Shumei
    Cao, Ying
    SCIENCE CHINA-INFORMATION SCIENCES, 2024, 67 (04)
  • [34] An online value iteration method for linear-quadratic mean field social control with unknown dynamics
    Bing-Chang WANG
    Shumei LI
    Ying CAO
    ScienceChina(InformationSciences), 2024, 67 (04) : 38 - 39
  • [35] Ergodic Problems for Linear Exponential Quadratic Gaussian Control and Linear Quadratic Stochastic Differential Games
    Duncan, T. E.
    Pasik-Duncan, B.
    2013 IEEE 52ND ANNUAL CONFERENCE ON DECISION AND CONTROL (CDC), 2013, : 2488 - 2492
  • [36] Linear-quadratic control and quadratic differential forms for multidimensional behaviors
    Napp, D.
    Trentelman, H. L.
    LINEAR ALGEBRA AND ITS APPLICATIONS, 2011, 434 (01) : 117 - 130
  • [37] Stochastic linear quadratic optimal control problems
    Chen, S
    Yong, J
    APPLIED MATHEMATICS AND OPTIMIZATION, 2001, 43 (01): : 21 - 45
  • [38] Linear quadratic performance criteria for cascade control
    Gattami, Ather
    Rantzer, Anders
    2005 44TH IEEE CONFERENCE ON DECISION AND CONTROL & EUROPEAN CONTROL CONFERENCE, VOLS 1-8, 2005, : 3632 - 3637
  • [39] A MULTIOBJECTIVE LINEAR QUADRATIC GAUSSIAN CONTROL PROBLEM
    TOIVONEN, HT
    IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 1984, 29 (03) : 279 - 280
  • [40] Gain scheduled linear quadratic control for quadcopter
    Okasha, M.
    Shah, J.
    Fauzi, W.
    Hanouf, Z.
    AEROS CONFERENCE 2017, 2017, 270