Output-Feedback Synthesis Orbit Geometry: Quotient Manifolds and LQG Direct Policy Optimization

Citations: 0

Authors:

Kraisler, Spencer [1]

Mesbahi, Mehran [1]

Affiliations:

[1] Univ Washington, William E Boeing Dept Aeronaut & Astronaut, Seattle, WA 98115 USA

Source:

IEEE CONTROL SYSTEMS LETTERS | 2024 / Vol. 8

Keywords:

Measurement; Optimization; Space vehicles; Aerospace electronics; Manifolds; Orbits; Geometry; Policy optimization; linear-quadratic gaussian synthesis; coordinate-invariant Riemannian metrics; quotient manifolds; SYSTEMS;

DOI:

10.1109/LCSYS.2024.3414962

Chinese Library Classification:

TP [Automation technology, computer technology];

Discipline classification code:

0812;

Abstract:

We consider direct policy optimization for the linear-quadratic Gaussian (LQG) setting. Over the past few years, it has been recognized that the landscape of dynamic output-feedback controllers of relevance to LQG has an intricate geometry, particularly pertaining to the existence of degenerate stationary points, that hinders gradient methods. In order to address these challenges, in this letter, we adopt a system-theoretic coordinate-invariant Riemannian metric for the space of dynamic output-feedback controllers and develop a Riemannian gradient descent for direct LQG policy optimization. We then proceed to prove that the orbit space of such controllers, modulo the coordinate transformation, admits a Riemannian quotient manifold structure. This geometric structure, which is of independent interest, provides an effective approach to derive direct policy optimization algorithms for LQG with a local linear convergence rate guarantee. Subsequently, we show that the proposed approach exhibits significantly faster and more robust numerical performance as compared with ordinary gradient descent.
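The "coordinate transformation" mentioned in the abstract can be illustrated with a small numerical sketch. It assumes the standard state-space parameterization of a dynamic controller as a triple (A_K, B_K, C_K) and the usual similarity action T · (A_K, B_K, C_K) = (T A_K T⁻¹, T B_K, C_K T⁻¹); the function names, dimensions, and matrices below are illustrative choices, not taken from the letter. The sketch only checks that every controller on one orbit has the same Markov parameters, and therefore the same input-output behavior, which is why it is natural to optimize over the orbit space. It does not implement the Riemannian metric or the gradient descent of the letter.

```python
import numpy as np


def markov_parameters(A_K, B_K, C_K, count=5):
    """Return [C_K B_K, C_K A_K B_K, C_K A_K^2 B_K, ...] (count terms)."""
    params = []
    power = np.eye(A_K.shape[0])  # holds A_K^k, starting at k = 0
    for _ in range(count):
        params.append(C_K @ power @ B_K)
        power = power @ A_K
    return params


def transform(A_K, B_K, C_K, T):
    """Apply the change of controller-state coordinates given by invertible T."""
    T_inv = np.linalg.inv(T)
    return T @ A_K @ T_inv, T @ B_K, C_K @ T_inv


# Hypothetical controller: 3 states, 2 measured outputs in, 2 control inputs out.
rng = np.random.default_rng(0)
A_K = rng.standard_normal((3, 3))
B_K = rng.standard_normal((3, 2))
C_K = rng.standard_normal((2, 3))

# Upper-triangular T with nonzero diagonal, so it is certainly invertible.
T = np.array([[2.0, 1.0, 0.0],
              [0.0, 1.0, 1.0],
              [0.0, 0.0, 3.0]])

A_T, B_T, C_T = transform(A_K, B_K, C_K, T)

# The parameters differ, yet the input-output description is unchanged.
parameters_changed = not np.allclose(A_K, A_T)
same_behavior = all(
    np.allclose(m0, m1)
    for m0, m1 in zip(markov_parameters(A_K, B_K, C_K),
                      markov_parameters(A_T, B_T, C_T))
)
print(parameters_changed, same_behavior)
```

Because any cost that depends only on closed-loop input-output behavior is constant along such an orbit, a plain Euclidean gradient in (A_K, B_K, C_K) depends on the arbitrary choice of coordinates, which is the motivation the abstract gives for a coordinate-invariant metric.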


Pages: 1577 - 1582

Page count: 6

Related items (50 in total):

[1] Optimal Decentralized Output-Feedback LQG Control With Random Communication Delay
Wang, Yan
Xiong, Junlin
IEEE TRANSACTIONS ON CYBERNETICS, 2020, 50 (01) : 338 - 350
[2] On the Global Optimality of Direct Policy Search for Nonsmooth H∞ Output-Feedback Control
Tang, Yujie
Zheng, Yang
2023 62ND IEEE CONFERENCE ON DECISION AND CONTROL, CDC, 2023, : 6148 - 6153
[3] ROBUST OUTPUT-FEEDBACK CONTROLLER - DIRECT DESIGN
CHEN, YH
INTERNATIONAL JOURNAL OF CONTROL, 1987, 46 (03) : 1083 - 1091
[4] OUTPUT-FEEDBACK MATRICES IN THE PRESENCE OF DIRECT FEEDTHROUGH
FLETCHER, LR
INTERNATIONAL JOURNAL OF SYSTEMS SCIENCE, 1981, 12 (12) : 1493 - 1495
[5] Decentralised output-feedback LQG control with one-step communication delay
Wang, Yan
Xiong, Junlin
Ren, Wei
INTERNATIONAL JOURNAL OF CONTROL, 2018, 91 (08) : 1920 - 1930
[6] OPTIMAL DIRECT OUTPUT-FEEDBACK OF STRUCTURAL CONTROL
CHUNG, LL
LIN, CC
CHU, SY
JOURNAL OF ENGINEERING MECHANICS, 1993, 119 (11) : 2157 - 2173
[7] OUTPUT-FEEDBACK STABILIZATION - SOLUTION BY ALGEBRAIC GEOMETRY METHODS
ANDERSON, BDO
SCOTT, RW
PROCEEDINGS OF THE IEEE, 1977, 65 (06) : 849 - 861
[8] DIRECT OUTPUT-FEEDBACK CONTROL OF LARGE SPACE STRUCTURES
BALAS, MJ
JOURNAL OF THE ASTRONAUTICAL SCIENCES, 1979, 27 (02) : 157 - 180
[9] A Homotopy Approach for Robust Output-Feedback Synthesis
Halicki, Tobias
Scherer, Carsten W.
2019 27TH MEDITERRANEAN CONFERENCE ON CONTROL AND AUTOMATION (MED), 2019, : 87 - 93
[10] Multiobjective output-feedback control via LMI optimization
Scherer, C
Gahinet, P
Chilali, M
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 1997, 42 (07) : 896 - 911
