Variational Principles for Mirror Descent and Mirror Langevin Dynamics

被引：0

作者：

Tzen, Belinda ^{[1
]}

Raj, Anant ^{[2
,3
]}

Raginsky, Maxim ^{[3
]}

Bach, Francis ^{[2
]}

机构：

[1] Columbia Univ, Dept Stat, New York, NY 10027 USA

[2] PSL Res Univ, Ecole Normale Super, INRIA, F-75006 Paris, France

[3] Univ Illinois, Dept Elect & Comp Engn, Urbana, IL 61801 USA

来源：

IEEE CONTROL SYSTEMS LETTERS | 2023年 / 7卷

关键词：

Mirrors; Trajectory; Optimal control; Dynamical systems; Costs; Closed loop systems; Geometry; Optimization; optimal control; stochastic optimal control;

D O I：

10.1109/LCSYS.2023.3274069

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Mirror descent, introduced by Nemirovski and Yudin in the 1970s, is a primal-dual convex optimization method that can be tailored to the geometry of the optimization problem at hand through the choice of a strongly convex potential function. It arises as a basic primitive in a variety of applications, including large-scale optimization, machine learning, and control. This letter proposes a variational formulation of mirror descent and of its stochastic variant, mirror Langevin dynamics. The main idea, inspired by the classic work of Brezis and Ekeland on variational principles for gradient flows, is to show that mirror descent emerges as a closed-loop solution for a certain optimal control problem, and the Bellman value function is given by the Bregman divergence between the initial condition and the global minimizer of the objective function.

引用

页码：1542 / 1547

页数：6

共 50 条

[21] Provable Phase Retrieval with Mirror Descent
Godeme, Jean-Jacques
Fadili, Jalal
Buet, Xavier
Zerrad, Myriam
Lequime, Michel
Amra, Claude
SIAM JOURNAL ON IMAGING SCIENCES, 2023, 16 (03): : 1106 - 1141
[22] Solving MRF Minimization by Mirror Descent
Luong, Duy V. N.
Parpas, Panos
Rueckert, Daniel
Rustem, Berc
ADVANCES IN VISUAL COMPUTING, ISVC 2012, PT I, 2012, 7431 : 587 - 598
[23] Conformal mirror descent with logarithmic divergences
Kainth, Amanjit Singh
Wong, Ting-Kam Leonard
Rudzicz, Frank
INFORMATION GEOMETRY, 2024, 7 (SUPPL1) : 303 - 327
[24] Unifying mirror descent and dual averaging
Anatoli Juditsky
Joon Kwon
Éric Moulines
Mathematical Programming, 2023, 199 : 793 - 830
[25] An Accelerated Stochastic Mirror Descent Method
Jiang, Bo-Ou
Yuan, Ya-Xiang
JOURNAL OF THE OPERATIONS RESEARCH SOCIETY OF CHINA, 2024, 12 (03) : 549 - 571
[26] Parameter-free Mirror Descent
Jacobsen, Andrew
Cutkosky, Ashok
CONFERENCE ON LEARNING THEORY, VOL 178, 2022, 178
[27] The Mirror Langevin Algorithm Converges with Vanishing Bias
Li, Ruilin
Tao, Molei
Vempala, Santosh S.
Wibisono, Andre
INTERNATIONAL CONFERENCE ON ALGORITHMIC LEARNING THEORY, VOL 167, 2022, 167
[28] Mirror Descent Learning in Continuous Games
Zhou, Zhengyuan
Mertikopoulos, Panayotis
Moustakas, Aris L.
Bambos, Nicholas
Glynn, Peter
2017 IEEE 56TH ANNUAL CONFERENCE ON DECISION AND CONTROL (CDC), 2017,
[29] Adaptive Mirror Descent for Constrained Optimization
Bayandina, Anastasia
2017 CONSTRUCTIVE NONSMOOTH ANALYSIS AND RELATED TOPICS (DEDICATED TO THE MEMORY OF V.F. DEMYANOV) (CNSA), 2017, : 36 - 39
[30] Policy Optimization with Stochastic Mirror Descent
Yang, Long
Zhang, Yu
Zheng, Gang
Zheng, Qian
Li, Pengfei
Huang, Jianghang
Pan, Gang
THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 8823 - 8831

← 1 2 3 4 5 →