Variational Principles for Mirror Descent and Mirror Langevin Dynamics

被引:0
|
作者
Tzen, Belinda [1 ]
Raj, Anant [2 ,3 ]
Raginsky, Maxim [3 ]
Bach, Francis [2 ]
机构
[1] Columbia Univ, Dept Stat, New York, NY 10027 USA
[2] PSL Res Univ, Ecole Normale Super, INRIA, F-75006 Paris, France
[3] Univ Illinois, Dept Elect & Comp Engn, Urbana, IL 61801 USA
来源
关键词
Mirrors; Trajectory; Optimal control; Dynamical systems; Costs; Closed loop systems; Geometry; Optimization; optimal control; stochastic optimal control;
D O I
10.1109/LCSYS.2023.3274069
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Mirror descent, introduced by Nemirovski and Yudin in the 1970s, is a primal-dual convex optimization method that can be tailored to the geometry of the optimization problem at hand through the choice of a strongly convex potential function. It arises as a basic primitive in a variety of applications, including large-scale optimization, machine learning, and control. This letter proposes a variational formulation of mirror descent and of its stochastic variant, mirror Langevin dynamics. The main idea, inspired by the classic work of Brezis and Ekeland on variational principles for gradient flows, is to show that mirror descent emerges as a closed-loop solution for a certain optimal control problem, and the Bellman value function is given by the Bregman divergence between the initial condition and the global minimizer of the objective function.
引用
收藏
页码:1542 / 1547
页数:6
相关论文
共 50 条
  • [21] Provable Phase Retrieval with Mirror Descent
    Godeme, Jean-Jacques
    Fadili, Jalal
    Buet, Xavier
    Zerrad, Myriam
    Lequime, Michel
    Amra, Claude
    SIAM JOURNAL ON IMAGING SCIENCES, 2023, 16 (03): : 1106 - 1141
  • [22] Solving MRF Minimization by Mirror Descent
    Luong, Duy V. N.
    Parpas, Panos
    Rueckert, Daniel
    Rustem, Berc
    ADVANCES IN VISUAL COMPUTING, ISVC 2012, PT I, 2012, 7431 : 587 - 598
  • [23] Conformal mirror descent with logarithmic divergences
    Kainth, Amanjit Singh
    Wong, Ting-Kam Leonard
    Rudzicz, Frank
    INFORMATION GEOMETRY, 2024, 7 (SUPPL1) : 303 - 327
  • [24] Unifying mirror descent and dual averaging
    Anatoli Juditsky
    Joon Kwon
    Éric Moulines
    Mathematical Programming, 2023, 199 : 793 - 830
  • [25] An Accelerated Stochastic Mirror Descent Method
    Jiang, Bo-Ou
    Yuan, Ya-Xiang
    JOURNAL OF THE OPERATIONS RESEARCH SOCIETY OF CHINA, 2024, 12 (03) : 549 - 571
  • [26] Parameter-free Mirror Descent
    Jacobsen, Andrew
    Cutkosky, Ashok
    CONFERENCE ON LEARNING THEORY, VOL 178, 2022, 178
  • [27] The Mirror Langevin Algorithm Converges with Vanishing Bias
    Li, Ruilin
    Tao, Molei
    Vempala, Santosh S.
    Wibisono, Andre
    INTERNATIONAL CONFERENCE ON ALGORITHMIC LEARNING THEORY, VOL 167, 2022, 167
  • [28] Mirror Descent Learning in Continuous Games
    Zhou, Zhengyuan
    Mertikopoulos, Panayotis
    Moustakas, Aris L.
    Bambos, Nicholas
    Glynn, Peter
    2017 IEEE 56TH ANNUAL CONFERENCE ON DECISION AND CONTROL (CDC), 2017,
  • [29] Adaptive Mirror Descent for Constrained Optimization
    Bayandina, Anastasia
    2017 CONSTRUCTIVE NONSMOOTH ANALYSIS AND RELATED TOPICS (DEDICATED TO THE MEMORY OF V.F. DEMYANOV) (CNSA), 2017, : 36 - 39
  • [30] Policy Optimization with Stochastic Mirror Descent
    Yang, Long
    Zhang, Yu
    Zheng, Gang
    Zheng, Qian
    Li, Pengfei
    Huang, Jianghang
    Pan, Gang
    THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 8823 - 8831