On Almost Sure Convergence Rates of Stochastic Gradient Methods

被引:0
|
作者
Liu, Jun [1 ]
Yuan, Ye [2 ,3 ]
机构
[1] Univ Waterloo, Dept Appl Math, Waterloo, ON, Canada
[2] Huazhong Univ Sci & Technol, Sch Artificial Intelligence & Automat, Wuhan, Peoples R China
[3] Huazhong Univ Sci & Technol, Sch Mech Sci & Engn, Wuhan, Peoples R China
来源
基金
加拿大自然科学与工程研究理事会;
关键词
Stochastic gradient descent; stochastic heavy-ball; stochastic Nesterov's accelerated gradient; almost sure convergence rate; OPTIMIZATION; BOUNDS;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The vast majority of convergence rates analysis for stochastic gradient methods in the literature focus on convergence in expectation, whereas trajectory-wise almost sure convergence is clearly important to ensure that any instantiation of the stochastic algorithms would converge with probability one. Here we provide a unified almost sure convergence rates analysis for stochastic gradient descent (SGD), stochastic heavy-ball (SHB), and stochastic Nesterov's accelerated gradient (SNAG) methods. We show, for the first time, that the almost sure convergence rates obtained for these stochastic gradient methods on strongly convex functions, are arbitrarily close to their optimal convergence rates possible. For non-convex objective functions, we not only show that a weighted average of the squared gradient norms converges to zero almost surely, but also the last iterates of the algorithms. We further provide last-iterate almost sure convergence rates analysis for stochastic gradient methods on general convex smooth functions, in contrast with most existing results in the literature that only provide convergence in expectation for a weighted average of the iterates.
引用
收藏
页数:21
相关论文
共 50 条
  • [31] Almost Sure Uniform Convergence of Stochastic Processes in the Dual of a Nuclear Space
    Fonseca-Mora, C. A.
    JOURNAL OF THEORETICAL PROBABILITY, 2023, 36 (01) : 1 - 26
  • [32] On the almost sure convergence of sums
    Pratelli, Luca
    Rigo, Pietro
    STATISTICS & PROBABILITY LETTERS, 2021, 172
  • [33] A REMARK ON ALMOST SURE CONVERGENCE
    LOEVE, M
    ANNALS OF MATHEMATICAL STATISTICS, 1951, 22 (01): : 142 - 142
  • [34] AN INEQUALITY AND ALMOST SURE CONVERGENCE
    KOUNIAS, EG
    WENG, TS
    ANNALS OF MATHEMATICAL STATISTICS, 1969, 40 (03): : 1091 - &
  • [35] ALMOST SURE CONVERGENCE ON CHAOSES
    Poly, Guillaume
    Zheng, Guangqu
    PROCEEDINGS OF THE AMERICAN MATHEMATICAL SOCIETY, 2019, 147 (09) : 4055 - 4065
  • [36] On rates of convergence for sample average approximations in the almost sure sense and in mean
    Banholzer, Dirk
    Fliege, Jorg
    Werner, Ralf
    MATHEMATICAL PROGRAMMING, 2022, 191 (01) : 307 - 345
  • [37] On rates of convergence for sample average approximations in the almost sure sense and in mean
    Dirk Banholzer
    Jörg Fliege
    Ralf Werner
    Mathematical Programming, 2022, 191 : 307 - 345
  • [38] Almost sure parameter estimation and convergence rates for hidden Markov models
    Elliott, RJ
    Moore, JB
    SYSTEMS & CONTROL LETTERS, 1997, 32 (04) : 203 - 207
  • [39] A note on almost sure convergence and convergence in measure
    Kriz, P.
    Stepan, J.
    COMMENTATIONES MATHEMATICAE UNIVERSITATIS CAROLINAE, 2014, 55 (01): : 29 - 40
  • [40] Almost sure convergence of two time-scale stochastic approximation algorithms
    Tadic, VB
    PROCEEDINGS OF THE 2004 AMERICAN CONTROL CONFERENCE, VOLS 1-6, 2004, : 3802 - 3807