On Almost Sure Convergence Rates of Stochastic Gradient Methods

Cited by: 0
Authors
Liu, Jun [1 ]
Yuan, Ye [2 ,3 ]
Affiliations
[1] Univ Waterloo, Dept Appl Math, Waterloo, ON, Canada
[2] Huazhong Univ Sci & Technol, Sch Artificial Intelligence & Automat, Wuhan, Peoples R China
[3] Huazhong Univ Sci & Technol, Sch Mech Sci & Engn, Wuhan, Peoples R China
Funding
Natural Sciences and Engineering Research Council of Canada (NSERC);
Keywords
Stochastic gradient descent; stochastic heavy-ball; stochastic Nesterov's accelerated gradient; almost sure convergence rate; OPTIMIZATION; BOUNDS;
DOI
Not available
CLC Classification
TP18 [Artificial Intelligence Theory];
Subject Classification
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
The vast majority of convergence rate analyses for stochastic gradient methods in the literature focus on convergence in expectation, whereas trajectory-wise almost sure convergence is clearly important for ensuring that any individual instantiation of a stochastic algorithm converges with probability one. Here we provide a unified almost sure convergence rate analysis for the stochastic gradient descent (SGD), stochastic heavy-ball (SHB), and stochastic Nesterov's accelerated gradient (SNAG) methods. We show, for the first time, that the almost sure convergence rates obtained for these methods on strongly convex functions are arbitrarily close to their optimal convergence rates. For non-convex objective functions, we show not only that a weighted average of the squared gradient norms converges to zero almost surely, but also that the last iterates of the algorithms do. We further provide a last-iterate almost sure convergence rate analysis for stochastic gradient methods on general convex smooth functions, in contrast with most existing results, which only establish convergence in expectation for a weighted average of the iterates.
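As a minimal illustration of the three methods the abstract names, the sketch below runs a single noisy trajectory of SGD, SHB, and SNAG on the strongly convex quadratic f(x) = ½‖x‖², whose unique minimizer is x* = 0. Almost sure convergence concerns exactly such individual trajectories: each run should approach x* with probability one. The diminishing step size αₜ = 1/(t+1), the momentum parameter β = 0.9, and the noise level are illustrative assumptions for this demo, not the schedules or constants analyzed in the paper.

```python
import numpy as np

rng = np.random.default_rng(0)
dim, steps, noise_std, beta = 2, 20000, 0.1, 0.9

def noisy_grad(x):
    """Stochastic gradient of f(x) = 0.5 * ||x||^2 with additive Gaussian noise."""
    return x + noise_std * rng.standard_normal(dim)

x_sgd = np.ones(dim)                 # SGD iterate
x_shb = np.ones(dim)                 # SHB iterate
v_shb = np.zeros(dim)                # SHB momentum buffer
y_snag = np.ones(dim)                # SNAG extrapolation point
x_snag = np.ones(dim)                # SNAG iterate

for t in range(steps):
    alpha = 1.0 / (t + 1)            # illustrative diminishing step size
    # SGD: x_{t+1} = x_t - alpha_t * g(x_t)
    x_sgd = x_sgd - alpha * noisy_grad(x_sgd)
    # SHB: x_{t+1} = x_t - alpha_t * g(x_t) + beta * (x_t - x_{t-1})
    v_shb = beta * v_shb - alpha * noisy_grad(x_shb)
    x_shb = x_shb + v_shb
    # SNAG: gradient step at the extrapolated point, then extrapolate again
    x_new = y_snag - alpha * noisy_grad(y_snag)
    y_snag = x_new + beta * (x_new - x_snag)
    x_snag = x_new

# Each single trajectory should end up close to the minimizer x* = 0.
print(np.linalg.norm(x_sgd), np.linalg.norm(x_shb), np.linalg.norm(x_snag))
```

With a fixed seed this is, of course, only one realization; the paper's point is that (under its assumptions) such convergence holds along almost every noise sequence, not merely on average.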
Pages: 21