A linearly convergent doubly stochastic Gauss-Seidel algorithm for solving linear equations and a certain class of over-parameterized optimization problems
被引:8
|
作者:
Razaviyayn, Meisam
论文数: 0引用数: 0
h-index: 0
机构:
Univ Southern Calif, Dept Ind & Syst Engn, Los Angeles, CA 90089 USAUniv Southern Calif, Dept Ind & Syst Engn, Los Angeles, CA 90089 USA
Razaviyayn, Meisam
[1
]
Hong, Mingyi
论文数: 0引用数: 0
h-index: 0
机构:
Univ Minnesota, Dept Elect & Comp Engn, Minneapolis, MN 55455 USAUniv Southern Calif, Dept Ind & Syst Engn, Los Angeles, CA 90089 USA
Hong, Mingyi
[2
]
Reyhanian, Navid
论文数: 0引用数: 0
h-index: 0
机构:
Univ Minnesota, Dept Elect & Comp Engn, Minneapolis, MN 55455 USAUniv Southern Calif, Dept Ind & Syst Engn, Los Angeles, CA 90089 USA
Reyhanian, Navid
[2
]
Luo, Zhi-Quan
论文数: 0引用数: 0
h-index: 0
机构:
Chinese Univ Hong Kong, Shenzhen Res Inst Big Data, Shenzhen, Peoples R ChinaUniv Southern Calif, Dept Ind & Syst Engn, Los Angeles, CA 90089 USA
Luo, Zhi-Quan
[3
]
机构:
[1] Univ Southern Calif, Dept Ind & Syst Engn, Los Angeles, CA 90089 USA
Gauss-Seidel algorithm;
Linear systems of equations;
Nonuniform block coordinate descent algorithm;
Over-parameterized optimization;
49xx Calculus of variations and optimal control;
optimization;
D O I:
10.1007/s10107-019-01404-0
中图分类号:
TP31 [计算机软件];
学科分类号:
081202 ;
0835 ;
摘要:
Consider the classical problem of solving a general linear system of equations Ax=b. It is well known that the (successively over relaxed) Gauss-Seidel scheme and many of its variants may not converge when A is neither diagonally dominant nor symmetric positive definite. Can we have a linearly convergent G-S type algorithm that works for anyA? In this paper we answer this question affirmatively by proposing a doubly stochastic G-S algorithm that is provably linearly convergent (in the mean square error sense) for any feasible linear system of equations. The key in the algorithm design is to introduce a nonuniform double stochastic scheme for picking the equation and the variable in each update step as well as a stepsize rule. These techniques also generalize to certain iterative alternating projection algorithms for solving the linear feasibility problem Axb with an arbitrary A, as well as high-dimensional minimization problems for training over-parameterized models in machine learning. Our results demonstrate that a carefully designed randomization scheme can make an otherwise divergent G-S algorithm converge.
机构:
Xiamen Univ, Sch Math Sci, Fujian Prov Key Lab Math Modeling & High Performa, Xiamen 361005, Peoples R ChinaXiamen Univ, Sch Math Sci, Fujian Prov Key Lab Math Modeling & High Performa, Xiamen 361005, Peoples R China
Du, Kui
Sun, Xiao-Hui
论文数: 0引用数: 0
h-index: 0
机构:
Xiamen Univ, Sch Math Sci, Xiamen 361005, Peoples R ChinaXiamen Univ, Sch Math Sci, Fujian Prov Key Lab Math Modeling & High Performa, Xiamen 361005, Peoples R China