A Theoretical Revisit to Linear Convergence for Saddle Point Problems

被引:0
|
作者
Wu, Wendi [1 ]
Zhao, Yawei [2 ]
Zhu, En [3 ]
Liu, Xinwang [3 ]
Zhang, Xingxing [4 ,5 ]
Luo, Lailong [6 ]
Wang, Shixiong [7 ]
Yin, Jianping [8 ]
机构
[1] Natl Univ Def Technol, State Key Lab High Performance Comp, Deya Rd 109, Changsha 410073, Hunan, Peoples R China
[2] China Elect Equipment Syst Engn Co, Minzhuang Rd 77, Beijing 100093, Peoples R China
[3] Natl Univ Def Technol, Sch Comp, Deya Rd 109, Changsha 410073, Hunan, Peoples R China
[4] Beijing Jiaotong Univ, Inst Informat Sci, Beijing 100044, Peoples R China
[5] Beijing Jiaotong Univ, Beijing Key Lab Adv Informat Sci & Network Techno, Beijing 100044, Peoples R China
[6] Natl Univ Def Technol, Sci & Technol Informat Syst Engn Lab, Deya Rd 109, Changsha 410073, Hunan, Peoples R China
[7] Natl Innovat Inst Def Technol, Artificial Intelligence Res Ctr, Beijing 100071, Peoples R China
[8] Dongguan Univ Technol, Dept Software Engn, Dongguan 523808, Guangdong, Peoples R China
基金
国家重点研发计划; 中国国家自然科学基金;
关键词
Saddle point problems; linear convergence; strong convexity; min-max optimization; CONVEX; OPTIMIZATION;
D O I
10.1145/3420035
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recently, convex-concave bilinear Saddle Point Problems (SPP) is widely used in lasso problems, Support Vector Machines, game theory, and so on. Previous researches have proposed many methods to solve SPP, and present their convergence rate theoretically. To achieve linear convergence, analysis in those previouse studies requires strong convexity of phi(z). But, we find the linear convergence can also be achieved even for a general convex but not strongly convex phi(z). In the article, by exploiting the strong duality of SPP, we propose a new method to solve SPP, and achieve the linear convergence. We present a new general sufficient condition to achieve linear convergence, but do not require the strong convexity of phi(z). Furthermore, a more efficient method is also proposed, and its convergence rate is analyzed in theoretical. Our analysis shows that the well conditioned phi(z) is necessary to improve the efficiency of our method. Finally, we conduct extensive empirical studies to evaluate the convergence performance of our methods.
引用
收藏
页数:17
相关论文
共 50 条