Zeroth-order algorithms for nonconvex-strongly-concave minimax problems with improved complexities

Cited by: 1

Authors:

Wang, Zhongruo [1]

Balasubramanian, Krishnakumar [2]

Ma, Shiqian [1]

Razaviyayn, Meisam [3]

Affiliations:

[1] Univ Calif Davis, Dept Math, Davis, CA 95616 USA

[2] Univ Calif Davis, Dept Stat, Davis, CA USA

[3] Univ Southern Calif, Dept Ind & Syst Engn, Los Angeles, CA USA

Source:

JOURNAL OF GLOBAL OPTIMIZATION | 2023 / Vol. 87 / Issue 2-4

Keywords:

Minimax problem; Zeroth-order algorithms; Oracle complexity; Gradient descent ascent; Stochastic algorithms; OPTIMIZATION;

DOI:

10.1007/s10898-022-01160-0

Chinese Library Classification:

C93 [Management Science]; O22 [Operations Research];

Discipline classification codes:

070105 ; 12 ; 1201 ; 1202 ; 120202 ;

Abstract:

In this paper, we study zeroth-order algorithms for minimax optimization problems that are nonconvex in one variable and strongly concave in the other. Such minimax optimization problems have attracted significant attention lately due to their applications in modern machine learning tasks. We first consider a deterministic version of the problem. We design and analyze the Zeroth-Order Gradient Descent Ascent (ZO-GDA) algorithm and provide improved oracle complexity results compared to existing works. We also propose the Zeroth-Order Gradient Descent Multi-Step Ascent (ZO-GDMSA) algorithm, which significantly improves on the oracle complexity of ZO-GDA. We then consider stochastic versions of ZO-GDA and ZO-GDMSA to handle stochastic nonconvex minimax problems. For this case, we provide oracle complexity results under two assumptions on the stochastic gradient: (i) the uniformly bounded variance assumption, which is common in traditional stochastic optimization, and (ii) the Strong Growth Condition (SGC), which is known to be satisfied by modern over-parameterized machine learning models. We establish that under the SGC assumption, the complexities of the stochastic algorithms match those of the deterministic algorithms. Numerical experiments are presented to support our theoretical results.
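The abstract names ZO-GDA but does not give its update rule, so the following is only a minimal sketch of what a zeroth-order gradient descent ascent loop could look like. It assumes a Gaussian-smoothing, forward-difference gradient estimator and simultaneous updates; the function names `zo_grad` and `zo_gda`, the step sizes, the smoothing parameter `mu`, the number of directions `q`, and the toy objective are all illustrative choices, not taken from the paper.

```python
import numpy as np


def zo_grad(func, z, mu, q, rng):
    """Estimate the gradient of func at z from function values only.

    Assumed estimator: average of q forward differences along random
    Gaussian directions (Gaussian smoothing).
    """
    base = func(z)
    g = np.zeros_like(z)
    for _ in range(q):
        u = rng.standard_normal(z.shape)
        g += (func(z + mu * u) - base) / mu * u
    return g / q


def zo_gda(f, x0, y0, eta_x, eta_y, mu=1e-4, q=10, iters=1000, seed=0):
    """Hypothetical ZO-GDA loop: descend in x, ascend in y, no true gradients."""
    rng = np.random.default_rng(seed)
    x, y = x0.astype(float).copy(), y0.astype(float).copy()
    for _ in range(iters):
        # Both estimates are taken at the current (x, y) before either update.
        gx = zo_grad(lambda v: f(v, y), x, mu, q, rng)
        gy = zo_grad(lambda v: f(x, v), y, mu, q, rng)
        x = x - eta_x * gx  # descent step on the minimization variable
        y = y + eta_y * gy  # ascent step on the maximization variable
    return x, y


def f(x, y):
    # Toy objective (an assumption for illustration): nonconvex in x because
    # of the cosine term, and strongly concave in y with modulus 1.
    return x @ y - 0.5 * y @ y + 0.1 * np.sum(np.cos(x))


x_final, y_final = zo_gda(f, np.array([1.0, -0.5]), np.zeros(2),
                          eta_x=0.05, eta_y=0.2)
```

For this toy objective the inner maximizer is y*(x) = x, and the resulting primal function 0.5‖x‖² + 0.1 Σ cos(x_i) has its only stationary point at x = 0, so both iterates should end up near the origin. A multi-step-ascent variant in the spirit of ZO-GDMSA would presumably repeat the `y` update several times per `x` update, though the abstract does not specify the details.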

Pages: 709-740

Page count: 32
