Zeroth-order algorithms for nonconvex-strongly-concave minimax problems with improved complexities

被引:1
|
作者
Wang, Zhongruo [1 ]
Balasubramanian, Krishnakumar [2 ]
Ma, Shiqian [1 ]
Razaviyayn, Meisam [3 ]
机构
[1] Univ Calif Davis, Dept Math, Davis, CA 95616 USA
[2] Univ Calif Davis, Dept Stat, Davis, CA USA
[3] Univ Southern Calif, Dept Ind & Syst Engn, Los Angeles, CA USA
关键词
Minimax problem; Zeroth-order algorithms; Oracle complexity; Gradient descent ascent; Stochastic algorithms; OPTIMIZATION;
D O I
10.1007/s10898-022-01160-0
中图分类号
C93 [管理学]; O22 [运筹学];
学科分类号
070105 ; 12 ; 1201 ; 1202 ; 120202 ;
摘要
In this paper, we study zeroth-order algorithms for minimax optimization problems that are nonconvex in one variable and strongly-concave in the other variable. Such minimax optimization problems have attracted significant attention lately due to their applications in modern machine learning tasks. We first consider a deterministic version of the problem. We design and analyze the Zeroth-Order Gradient Descent Ascent (ZO-GDA) algorithm, and provide improved results compared to existing works, in terms of oracle complexity. We also propose the Zeroth-Order Gradient Descent Multi-Step Ascent (ZO-GDMSA) algorithm that significantly improves the oracle complexity of ZO-GDA. We then consider stochastic versions of ZO-GDA and ZO-GDMSA, to handle stochastic nonconvex minimax problems. For this case, we provide oracle complexity results under two assumptions on the stochastic gradient: (i) the uniformly bounded variance assumption, which is common in traditional stochastic optimization, and (ii) the Strong Growth Condition (SGC), which has been known to be satisfied by modern over-parameterized machine learning models. We establish that under the SGC assumption, the complexities of the stochastic algorithms match that of deterministic algorithms. Numerical experiments are presented to support our theoretical results.
引用
收藏
页码:709 / 740
页数:32
相关论文
共 50 条
  • [1] Zeroth-order algorithms for nonconvex–strongly-concave minimax problems with improved complexities
    Zhongruo Wang
    Krishnakumar Balasubramanian
    Shiqian Ma
    Meisam Razaviyayn
    Journal of Global Optimization, 2023, 87 : 709 - 740
  • [2] Zeroth-order single-loop algorithms for nonconvex-linear minimax problems
    Jingjing Shen
    Ziqi Wang
    Zi Xu
    Journal of Global Optimization, 2023, 87 : 551 - 580
  • [3] Zeroth-order single-loop algorithms for nonconvex-linear minimax problems
    Shen, Jingjing
    Wang, Ziqi
    Xu, Zi
    JOURNAL OF GLOBAL OPTIMIZATION, 2023, 87 (2-4) : 551 - 580
  • [4] Zeroth-order single-loop algorithms for nonconvex-linear minimax problems
    Shen, Jingjing
    Wang, Ziqi
    Xu, Zi
    JOURNAL OF GLOBAL OPTIMIZATION, 2022,
  • [5] Stochastic Recursive Gradient Descent Ascent for Stochastic Nonconvex-Strongly-Concave Minimax Problems
    Luo, Luo
    Ye, Haishan
    Huang, Zhichao
    Zhang, Tong
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
  • [6] Finding Second-Order Stationary Points in Nonconvex-Strongly-Concave Minimax Optimization
    Luo, Luo
    Li, Yujun
    Chen, Cheng
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [7] Zeroth-Order Alternating Gradient Descent Ascent Algorithms for A Class of Nonconvex-Nonconcave Minimax Problems
    Xu, Zi
    Wang, Zi-Qi
    Wang, Jun-Lin
    Dai, Yu-Hong
    JOURNAL OF MACHINE LEARNING RESEARCH, 2023, 24
  • [8] An Efficient Stochastic Algorithm for Decentralized Nonconvex-Strongly-Concave Minimax Optimization
    Chen, Lesi
    Ye, Haishan
    Luo, Luo
    INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 238, 2024, 238
  • [9] Improved Zeroth-Order Variance Reduced Algorithms and Analysis for Nonconvex Optimization
    Ji, Kaiyi
    Wang, Zhe
    Zhou, Yi
    Liang, Yingbin
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 97, 2019, 97
  • [10] Zeroth-order algorithms for stochastic distributed nonconvex optimization
    Yi, Xinlei
    Zhang, Shengjun
    Yang, Tao
    Johansson, Karl H.
    AUTOMATICA, 2022, 142