Large-Scale Non-convex Stochastic Constrained Distributionally Robust Optimization

被引:0
|
作者
Zhang, Qi [1 ]
Zhou, Yi [2 ]
Prater-Bennette, Ashley [3 ]
Shen, Lixin [4 ]
Zou, Shaofeng [1 ]
机构
[1] Univ Buffalo, Buffalo, NY 14260 USA
[2] Univ Utah, Salt Lake City, UT USA
[3] Air Force Res Lab, Wright Patterson AFB, OH USA
[4] Syracuse Univ, Syracuse, NY USA
基金
美国国家科学基金会;
关键词
DIVERGENCE;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Distributionally robust optimization (DRO) is a powerful framework for training robust models against data distribution shifts. This paper focuses on constrained DRO, which has an explicit characterization of the robustness level. Existing studies on constrained DRO mostly focus on convex loss function, and exclude the practical and challenging case with non-convex loss function, e.g., neural network. This paper develops a stochastic algorithm and its performance analysis for non-convex constrained DRO. The computational complexity of our stochastic algorithm at each iteration is independent of the overall dataset size, and thus is suitable for large-scale applications. We focus on the general Cressie-Read family divergence defined uncertainty set which includes chi(2)-divergences as a special case. We prove that our algorithm finds an epsilon-stationary point with an improved computational complexity than existing methods. Our method also applies to the smoothed conditional value at risk (CVaR) DRO.
引用
收藏
页码:8217 / 8225
页数:9
相关论文
共 50 条
  • [21] Natasha: Faster Non-Convex Stochastic Optimization via Strongly Non-Convex Parameter
    Allen-Zhu, Zeyuan
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 70, 2017, 70
  • [22] Distributionally Robust Convex Optimization
    Wiesemann, Wolfram
    Kuhn, Daniel
    Sim, Melvyn
    OPERATIONS RESEARCH, 2014, 62 (06) : 1358 - 1376
  • [23] On Graduated Optimization for Stochastic Non-Convex Problems
    Hazan, Elad
    Levy, Kfir Y.
    Shalev-Shwartz, Shai
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 48, 2016, 48
  • [24] Lower bounds for non-convex stochastic optimization
    Yossi Arjevani
    Yair Carmon
    John C. Duchi
    Dylan J. Foster
    Nathan Srebro
    Blake Woodworth
    Mathematical Programming, 2023, 199 : 165 - 214
  • [25] Lower bounds for non-convex stochastic optimization
    Arjevani, Yossi
    Carmon, Yair
    Duchi, John C.
    Foster, Dylan J.
    Srebro, Nathan
    Woodworth, Blake
    MATHEMATICAL PROGRAMMING, 2023, 199 (1-2) : 165 - 214
  • [26] Distributed Computational Framework for Large-Scale Stochastic Convex Optimization
    Rostampour, Vahab
    Keviczky, Tamas
    ENERGIES, 2021, 14 (01)
  • [27] LARGE-SCALE NONCONVEX STOCHASTIC OPTIMIZATION BY DOUBLY STOCHASTIC SUCCESSIVE CONVEX APPROXIMATION
    Mokhtari, Aryan
    Koppel, Alec
    Scutari, Gesualdo
    Ribeiro, Alejandro
    2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 4701 - 4705
  • [28] DOUBLE SMOOTHING TECHNIQUE FOR LARGE-SCALE LINEARLY CONSTRAINED CONVEX OPTIMIZATION
    Devolder, Olivier
    Glineur, Francois
    Nesterov, Yurii
    SIAM JOURNAL ON OPTIMIZATION, 2012, 22 (02) : 702 - 727
  • [29] Stochastic First-Order Algorithms for Constrained Distributionally Robust Optimization
    Im, Hyungki
    Grigas, Paul
    INFORMS JOURNAL ON COMPUTING, 2024,
  • [30] Stochastic Network Optimization with Non-Convex Utilities and Costs
    Neely, Michael J.
    2010 INFORMATION THEORY AND APPLICATIONS WORKSHOP (ITA), 2010, : 352 - 361