Revisiting Optimal Convergence Rate for Smooth and Non-convex Stochastic Decentralized Optimization

被引:0
|
作者
Yuan, Kun [1 ,3 ]
Huang, Xinmeng [2 ]
Chen, Yiming [1 ,4 ]
Zhang, Xiaohan [2 ]
Zhang, Yingya [1 ]
Pan, Pan [1 ]
机构
[1] DAMO Acad, Alibaba Grp, Beijing, Peoples R China
[2] Univ Penn, Philadelphia, PA 19104 USA
[3] Peking Univ, Beijing, Peoples R China
[4] MetaCarbon, Beijing, Peoples R China
关键词
DISTRIBUTED OPTIMIZATION;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Decentralized optimization is effective to save communication in large-scale machine learning. Although numerous algorithms have been proposed with theoretical guarantees and empirical successes, the performance limits in decentralized optimization, especially the influence of network topology and its associated weight matrix on the optimal convergence rate, have not been fully understood. While Lu and Sa [44] have recently provided an optimal rate for non-convex stochastic decentralized optimization with weight matrices defined over linear graphs, the optimal rate with general weight matrices remains unclear. This paper revisits non-convex stochastic decentralized optimization and establishes an optimal convergence rate with general weight matrices. In addition, we also establish the optimal rate when non-convex loss functions further satisfy the Polyak-Lojasiewicz (PL) condition. Following existing lines of analysis in literature cannot achieve these results. Instead, we leverage the Ring-Lattice graph to admit general weight matrices while maintaining the optimal relation between the graph diameter and weight matrix connectivity. Lastly, we develop a new decentralized algorithm to nearly attain the above two optimal rates under additional mild conditions.
引用
收藏
页数:14
相关论文
共 50 条
  • [31] Efficient Convex Optimization for Non-convex Non-smooth Image Restoration
    Li, Xinyi
    Yuan, Jing
    Tai, Xue-Cheng
    Liu, Sanyang
    JOURNAL OF SCIENTIFIC COMPUTING, 2024, 99 (02)
  • [32] Global convergence of a curvilinear search for non-convex optimization
    Michael Bartholomew-Biggs
    Salah Beddiaf
    Bruce Christianson
    Numerical Algorithms, 2023, 92 : 2025 - 2043
  • [33] STOCHASTIC PROBLEMS OF OPTIMAL CONTROL WITH NON-CONVEX LIMITATION
    ABASHEV, FK
    KATS, IY
    PRIKLADNAYA MATEMATIKA I MEKHANIKA, 1974, 38 (03): : 409 - 416
  • [34] On the Convergence of (Stochastic) Gradient Descent with Extrapolation for Non-Convex Minimization
    Xu, Yi
    Yuan, Zhuoning
    Yang, Sen
    Jin, Rong
    Yang, Tianbao
    PROCEEDINGS OF THE TWENTY-EIGHTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2019, : 4003 - 4009
  • [35] A PROJECTION-FREE DECENTRALIZED ALGORITHM FOR NON-CONVEX OPTIMIZATION
    Wai, Hoi-To
    Scaglione, Anna
    Lafond, Jean
    Moulines, Eric
    2016 IEEE GLOBAL CONFERENCE ON SIGNAL AND INFORMATION PROCESSING (GLOBALSIP), 2016, : 475 - 479
  • [36] Private Stochastic Optimization with Large Worst-Case Lipschitz Parameter: Optimal Rates for (Non-Smooth) Convex Losses and Extension to Non-Convex Losses
    Lowy, Andrew
    Razaviyayn, Meisam
    INTERNATIONAL CONFERENCE ON ALGORITHMIC LEARNING THEORY, VOL 201, 2023, 201 : 986 - 1054
  • [37] Stochastic Network Optimization with Non-Convex Utilities and Costs
    Neely, Michael J.
    2010 INFORMATION THEORY AND APPLICATIONS WORKSHOP (ITA), 2010, : 352 - 361
  • [38] A STOCHASTIC APPROACH TO THE CONVEX OPTIMIZATION OF NON-CONVEX DISCRETE ENERGY SYSTEMS
    Burger, Eric M.
    Moura, Scott J.
    PROCEEDINGS OF THE ASME 10TH ANNUAL DYNAMIC SYSTEMS AND CONTROL CONFERENCE, 2017, VOL 3, 2017,
  • [39] Adaptive Stochastic Gradient Descent Method for Convex and Non-Convex Optimization
    Chen, Ruijuan
    Tang, Xiaoquan
    Li, Xiuting
    FRACTAL AND FRACTIONAL, 2022, 6 (12)
  • [40] Optimal and Practical Algorithms for Smooth and Strongly Convex Decentralized Optimization
    Kovalev, Dmitry
    Salim, Adil
    Richtarik, Peter
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33