ON THE DIVERGENCE OF DECENTRALIZED NONCONVEX OPTIMIZATION

Cited by: 4

Authors:

Hong, Mingyi [1]

Zeng, Siliang [1]

Zhang, Junyu [2]

Sun, Haoran [1]

Affiliations:

[1] Univ Minnesota Twin Cities, Dept ECE, Minneapolis, MN 55455 USA

[2] Natl Univ Singapore, Dept ISEM, Singapore 119007, Singapore

Source:

SIAM JOURNAL ON OPTIMIZATION | 2022 / Vol. 32 / Issue 04

Keywords:

decentralized optimization; nonconvex problems; Lipschitz continuous gradient; DISTRIBUTED OPTIMIZATION; CONVERGENCE; ALGORITHM; STRATEGIES;

DOI:

10.1137/20M1353149

Chinese Library Classification (CLC):

O29 [Applied Mathematics];

Discipline classification code:

070104;

Abstract:

In this work, we study a generic class of decentralized algorithms in which $N$ agents jointly optimize the nonconvex objective function $f(u) := \frac{1}{N}\sum_{i=1}^{N} f_i(u)$, while only communicating with their neighbors. This class of problems has become popular in modeling many signal processing and decentralized machine learning applications, and efficient algorithms have been proposed for such a type of problem. However, most of the existing decentralized algorithms require that the local function gradients $\nabla f_i$ as well as the average function gradient $\nabla f$ are Lipschitz, that is, that the local Lipschitz conditions (LLC) and the global Lipschitz condition (GLC) are satisfied. In this work, we first demonstrate the importance of the above Lipschitzness assumptions on the state-of-the-art decentralized algorithms. Specifically, by constructing a series of examples, we show that when the LLC on the local function gradients $\nabla f_i$ are not satisfied, a number of state-of-the-art decentralized algorithms diverge, even if the GLC still holds. This observation brings out a fundamental theoretical issue of the existing decentralized algorithms: their convergence conditions are strictly stronger than those of centralized algorithms such as gradient descent, which requires only the GLC. Our observation raises an important open question: How to design decentralized algorithms when the LLC, or even the GLC, is not satisfied? To address this question, we design two first-order algorithms, which are capable of computing stationary solutions of the original problem with neither the LLC nor the GLC. In particular, we show that the proposed algorithms converge sublinearly to a certain $\epsilon$-stationary solution, where the precise rate depends on various algorithmic and problem parameters. In particular, if the local functions $f_i$ are lower bounded $Q$th-order polynomials, then the rate becomes $\mathcal{O}(1/\epsilon^{Q-1})$ for $Q \geq 2$ (where the $\mathcal{O}$ notation hides some constants such as the dependency on the network topology). Such a rate is tight for the special case of $Q = 2$, where each $f_i$ satisfies the LLC. To our knowledge, this is the first attempt that studies decentralized nonconvex optimization problems with neither the LLC nor the GLC.
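The two conditions named in the abstract can be written out as follows. This is a sketch in standard notation, not taken from the paper itself: the constants $L_i$ and $L$ and the two-agent example are assumptions introduced here only to illustrate how the GLC can hold while the LLC fails.

```latex
% LLC: every local gradient is Lipschitz (L_i is an assumed symbol).
\|\nabla f_i(u) - \nabla f_i(v)\| \le L_i \,\|u - v\|
  \quad \text{for all } u, v, \quad i = 1, \dots, N.

% GLC: the gradient of the average function is Lipschitz (L is an assumed symbol).
\|\nabla f(u) - \nabla f(v)\| \le L \,\|u - v\|
  \quad \text{for all } u, v, \qquad f(u) := \frac{1}{N}\sum_{i=1}^{N} f_i(u).

% Hypothetical scalar example with N = 2 (not one of the paper's constructions):
f_1(u) = u^4, \qquad f_2(u) = u^2 - u^4
  \quad \Longrightarrow \quad f(u) = \tfrac{1}{2}u^2, \quad \nabla f(u) = u.

% The GLC holds with L = 1, but the LLC fails for f_1 because
\frac{|\nabla f_1(u) - \nabla f_1(0)|}{|u - 0|} = 4u^2 \to \infty
  \quad \text{as } |u| \to \infty.
```

In this illustration the quartic terms cancel in the average, so a centralized method sees only a quadratic, while each agent's local gradient grows cubically. Note that $f_2$ here is not lower bounded, so the example only shows the gap between the two conditions and does not fall under the polynomial rate result quoted above.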


Pages: 2879 - 2908

Page count: 30
