Stability and Generalization of the Decentralized Stochastic Gradient Descent Ascent Algorithm

Cited by: 0
Authors
Zhu, Miaoxi [1 ,2 ]
Shen, Li [3 ]
Du, Bo [1 ,2 ]
Tao, Dacheng [4 ]
Affiliations
[1] Wuhan Univ, Sch Comp Sci, Natl Engn Res Ctr Multimedia Software, Inst Artificial Intelligence, Wuhan, Peoples R China
[2] Wuhan Univ, Hubei Key Lab Multimedia & Network Commun Engn, Wuhan, Peoples R China
[3] JD Explore Acad, Beijing, Peoples R China
[4] Univ Sydney, Sydney, NSW, Australia
Funding
National Natural Science Foundation of China
DOI: Not available
Chinese Library Classification
TP18 [Artificial Intelligence Theory]
Discipline Classification Codes
081104; 0812; 0835; 1405
Abstract
The growing size of available data has attracted increasing interest in solving minimax problems in a decentralized manner for various machine learning tasks. Previous theoretical research has primarily focused on the convergence rate and communication complexity of decentralized minimax algorithms, with little attention given to their generalization. In this paper, we investigate the primal-dual generalization bound of the decentralized stochastic gradient descent ascent (D-SGDA) algorithm through the lens of algorithmic stability, under both convex-concave and nonconvex-nonconcave settings. Our theory refines algorithmic stability for the decentralized setting and shows that the decentralized structure does not destroy the stability and generalization of D-SGDA: in certain situations it generalizes as well as vanilla SGDA. Our analysis characterizes how the communication topology affects the generalization bound of D-SGDA, beyond standard factors such as sample size, learning rate, and number of iterations. We also evaluate the optimization error and balance it against the generalization gap to obtain the optimal population risk of D-SGDA in the convex-concave setting. Finally, several numerical experiments validate our theoretical findings.
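For readers who want a concrete picture of the update the abstract describes, below is a minimal sketch of a D-SGDA-style iteration on a toy problem. It is an assumption-laden illustration, not the paper's implementation: it assumes a ring topology with a doubly stochastic mixing matrix W, Gaussian noise standing in for minibatch stochasticity, and a hypothetical convex-concave objective f_i(x, y) = 0.5‖x‖² + ⟨x − a_i, y⟩ − 0.5‖y‖²; all variable names are invented for the example.

```python
# Illustrative sketch of a D-SGDA-style update; NOT the paper's exact
# algorithm or code. Assumptions: n nodes on a ring with a doubly
# stochastic mixing matrix W, simultaneous descent on the primal x and
# ascent on the dual y, gossip averaging each iteration, and a toy
# convex-concave objective f_i(x, y) = 0.5||x||^2 + <x - a_i, y> - 0.5||y||^2.
import numpy as np

rng = np.random.default_rng(0)
n, d, T, eta = 8, 5, 500, 0.05   # nodes, dimension, iterations, step size

# Ring topology: each node averages itself with its two neighbors.
W = np.zeros((n, n))
for i in range(n):
    W[i, i] = 0.5
    W[i, (i - 1) % n] = 0.25
    W[i, (i + 1) % n] = 0.25

a = rng.normal(size=(n, d))      # node-specific data defining f_i
X = np.zeros((n, d))             # per-node primal iterates
Y = np.zeros((n, d))             # per-node dual iterates

for t in range(T):
    # Stochastic local gradients (exact gradients plus noise stand in
    # for minibatch sampling): d/dx f_i = x + y, d/dy f_i = (x - a_i) - y.
    gx = X + Y + rng.normal(scale=0.01, size=(n, d))
    gy = (X - a) - Y + rng.normal(scale=0.01, size=(n, d))
    # Gossip-average neighbors' iterates, then descend in x / ascend in y.
    X = W @ X - eta * gx
    Y = W @ Y + eta * gy

# All nodes approach the saddle point x* = mean(a)/2, y* = -mean(a)/2.
print("consensus x:", np.round(X.mean(axis=0), 3))
print("consensus y:", np.round(Y.mean(axis=0), 3))
```

Whether gossip averaging is applied before or after the local gradient step varies across decentralized SGDA variants in the literature; the ordering above is one common choice, and the mixing matrix W is exactly where topology effects of the kind the paper analyzes enter the iteration.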
Pages: 35
Related Papers (50 total)
  • [21] Network Gradient Descent Algorithm for Decentralized Federated Learning. Wu, Shuyuan; Huang, Danyang; Wang, Hansheng. Journal of Business & Economic Statistics, 2023, 41(3): 806-818.
  • [22] Robust Decentralized Stochastic Gradient Descent over Unstable Networks. Zheng, Yanwei; Zhang, Liangxu; Chen, Shuzhen; Zhang, Xiao; Cai, Zhipeng; Cheng, Xiuzhen. Computer Communications, 2023, 203: 163-179.
  • [23] Decentralized Asynchronous Stochastic Gradient Descent: Convergence Rate Analysis. Bedi, Amrit Singh; Pradhan, Hrusikesha; Rajawat, Ketan. 2018 International Conference on Signal Processing and Communications (SPCOM 2018), 2018: 402-406.
  • [24] Generalization Bounds for Stochastic Gradient Descent via Localized ε-Covers. Park, Sejun; Simsekli, Umut; Erdogdu, Murat A. Advances in Neural Information Processing Systems 35 (NeurIPS 2022), 2022.
  • [25] Towards Stability and Optimality in Stochastic Gradient Descent. Toulis, Panos; Tran, Dustin; Airoldi, Edoardo M. Artificial Intelligence and Statistics (AISTATS), 2016, 51: 1290-1298.
  • [26] Global Convergence and Stability of Stochastic Gradient Descent. Patel, Vivak; Zhang, Shushu; Tian, Bowen. Advances in Neural Information Processing Systems 35 (NeurIPS 2022), 2022.
  • [27] Optimal Stochastic Gradient Descent Algorithm for Filtering. Turali, M. Yigit; Koc, Ali T.; Kozat, Suleyman S. Digital Signal Processing, 2024, 155.
  • [28] Decentralized Policy Gradient Descent Ascent for Safe Multi-Agent Reinforcement Learning. Lu, Songtao; Zhang, Kaiqing; Chen, Tianyi; Basar, Tamer; Horesh, Lior. Thirty-Fifth AAAI Conference on Artificial Intelligence (AAAI 2021), 2021, 35: 8767-8775.
  • [29] Stochastic Gradient Descent-Ascent: Unified Theory and New Efficient Methods. Beznosikov, Aleksandr; Gorbunov, Eduard; Berard, Hugo; Loizou, Nicolas. International Conference on Artificial Intelligence and Statistics (AISTATS), 2023, 206: 172-235.
  • [30] Stochastic Recursive Gradient Descent Ascent for Stochastic Nonconvex-Strongly-Concave Minimax Problems. Luo, Luo; Ye, Haishan; Huang, Zhichao; Zhang, Tong. Advances in Neural Information Processing Systems 33 (NeurIPS 2020), 2020, 33.