Asynchronous Distributed ADMM for Learning with Large-Scale and High-Dimensional Sparse Data Set

被引:2
|
作者
Wang, Dongxia [1 ]
Lei, Yongmei [1 ]
机构
[1] Shanghai Univ, Sch Comp Engn & Sci, 333 Nanchen Rd, Shanghai 200436, Peoples R China
基金
中国国家自然科学基金;
关键词
GA-ADMM; General form consensus; Bounded asynchronous; Non-convex;
D O I
10.1007/978-3-030-36405-2_27
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
The distributed alternating direction method of multipliers is an effective method to solve large-scale machine learning. At present, most distributed ADMM algorithms need to transfer the entire model parameter in the communication, which leads to high communication cost, especially when the features of model parameter is very large. In this paper, an asynchronous distributed ADMM algorithm (GA-ADMM) based on general form consensus is proposed. First, the GA-ADMM algorithm filters the information transmitted between nodes by analyzing the characteristics of high-dimensional sparse data set: only associated features, rather than all features of the model, need to be transmitted between workers and the master, thus greatly reducing the communication cost. Second, the bounded asynchronous communication protocol is used to further improve the performance of the algorithm. The convergence of the algorithm is also analyzed theoretically when the objective function is non-convex. Finally, the algorithm is tested on the cluster supercomputer "Ziqiang 4000". The experiments show that the GA-ADMM algorithm converges when appropriate parameters are selected, the GA-ADMM algorithm requires less system time to reach convergence than the AD-ADMM algorithm, and the accuracy of these two algorithms is approximate.
引用
收藏
页码:259 / 274
页数:16
相关论文
共 50 条
  • [31] Spectral clustering based on iterative optimization for large-scale and high-dimensional data
    Zhao, Yang
    Yuan, Yuan
    Nie, Feiping
    Wang, Qi
    NEUROCOMPUTING, 2018, 318 : 227 - 235
  • [32] Supervised Papers Classification on Large-Scale High-Dimensional Data with Apache Spark
    Akritidis, Leonidas
    Bozanis, Panayiotis
    Fevgas, Athanasios
    2018 16TH IEEE INT CONF ON DEPENDABLE, AUTONOM AND SECURE COMP, 16TH IEEE INT CONF ON PERVAS INTELLIGENCE AND COMP, 4TH IEEE INT CONF ON BIG DATA INTELLIGENCE AND COMP, 3RD IEEE CYBER SCI AND TECHNOL CONGRESS (DASC/PICOM/DATACOM/CYBERSCITECH), 2018, : 987 - 994
  • [33] BFGS-ADMM for Large-Scale Distributed Optimization
    Li, Yichuan
    Gong, Yonghai
    Freris, Nikolaos M.
    Voulgaris, Petros
    Stipanovic, Dusan
    2021 60TH IEEE CONFERENCE ON DECISION AND CONTROL (CDC), 2021, : 1689 - 1694
  • [34] High-dimensional and large-scale phenotyping of yeast mutants
    Ohya, Y
    Sese, J
    Yukawa, M
    Sano, F
    Nakatani, Y
    Saito, TL
    Saka, A
    Fukuda, T
    Ishihara, S
    Oka, S
    Suzuki, G
    Watanabe, M
    Hirata, A
    Ohtani, M
    Sawai, H
    Fraysse, N
    Latgé, JP
    François, JM
    Aebi, M
    Tanaka, S
    Muramatsu, S
    Araki, H
    Sonoike, K
    Nogami, S
    Morishita, S
    PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2005, 102 (52) : 19015 - 19020
  • [35] LARGE-SCALE HIGH-DIMENSIONAL CLUSTERING WITH FAST SKETCHING
    Chatalic, Antoine
    Gribonval, Remi
    Keriven, Nicolas
    2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 4714 - 4718
  • [36] Discovering a sparse set of pairwise discriminating features in high-dimensional data
    Melton, Samuel
    Ramanathan, Sharad
    BIOINFORMATICS, 2021, 37 (02) : 202 - 212
  • [37] Distributed Learning of Deep Sparse Neural Networks for High-dimensional Classification
    Garg, Shweta
    Krishnan, R.
    Jagannathan, S.
    Samaranayake, V. A.
    2018 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2018, : 1587 - 1592
  • [38] Learning from high-dimensional cyber-physical data streams: a case of large-scale smart grid
    Hassani, Hossein
    Hallaji, Ehsan
    Razavi-Far, Roozbeh
    Saif, Mehrdad
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2025, 16 (03) : 1819 - 1831
  • [39] Sparse Learning of the Disease Severity Score for High-Dimensional Data
    Stojkovic, Ivan
    Obradovic, Zoran
    COMPLEXITY, 2017,
  • [40] On the challenges of learning with inference networks on sparse, high-dimensional data
    Krishnan, Rahul G.
    Liang, Dawen
    Hoffman, Matthew D.
    INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 84, 2018, 84