Efficient Distributed Transfer Learning for Large-Scale Gaussian Graphic Models

被引:0
|
作者
Zhou, Xingcai [1 ]
Zheng, Haotian [1 ]
Zhang, Haoran [2 ]
Huang, Chao [3 ]
机构
[1] Nanjing Audit Univ, Sch Stat & Data Sci, Nanjing, Jiangsu, Peoples R China
[2] Monash Univ, Fac Sci, Clayton, Vic, Australia
[3] Univ Georgia, Dept Epidemiol & Biostat, Athens, GA USA
来源
STAT | 2024年 / 13卷 / 04期
关键词
distributed learning; FDR; Gaussian graphic models; transfer learning; REGRESSION;
D O I
10.1002/sta4.70004
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
A transfer learning for the large-scale Gaussian graphical models (GGMs) is considered in a distributed architecture for fitting these target and auxiliary studies scattered across multiple sites. A distributed transfer learning algorithm, DisPower-Trans-CLIME, is proposed to learn the target GGMs by incorporating the data from similar and related auxiliary studies. The algorithm is communication efficient via the distributed power method for matrix decomposition. We show that DisPower-Trans-CLIME has a fast convergence rate comparable to the centred Trans-CLIME algorithm. A debiased DisPower-Trans-CLIME is constructed and is proved to be element-wise asymptotically normal for statistical inference. Thus, a multiple testing procedure is developed to detect edge of GGMs with false discovery rate (FDR) control. Extensive simulation experiments have been conducted to demonstrate superior numerical performance of our proposed learning algorithm on estimation and edge detection. It is also applied to infer the regional connection networks in brain regions of interest (ROIs) based on a target hospital site by leveraging the graph from multiple other hospital sites for autism spectrum disorder. We observe that the degrees of regional connectivity in the right brain are balanced, while the ones of the left brain region are extremely uneven because of the presence of multiple strong connections. However, the specific association, between the degrees of connectivity of these regions and ASD disease, is unknown.
引用
收藏
页数:14
相关论文
共 50 条
  • [1] Efficient Distributed Machine Learning for Large-scale Models by Reducing Redundant Communication
    Yokoyama, Harumichi
    Araki, Takuya
    2017 IEEE SMARTWORLD, UBIQUITOUS INTELLIGENCE & COMPUTING, ADVANCED & TRUSTED COMPUTED, SCALABLE COMPUTING & COMMUNICATIONS, CLOUD & BIG DATA COMPUTING, INTERNET OF PEOPLE AND SMART CITY INNOVATION (SMARTWORLD/SCALCOM/UIC/ATC/CBDCOM/IOP/SCI), 2017,
  • [2] Transfer Learning in Large-Scale Gaussian Graphical Models with False Discovery Rate Control
    Li, Sai
    Cai, T. Tony
    Li, Hongzhe
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2023, 118 (543) : 2171 - 2183
  • [3] Gaussian process decentralized data fusion meets transfer learning in large-scale distributed cooperative perception
    Ouyang, Ruofei
    Low, Bryan Kian Hsiang
    AUTONOMOUS ROBOTS, 2020, 44 (3-4) : 359 - 376
  • [4] On Efficient Training of Large-Scale Deep Learning Models
    Shen, Li
    Sun, Yan
    Yu, Zhiyuan
    Ding, Liang
    Tian, Xinmei
    Tao, Dacheng
    ACM COMPUTING SURVEYS, 2025, 57 (03)
  • [5] Gaussian Process Decentralized Data Fusion Meets Transfer Learning in Large-Scale Distributed Cooperative Perception
    Ouyang, Ruofei
    Low, Kian Hsiang
    THIRTY-SECOND AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTIETH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / EIGHTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2018, : 3876 - 3883
  • [6] Gaussian process decentralized data fusion meets transfer learning in large-scale distributed cooperative perception
    Ruofei Ouyang
    Bryan Kian Hsiang Low
    Autonomous Robots, 2020, 44 : 359 - 376
  • [7] Efficient Distributed Learning for Large-Scale Expectile Regression With Sparsity
    Pan, Yingli
    Liu, Zhan
    IEEE ACCESS, 2021, 9 (09): : 64732 - 64746
  • [8] Distributed Learning for Large-Scale Models at Edge With Privacy Protection
    Yuan, Yuan
    Chen, Shuzhen
    Yu, Dongxiao
    Zhao, Zengrui
    Zou, Yifei
    Cui, Lizhen
    Cheng, Xiuzhen
    IEEE TRANSACTIONS ON COMPUTERS, 2024, 73 (04) : 1060 - 1070
  • [9] Learning large-scale graphical Gaussian models from genomic data
    Schäfer, J
    Strimmer, K
    SCIENCE OF COMPLEX NETWORKS: FROM BIOLOGY TO THE INTERNET AND WWW, 2005, 776 : 263 - 276
  • [10] Efficient Large-Scale Structured Learning
    Branson, Steve
    Beijbom, Oscar
    Belongie, Serge
    2013 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2013, : 1806 - 1813