Federated Learning with GAN-Based Data Synthesis for Non-IID Clients

被引:18
|
作者
Li, Zijian [1 ]
Shao, Jiawei [1 ]
Mao, Yuyi [2 ]
Wang, Jessie Hui [3 ]
Zhang, Jun [1 ]
机构
[1] Hong Kong Univ Sci & Technol, Hong Kong, Peoples R China
[2] Hong Kong Polytech Univ, Hong Kong, Peoples R China
[3] Tsinghua Univ, Beijing, Peoples R China
来源
关键词
Federated Learning; Non-Independent and Identically Distributed (non-IID) Problem; Generative Adversarial Network (GAN);
D O I
10.1007/978-3-031-28996-5_2
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Federated learning (FL) has recently emerged as a popular privacy-preserving collaborative learning paradigm. However, it suffers from the non-independent and identically distributed (non-IID) data among clients. In this chapter, we propose a novel framework, named Synthetic Data Aided Federated Learning (SDA-FL), to resolve this non-IID challenge by sharing synthetic data. Specifically, each client pretrains a local generative adversarial network (GAN) to generate differentially private synthetic data, which are uploaded to the parameter server (PS) to construct a global shared synthetic dataset. To generate confident pseudo labels for the synthetic dataset, we also propose an iterative pseudo labeling mechanism performed by the PS. The assistance of the synthetic dataset with confident pseudo labels significantly alleviates the data heterogeneity among clients, which improves the consistency among local updates and benefits the global aggregation. Extensive experiments evidence that the proposed framework outperforms the baseline methods by a large margin in several benchmark datasets under both the supervised and semi-supervised settings.
引用
收藏
页码:17 / 32
页数:16
相关论文
共 50 条
  • [21] Contractible Regularization for Federated Learning on Non-IID Data
    Chen, Zifan
    Wu, Zhe
    Wu, Xian
    Zhang, Li
    Zhao, Jie
    Yan, Yangtian
    Zheng, Yefeng
    2022 IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM), 2022, : 61 - 70
  • [22] Federated Learning With Non-IID Data in Wireless Networks
    Zhao, Zhongyuan
    Feng, Chenyuan
    Hong, Wei
    Jiang, Jiamo
    Jia, Chao
    Quek, Tony Q. S.
    Peng, Mugen
    IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2022, 21 (03) : 1927 - 1942
  • [23] Dynamic Clustering Federated Learning for Non-IID Data
    Chen, Ming
    Wu, Jinze
    Yin, Yu
    Huang, Zhenya
    Liu, Qi
    Chen, Enhong
    ARTIFICIAL INTELLIGENCE, CICAI 2022, PT III, 2022, 13606 : 119 - 131
  • [24] Data augmentation scheme for federated learning with non-IID data
    Tang L.
    Wang D.
    Liu S.
    Tongxin Xuebao/Journal on Communications, 2023, 44 (01): : 164 - 176
  • [25] Optimizing Federated Learning on Non-IID Data with Reinforcement Learning
    Wang, Hao
    Kaplan, Zakhary
    Niu, Di
    Li, Baochun
    IEEE INFOCOM 2020 - IEEE CONFERENCE ON COMPUTER COMMUNICATIONS, 2020, : 1698 - 1707
  • [26] Federated Learning Based on Diffusion Model to Cope with Non-IID Data
    Zhao, Zhuang
    Yang, Feng
    Liang, Guirong
    PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2023, PT IX, 2024, 14433 : 220 - 231
  • [27] GAN-Enhanced Vertical Federated Learning System for WHAR with non-IID Data
    Lee, Chanmin
    Cho, Shinyoung
    Park, Hyungbin
    Park, Jihyun
    Lee, Sukyoung
    PROCEEDINGS OF 2024 IEEE/IFIP NETWORK OPERATIONS AND MANAGEMENT SYMPOSIUM, NOMS 2024, 2024,
  • [28] FEDERATED PAC-BAYESIAN LEARNING ON NON-IID DATA
    Zhao, Zihao
    Liu, Yang
    Ding, Wenbo
    Zhang, Xiao-Ping
    2024 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, ICASSP 2024, 2024, : 5945 - 5949
  • [29] Accelerating Federated learning on non-IID data against stragglers
    Zhang, Yupeng
    Duan, Lingjie
    Cheung, Ngai-Man
    2022 IEEE INTERNATIONAL CONFERENCE ON SENSING, COMMUNICATION, AND NETWORKING (SECON WORKSHOPS), 2022, : 43 - 48
  • [30] Inverse Distance Aggregation for Federated Learning with Non-IID Data
    Yeganeh, Yousef
    Farshad, Azade
    Navab, Nassir
    Albarqouni, Shadi
    DOMAIN ADAPTATION AND REPRESENTATION TRANSFER, AND DISTRIBUTED AND COLLABORATIVE LEARNING, DART 2020, DCL 2020, 2020, 12444 : 150 - 159