GFL: Federated Learning on Non-IID data via Privacy-preserving Synthetic data

被引:9
|
作者
Cheng, Yihang [1 ]
Zhang, Lan [1 ,2 ]
Li, Anran [3 ]
机构
[1] Univ Sci & Technol China, Sch Comp Sci & Technol, Hefei, Peoples R China
[2] Hefei Comprehens Natl Sci Ctr, Inst Artificial Intelligence, Hefei, Peoples R China
[3] Nanyang Technol Univ, Singapore, Singapore
基金
国家重点研发计划;
关键词
Federated Learning; Non-IID; Membership Inference Attack;
D O I
10.1109/PERCOM56429.2023.10099110
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Federated learning (FL) enables large amounts of participants to construct a global learning model, while storing training data privately at local client devices. A fundamental issue in FL systems is the susceptibility to the highly skewed distributed data. A series of methods have been proposed to mitigate the Non-IID problem by limiting the distances between local models and the global model, but they cannot address the root cause of skewed data distribution eventually. Some methods share extra samples from the server to clients, which requires comprehensive data collection by the server and may raise potential privacy risks. In this work, we propose an efficient and adaptive framework, named Generative Federated Learning (GFL), to solve the skewed data problem in FL systems in a privacy-friendly way. We introduce Generative Adversarial Networks (GAN) into FL to generate synthetic data, which can be used by the server to balance data distributions. To keep the distribution and membership of clients' data private, the synthetic samples are generated with random distributions and protected by a differential privacy mechanism. The results show that GFL significantly outperforms existing approaches in terms of achieving more accurate global models (e.g., 17%-50% higher accuracy) as well as building global models with faster convergence speed without increasing much computation or communication costs.
引用
收藏
页码:61 / 70
页数:10
相关论文
共 50 条
  • [1] Privacy-preserving clustering federated learning for non-IID data
    Luo, Guixun
    Chen, Naiyue
    He, Jiahuan
    Jin, Bingwei
    Zhang, Zhiyuan
    Li, Yidong
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2024, 154 : 384 - 395
  • [2] Privacy-Preserving Federated Graph Neural Network Learning on Non-IID Graph Data
    Zhang K.
    Cai Z.
    Seo D.
    Wireless Communications and Mobile Computing, 2023, 2023
  • [3] Privacy-preserving Blockchain-based Global Data Sharing for Federated Learning with Non-IID Data
    Lian, Zhuotao
    Zeng, Qingkui
    Su, Chunhua
    2022 IEEE 42ND INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING SYSTEMS WORKSHOPS (ICDCSW), 2022, : 193 - 198
  • [4] Privacy-Preserving Federated Learning Against Label-Flipping Attacks on Non-IID Data
    Shen, Xicong
    Liu, Ying
    Li, Fu
    Li, Chunguang
    IEEE INTERNET OF THINGS JOURNAL, 2024, 11 (01): : 1241 - 1255
  • [5] Privacy-Preserving Asynchronous Federated Learning Under Non-IID Settings
    Miao, Yinbin
    Kuang, Da
    Li, Xinghua
    Xu, Shujiang
    Li, Hongwei
    Choo, Kim-Kwang Raymond
    Deng, Robert H.
    IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2024, 19 : 5828 - 5841
  • [6] Blockchain-Enabled Federated Learning for Privacy-Preserving Non-IID Data Sharing in Industrial Internet
    Wang, Qiuyan
    Dong, Haibing
    Huang, Yongfei
    Liu, Zenglei
    Gou, Yundong
    CMC-COMPUTERS MATERIALS & CONTINUA, 2024, 80 (02): : 1967 - 1983
  • [7] Privacy-Enhanced Federated Learning for Non-IID Data
    Tan, Qingjie
    Wu, Shuhui
    Tao, Yuanhong
    MATHEMATICS, 2023, 11 (19)
  • [8] Efficient privacy-preserving ML for IoT: Cluster-based split federated learning scheme for non-IID data
    Arafeh, Mohamad
    Wazzeh, Mohamad
    Sami, Hani
    Ould-Slimane, Hakima
    Talhi, Chamseddine
    Mourad, Azzam
    Otrok, Hadi
    JOURNAL OF NETWORK AND COMPUTER APPLICATIONS, 2025, 236
  • [9] Clustered Federated Multitask Learning on Non-IID Data With Enhanced Privacy
    Shu, Jiangang
    Yang, Tingting
    Liao, Xinying
    Chen, Farong
    Xiao, Yao
    Yang, Kan
    Jia, Xiaohua
    IEEE INTERNET OF THINGS JOURNAL, 2023, 10 (04) : 3453 - 3467
  • [10] Federated learning on non-IID data: A survey
    Zhu, Hangyu
    Xu, Jinjin
    Liu, Shiqing
    Jin, Yaochu
    NEUROCOMPUTING, 2021, 465 : 371 - 390