Sample selection using multi-task autoencoders in federated learning with non-IID data

被引:0
|
作者
Ardic, Emre [1 ]
Genc, Yakup [1 ]
机构
[1] Gebze Tech Univ, Dept Comp Engn, TR-41400 Gebze, Turkiye
关键词
Federated learning; Data valuation; Unsupervised outlier detection; Multi-task autoencoder;
D O I
10.1016/j.jestch.2024.101920
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Federated learning is a machine learning paradigm in which multiple devices collaboratively train a model under the supervision of a central server while ensuring data privacy. However, its performance is often hindered by redundant, malicious, or abnormal samples, leading to model degradation and inefficiency. To overcome these issues, we propose novel sample selection methods for image classification, employing a multitask autoencoder to estimate sample contributions through loss and feature analysis. Our approach incorporates unsupervised outlier detection, using one-class support vector machine (OCSVM), isolation forest (IF), and adaptive loss threshold (AT) methods managed by a central server to filter noisy samples on clients. We also propose a multi-class deep support vector data description (SVDD) loss controlled by a central server to enhance feature-based sample selection. We validate our methods on CIFAR10 and MNIST datasets across varying numbers of clients, non-IID distributions, and noise levels up to 40%. The results show significant accuracy improvements with loss-based sample selection, achieving gains of up to 7.02% on CIFAR10 with OCSVM and 1.83% on MNIST with AT. Additionally, our federated SVDD loss further improves feature-based sample selection, yielding accuracy gains of up to 0.99% on CIFAR10 with OCSVM. These results show the effectiveness of our methods in improving model accuracy across various client counts and noise conditions.
引用
收藏
页数:12
相关论文
共 50 条
  • [1] Federated Multi-Task Learning on Non-IID Data Silos: An Experimental Study
    Yang, Yuwen
    Lu, Yuxiang
    Huang, Suizhi
    Sirejiding, Shalayiding
    Lu, Hongtao
    Ding, Yue
    PROCEEDINGS OF THE 4TH ANNUAL ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, ICMR 2024, 2024, : 684 - 693
  • [2] Federated learning on non-IID data: A survey
    Zhu, Hangyu
    Xu, Jinjin
    Liu, Shiqing
    Jin, Yaochu
    NEUROCOMPUTING, 2021, 465 : 371 - 390
  • [3] FedNSE: Optimal Node Selection for Federated Learning with Non-IID Data
    Bansal, Sourav
    Bansal, Manav
    Verma, Rohit
    Shorey, Rajeev
    Saran, Huzur
    2023 15TH INTERNATIONAL CONFERENCE ON COMMUNICATION SYSTEMS & NETWORKS, COMSNETS, 2023,
  • [4] Adaptive Federated Learning With Non-IID Data
    Zeng, Yan
    Mu, Yuankai
    Yuan, Junfeng
    Teng, Siyuan
    Zhang, Jilin
    Wan, Jian
    Ren, Yongjian
    Zhang, Yunquan
    COMPUTER JOURNAL, 2023, 66 (11): : 2758 - 2772
  • [5] Federated Learning With Taskonomy for Non-IID Data
    Jamali-Rad, Hadi
    Abdizadeh, Mohammad
    Singh, Anuj
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 34 (11) : 8719 - 8730
  • [6] Federated Learning With Non-IID Data: A Survey
    Lu, Zili
    Pan, Heng
    Dai, Yueyue
    Si, Xueming
    Zhang, Yan
    IEEE INTERNET OF THINGS JOURNAL, 2024, 11 (11): : 19188 - 19209
  • [7] A Survey of Federated Learning on Non-IID Data
    HAN Xuming
    GAO Minghan
    WANG Limin
    HE Zaobo
    WANG Yanze
    ZTECommunications, 2022, 20 (03) : 17 - 26
  • [8] Non-IID Federated Learning
    Cao, Longbing
    IEEE INTELLIGENT SYSTEMS, 2022, 37 (02) : 14 - 15
  • [9] Differentially private federated learning with non-IID data
    Cheng, Shuyan
    Li, Peng
    Wang, Ruchuan
    Xu, He
    COMPUTING, 2024, 106 (07) : 2459 - 2488
  • [10] Personalized Federated Learning Algorithm with Adaptive Clustering for Non-IID IoT Data Incorporating Multi-Task Learning and Neural Network Model Characteristics
    Hsu, Hua-Yang
    Keoy, Kay Hooi
    Chen, Jun-Ru
    Chao, Han-Chieh
    Lai, Chin-Feng
    SENSORS, 2023, 23 (22)