Sample selection using multi-task autoencoders in federated learning with non-IID data

被引:0
|
作者
Ardic, Emre [1 ]
Genc, Yakup [1 ]
机构
[1] Gebze Tech Univ, Dept Comp Engn, TR-41400 Gebze, Turkiye
来源
ENGINEERING SCIENCE AND TECHNOLOGY-AN INTERNATIONAL JOURNAL-JESTECH | 2025年 / 61卷
关键词
Federated learning; Data valuation; Unsupervised outlier detection; Multi-task autoencoder;
D O I
10.1016/j.jestch.2024.101920
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Federated learning is a machine learning paradigm in which multiple devices collaboratively train a model under the supervision of a central server while ensuring data privacy. However, its performance is often hindered by redundant, malicious, or abnormal samples, leading to model degradation and inefficiency. To overcome these issues, we propose novel sample selection methods for image classification, employing a multitask autoencoder to estimate sample contributions through loss and feature analysis. Our approach incorporates unsupervised outlier detection, using one-class support vector machine (OCSVM), isolation forest (IF), and adaptive loss threshold (AT) methods managed by a central server to filter noisy samples on clients. We also propose a multi-class deep support vector data description (SVDD) loss controlled by a central server to enhance feature-based sample selection. We validate our methods on CIFAR10 and MNIST datasets across varying numbers of clients, non-IID distributions, and noise levels up to 40%. The results show significant accuracy improvements with loss-based sample selection, achieving gains of up to 7.02% on CIFAR10 with OCSVM and 1.83% on MNIST with AT. Additionally, our federated SVDD loss further improves feature-based sample selection, yielding accuracy gains of up to 0.99% on CIFAR10 with OCSVM. These results show the effectiveness of our methods in improving model accuracy across various client counts and noise conditions.
引用
收藏
页数:12
相关论文
共 50 条
  • [21] FedEL: Federated ensemble learning for non-iid data
    Wu, Xing
    Pei, Jie
    Han, Xian-Hua
    Chen, Yen-Wei
    Yao, Junfeng
    Liu, Yang
    Qian, Quan
    Guo, Yike
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 237
  • [22] Contractible Regularization for Federated Learning on Non-IID Data
    Chen, Zifan
    Wu, Zhe
    Wu, Xian
    Zhang, Li
    Zhao, Jie
    Yan, Yangtian
    Zheng, Yefeng
    2022 IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM), 2022, : 61 - 70
  • [23] Federated Learning With Non-IID Data in Wireless Networks
    Zhao, Zhongyuan
    Feng, Chenyuan
    Hong, Wei
    Jiang, Jiamo
    Jia, Chao
    Quek, Tony Q. S.
    Peng, Mugen
    IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2022, 21 (03) : 1927 - 1942
  • [24] Dynamic Clustering Federated Learning for Non-IID Data
    Chen, Ming
    Wu, Jinze
    Yin, Yu
    Huang, Zhenya
    Liu, Qi
    Chen, Enhong
    ARTIFICIAL INTELLIGENCE, CICAI 2022, PT III, 2022, 13606 : 119 - 131
  • [25] Data augmentation scheme for federated learning with non-IID data
    Tang L.
    Wang D.
    Liu S.
    Tongxin Xuebao/Journal on Communications, 2023, 44 (01): : 164 - 176
  • [26] Optimizing Federated Learning on Non-IID Data with Reinforcement Learning
    Wang, Hao
    Kaplan, Zakhary
    Niu, Di
    Li, Baochun
    IEEE INFOCOM 2020 - IEEE CONFERENCE ON COMPUTER COMMUNICATIONS, 2020, : 1698 - 1707
  • [27] FEDBS: Learning on Non-IID Data in Federated Learning using Batch Normalization
    Idrissi, Meryem Janati
    Berrada, Ismail
    Noubir, Guevara
    2021 IEEE 33RD INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2021), 2021, : 861 - 867
  • [28] Accelerating Federated learning on non-IID data against stragglers
    Zhang, Yupeng
    Duan, Lingjie
    Cheung, Ngai-Man
    2022 IEEE INTERNATIONAL CONFERENCE ON SENSING, COMMUNICATION, AND NETWORKING (SECON WORKSHOPS), 2022, : 43 - 48
  • [29] Inverse Distance Aggregation for Federated Learning with Non-IID Data
    Yeganeh, Yousef
    Farshad, Azade
    Navab, Nassir
    Albarqouni, Shadi
    DOMAIN ADAPTATION AND REPRESENTATION TRANSFER, AND DISTRIBUTED AND COLLABORATIVE LEARNING, DART 2020, DCL 2020, 2020, 12444 : 150 - 159
  • [30] FEDERATED PAC-BAYESIAN LEARNING ON NON-IID DATA
    Zhao, Zihao
    Liu, Yang
    Ding, Wenbo
    Zhang, Xiao-Ping
    2024 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, ICASSP 2024, 2024, : 5945 - 5949