FedDSS: A data-similarity approach for client selection in horizontal federated learning

被引:0
|
作者
Nguyen, Tuong Minh [1 ]
Poh, Kim Leng [1 ]
Chong, Shu-Ling [2 ]
Lee, Jan Hau [3 ,4 ]
机构
[1] Natl Univ Singapore, Dept Ind Syst Engn & Management, Singapore 117576, Singapore
[2] KK Womens & Childrens Hosp, Childrens Emergency, Singapore 229899, Singapore
[3] Duke NUS Med Sch, SingHlth Duke NUS Paediat Acad Clin Programme, Singapore 169857, Singapore
[4] KK Womens & Childrens Hosp, Childrens Intens Care Unit, Singapore 229899, Singapore
关键词
Federated learning; Non-i.i.d; Client selection; Data similarity; Pediatric sepsis;
D O I
10.1016/j.ijmedinf.2024.105650
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Background and objective: Federated learning (FL) is an emerging distributed learning framework allowing multiple clients (hospitals, institutions, smart devices, etc.) to collaboratively train a centralized machine learning model without disclosing personal data. It has the potential to address several healthcare challenges, including a lack of training data, data privacy, and security concerns. However, model learning under FL is affected by non-i.i.d. data, leading to severe model divergence and reduced performance due to the varying client's data distributions. To address this problem, we propose FedDSS, Federated Data Similarity Selection, a framework that uses a data-similarity approach to select clients, without compromising client data privacy. Methods: FedDSS comprises a statistical-based data similarity metric, a N-similar-neighbor network, and a network-based selection strategy. We assessed FedDSS' performance against FedAvg's in i.i.d. and non-i.i.d. settings with two public pediatric sepsis datasets (PICD and MIMICIII). Selection fairness was measured using entropy. . Simulations were repeated five times to evaluate average loss, true positive rate (TPR), and entropy. . Results: In i.i.d setting on PICD, FedDSS achieved a higher TPR starting from the 9th round and surpassing 0.6 three rounds earlier than FedAvg. On MIMICIII, FedDSS's loss decreases significantly from the 13th round, with TPR > 0.8 by the 2nd round, two rounds ahead of FedAvg (at the 4th round). In the non-i.i.d. setting, FedDSS achieved TPR > 0.7 by the 4th and > 0.8 by the 7th round, earlier than FedAvg (at the 5th and 11th rounds). In both settings, FedDSS showed reasonable fairness ( entropy of 2.2 and 2.1). Conclusion: We demonstrated that FedDSS contributes to improved learning in FL by achieving faster convergence, reaching the desired TPR with fewer communication rounds, and potentially enhancing sepsis prediction (TPR) over FedAvg.
引用
收藏
页数:9
相关论文
共 50 条
  • [41] Compressed Client Selection for Efficient Communication in Federated Learning
    Mohamed, Aissa Hadj
    Assumpcao, Nicolas R. G.
    Astudillo, Carlos A.
    de Souza, Allan M.
    Bittencourt, Luiz F.
    Villas, Leandro A.
    2023 IEEE 20TH CONSUMER COMMUNICATIONS & NETWORKING CONFERENCE, CCNC, 2023,
  • [42] Incentive Mechanism for Federated Learning With Random Client Selection
    Wu, Hongyi
    Tang, Xiaoying
    Zhang, Ying-Jun Angela
    Gao, Lin
    IEEE TRANSACTIONS ON NETWORK SCIENCE AND ENGINEERING, 2024, 11 (02): : 1922 - 1933
  • [43] Stochastic Client Selection for Federated Learning With Volatile Clients
    Huang, Tiansheng
    Lin, Weiwei
    Shen, Li
    Li, Keqin
    Zomaya, Albert Y.
    IEEE INTERNET OF THINGS JOURNAL, 2022, 9 (20) : 20055 - 20070
  • [44] Contribution-based Federated Learning client selection
    Lin, Weiwei
    Xu, Yinhai
    Liu, Bo
    Li, Dongdong
    Huang, Tiansheng
    Shi, Fang
    INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS, 2022, 37 (10) : 7235 - 7260
  • [45] FAIRNESS-AWARE CLIENT SELECTION FOR FEDERATED LEARNING
    Shi, Yuxin
    Liu, Zelei
    Shi, Zhuan
    Yu, Han
    2023 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, ICME, 2023, : 324 - 329
  • [46] A Robust Client Selection Mechanism for Federated Learning Environments
    Veiga, Rafael
    Sousa, John
    Morais, Renan
    Bastos, Lucas
    Lobato, Wellington
    Rosário, Denis
    Cerqueira, Eduardo
    Journal of the Brazilian Computer Society, 30 (01): : 444 - 455
  • [47] VFedCS: Optimizing Client Selection for Volatile Federated Learning
    Shi, Fang
    Hu, Chunchao
    Lin, Weiwei
    Fan, Lisheng
    Huang, Tiansheng
    Wu, Wentai
    IEEE INTERNET OF THINGS JOURNAL, 2022, 9 (24) : 24995 - 25010
  • [48] A Review of Client Selection Mechanisms in Heterogeneous Federated Learning
    Wang, Xiao
    Ge, Lina
    Zhang, Guifeng
    ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, ICIC 2023, PT II, 2023, 14087 : 761 - 772
  • [49] Client Selection in Federated Learning: Principles, Challenges, and Opportunities
    Fu, Lei
    Zhang, Huanle
    Gao, Ge
    Zhang, Mi
    Liu, Xin
    IEEE INTERNET OF THINGS JOURNAL, 2023, 10 (24) : 21811 - 21819
  • [50] Fast Heterogeneous Federated Learning with Hybrid Client Selection
    Song, Duanxiao
    Shen, Guangyuan
    Gao, Dehong
    Yang, Libin
    Zhou, Xukai
    Pan, Shirui
    Lou, Wei
    Zhou, Fang
    UNCERTAINTY IN ARTIFICIAL INTELLIGENCE, 2023, 216 : 2006 - 2015