Vertical federated learning based on data subset representation for healthcare application

被引:0
|
作者
Shi, Yukun [1 ]
Zhang, Jilin [1 ]
Xue, Meiting [1 ]
Zeng, Yan [2 ]
Jia, Gangyong [2 ]
Yu, Qihong [2 ]
Li, Miaoqi [2 ]
机构
[1] Hangzhou Dianzi Univ, Sch Cyberspace, Hangzhou 310018, Peoples R China
[2] Hangzhou Dianzi Univ, Sch Comp Sci & Technol, Hangzhou 310018, Peoples R China
基金
中国国家自然科学基金;
关键词
Vertical federated learning; Latent feature representation; Smart healthcare; Privacy preservation;
D O I
10.1016/j.cmpb.2025.108623
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Background and Objective : Artificial intelligence is increasingly essential for disease classification and clinical diagnosis tasks in healthcare. Given the strict privacy needs of healthcare data, Vertical Federated Learning (VFL) has been introduced. VFL allows multiple hospitals to collaboratively train models on vertically partitioned data, where each holds only the patient's partial data features, thus maintaining patient confidentiality. However, VFL applications in healthcare scenarios with fewer samples and labels are challenging because existing methods heavily depend on labeled samples and do not consider the intrinsic connections among the data across hospitals. Methods : This paper proposes FedRL, a representation-based VFL method that enhances the performance of downstream tasks by utilizing aligned data for federated representation pretraining. The proposed method creates the same feature dimensions subsets by splitting the local data, exploiting the relationships among these subsets, constructing a bespoke loss function, and collaboratively training a representation model to these subsets across all participating hospitals. This model captures the latent representations of the global data, which are then applied to the downstream classification tasks. Results and Conclusion : The proposed FedRL method was validated through experiments on three healthcare datasets. The results demonstrate that the proposed method outperforms several existing methods across three performance metrics. Specifically, FedRL achieves average improvements of 4.7%, 5.6%, and 4.8% in accuracy, AUC, and F1-score, respectively, compared to current methods. In addition, FedRL demonstrates greater robustness and consistent performance in scenarios with limited labeled samples, thereby confirming its effectiveness and potential use in healthcare data analysis.
引用
收藏
页数:11
相关论文
共 50 条
  • [21] Federated learning for preserving data privacy in collaborative healthcare research
    Loftus, Tyler J.
    Ruppert, Matthew M.
    Shickel, Benjamin
    Ozrazgat-Baslanti, Tezcan
    Balch, Jeremy A.
    Efron, Philip A.
    Upchurch, Gilbert R.
    Rashidi, Parisa
    Tignanelli, Christopher
    Bian, Jiang
    Bihorac, Azra
    DIGITAL HEALTH, 2022, 8
  • [22] Privacy-Preserving Federated Learning Model for Healthcare Data
    Ul Islam, Tanzir
    Ghasemi, Reza
    Mohammed, Noman
    2022 IEEE 12TH ANNUAL COMPUTING AND COMMUNICATION WORKSHOP AND CONFERENCE (CCWC), 2022, : 281 - 287
  • [23] DFedXGB: An XGB Vertical Federated Learning Framework with Data Desensitization
    Yang, Qing
    Tian, Youliang
    Xiong, Jinbo
    2023 IEEE 22ND INTERNATIONAL CONFERENCE ON TRUST, SECURITY AND PRIVACY IN COMPUTING AND COMMUNICATIONS, TRUSTCOM, BIGDATASE, CSE, EUC, ISCI 2023, 2024, : 157 - 164
  • [24] Explainable federated learning scheme for secure healthcare data sharing
    Zhao, Liutao
    Xie, Haoran
    Zhong, Lin
    Wang, Yujue
    HEALTH INFORMATION SCIENCE AND SYSTEMS, 2024, 12 (01):
  • [25] Model Optimization Method Based on Vertical Federated Learning
    Yang, Kuihe
    Song, Ziying
    Zhang, Yingchao
    Zhou, Yufan
    Sun, Xiaohan
    Wang, Jianxuan
    2021 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS), 2021,
  • [26] Architecture-Based FedAvg for Vertical Federated Learning
    Casella, Bruno
    Fonio, Samuele
    16TH IEEE/ACM INTERNATIONAL CONFERENCE ON UTILITY AND CLOUD COMPUTING, UCC 2023, 2023,
  • [27] Federated Learning Approach to Protect Healthcare Data over Big Data Scenario
    Dhiman, Gaurav
    Juneja, Sapna
    Mohafez, Hamidreza
    El-Bayoumy, Ibrahim
    Sharma, Lokesh Kumar
    Hadizadeh, Maryam
    Islam, Mohammad Aminul
    Viriyasitavat, Wattana
    Khandaker, Mayeen Uddin
    SUSTAINABILITY, 2022, 14 (05)
  • [28] Federated Learning for Healthcare Applications
    Chaddad, Ahmad
    Wu, Yihang
    Desrosiers, Christian
    IEEE INTERNET OF THINGS JOURNAL, 2024, 11 (05): : 7339 - 7358
  • [29] Federated Learning for Healthcare Informatics
    Jie Xu
    Benjamin S. Glicksberg
    Chang Su
    Peter Walker
    Jiang Bian
    Fei Wang
    Journal of Healthcare Informatics Research, 2021, 5 : 1 - 19
  • [30] Federated Learning for Healthcare Informatics
    Xu, Jie
    Glicksberg, Benjamin S.
    Su, Chang
    Walker, Peter
    Bian, Jiang
    Wang, Fei
    JOURNAL OF HEALTHCARE INFORMATICS RESEARCH, 2021, 5 (01) : 1 - 19