Vertical federated learning based on data subset representation for healthcare application

被引:0
|
作者
Shi, Yukun [1 ]
Zhang, Jilin [1 ]
Xue, Meiting [1 ]
Zeng, Yan [2 ]
Jia, Gangyong [2 ]
Yu, Qihong [2 ]
Li, Miaoqi [2 ]
机构
[1] Hangzhou Dianzi Univ, Sch Cyberspace, Hangzhou 310018, Peoples R China
[2] Hangzhou Dianzi Univ, Sch Comp Sci & Technol, Hangzhou 310018, Peoples R China
基金
中国国家自然科学基金;
关键词
Vertical federated learning; Latent feature representation; Smart healthcare; Privacy preservation;
D O I
10.1016/j.cmpb.2025.108623
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Background and Objective : Artificial intelligence is increasingly essential for disease classification and clinical diagnosis tasks in healthcare. Given the strict privacy needs of healthcare data, Vertical Federated Learning (VFL) has been introduced. VFL allows multiple hospitals to collaboratively train models on vertically partitioned data, where each holds only the patient's partial data features, thus maintaining patient confidentiality. However, VFL applications in healthcare scenarios with fewer samples and labels are challenging because existing methods heavily depend on labeled samples and do not consider the intrinsic connections among the data across hospitals. Methods : This paper proposes FedRL, a representation-based VFL method that enhances the performance of downstream tasks by utilizing aligned data for federated representation pretraining. The proposed method creates the same feature dimensions subsets by splitting the local data, exploiting the relationships among these subsets, constructing a bespoke loss function, and collaboratively training a representation model to these subsets across all participating hospitals. This model captures the latent representations of the global data, which are then applied to the downstream classification tasks. Results and Conclusion : The proposed FedRL method was validated through experiments on three healthcare datasets. The results demonstrate that the proposed method outperforms several existing methods across three performance metrics. Specifically, FedRL achieves average improvements of 4.7%, 5.6%, and 4.8% in accuracy, AUC, and F1-score, respectively, compared to current methods. In addition, FedRL demonstrates greater robustness and consistent performance in scenarios with limited labeled samples, thereby confirming its effectiveness and potential use in healthcare data analysis.
引用
收藏
页数:11
相关论文
共 50 条
  • [1] Practical Vertical Federated Learning With Unsupervised Representation Learning
    Wu, Zhaomin
    Li, Qinbin
    He, Bingsheng
    IEEE TRANSACTIONS ON BIG DATA, 2024, 10 (06) : 864 - 878
  • [2] On the Impact of Data Heterogeneity in Federated Learning Environments with Application to Healthcare Networks
    Milasheuski, U.
    Barbieri, L.
    Tedeschini, B. Camajori
    Nicoli, M.
    Savazzi, M. S.
    2024 IEEE CONFERENCE ON ARTIFICIAL INTELLIGENCE, CAI 2024, 2024, : 1017 - 1023
  • [3] Application of Robust Zero-Watermarking Scheme Based on Federated Learning for Securing the Healthcare Data
    Han, Baoru
    Jhaveri, Rutvij H.
    Wang, Han
    Qiao, Dawei
    Du, Jinglong
    IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2023, 27 (02) : 804 - 813
  • [4] TabVFL: Improving Latent Representation in Vertical Federated Learning
    Rashad, Mohamed
    Zhao, Zilong
    Decouchant, Jeremie
    Chen, Lydia Y.
    2024 43RD INTERNATIONAL SYMPOSIUM ON RELIABLE DISTRIBUTED SYSTEMS, SRDS 2024, 2024, : 210 - 221
  • [5] PraVFed: Practical Heterogeneous Vertical Federated Learning via Representation Learning
    Wang, Shuo
    Gai, Keke
    Yu, Jing
    Zhang, Zijian
    Zhu, Liehuang
    IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2025, 20 : 2693 - 2705
  • [6] A Data Augmentation Method for Vertical Federated Learning
    Zhang, JianFei
    Jiang, YuChen
    WIRELESS COMMUNICATIONS & MOBILE COMPUTING, 2022, 2022
  • [7] Review on security of federated learning and its application in healthcare
    Li, Hao
    Li, Chengcheng
    Wang, Jian
    Yang, Aimin
    Ma, Zezhong
    Zhang, Zunqian
    Hua, Dianbo
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2023, 144 : 271 - 290
  • [8] A Data Reconstruction Attack against Vertical Federated Learning Based on Knowledge Transfer
    Suimon, Takumi
    Koizumi, Yuki
    Takemasa, Junji
    Hasegawa, Toni
    IEEE INFOCOM 2024-IEEE CONFERENCE ON COMPUTER COMMUNICATIONS WORKSHOPS, INFOCOM WKSHPS 2024, 2024,
  • [9] Gradient-based defense methods for data leakage in vertical federated learning
    Chang, Wenhan
    Zhu, Tianqing
    COMPUTERS & SECURITY, 2024, 139
  • [10] ReVFed: Representation-Based Privacy-Preserving Vertical Federated Learning with Heterogeneous Models
    Wang, Shuo
    Yu, Jing
    Gai, Keke
    Zhu, Liehuang
    KNOWLEDGE SCIENCE, ENGINEERING AND MANAGEMENT, PT III, KSEM 2024, 2024, 14886 : 386 - 397