Stealing Knowledge from Pre-trained Language Models for Federated Classifier Debiasing

Cited by: 1
Authors
Zhu, Meilu [1 ]
Yang, Qiushi [2 ]
Gao, Zhifan [3 ]
Liu, Jun [1 ]
Yuan, Yixuan [4 ]
Affiliations
[1] City Univ Hong Kong, Dept Mech Engn, Hong Kong, Peoples R China
[2] City Univ Hong Kong, Dept Elect Engn, Hong Kong, Peoples R China
[3] Sun Yat Sen Univ, Sch Biomed Engn, Guangzhou, Peoples R China
[4] Chinese Univ Hong Kong, Dept Elect Engn, Hong Kong, Peoples R China
Keywords
Federated learning; Medical image classification; Pre-trained language model
DOI
10.1007/978-3-031-72117-5_64
Chinese Library Classification
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Federated learning (FL) has shown great potential in medical image computing since it provides a decentralized learning paradigm that allows multiple clients to train a model collaboratively without privacy leakage. However, current studies have shown that heterogeneous client data biases the classifiers of local models during training, degrading the performance of the federated system. In our experiments, we found, surprisingly, that continuously freezing local classifiers can significantly improve the performance of the baseline FL method (FedAvg) on heterogeneous data. This observation motivates us to pre-construct a high-quality initial classifier for local models and freeze it during local training to avoid classifier bias. With this insight, we propose a novel approach named Federated Classifier deBiasing (FedCB) to solve the classifier bias problem in heterogeneous federated learning. The core idea behind FedCB is to exploit linguistic knowledge from pre-trained language models (PLMs) to construct high-quality local classifiers. Specifically, FedCB first collects the class concepts from clients and then uses a set of prompts to contextualize them, yielding language descriptions of these concepts. These descriptions are fed into a pre-trained language model to obtain their text embeddings. The generated embeddings are sent to clients to estimate the distribution of each category in the semantic space. Treating these distributions as the local classifiers, we align the image representations with the corresponding semantic distributions by minimizing an upper bound of the expected cross-entropy loss. Extensive experiments on public datasets demonstrate the superior performance of FedCB compared to state-of-the-art methods. The source code is available at https://github.com/CUHK-AIM-Group/FedCB.
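
The abstract above outlines the core mechanism: prompt-contextualized class names are encoded by a pre-trained language model, the resulting text embeddings act as a fixed classifier shared with every client, and only the local image encoder is trained to align with them. Below is a minimal sketch of that idea, assuming bert-base-uncased as the PLM, mean pooling for sentence embeddings, an illustrative prompt template, and a plain cosine-similarity cross-entropy in place of the paper's upper-bound objective; the function and class names (class_text_embeddings, FrozenSemanticClassifier, local_training_step) are hypothetical and not taken from the released code.

# Sketch of the FedCB idea from the abstract (assumptions noted above).
import torch
import torch.nn as nn
import torch.nn.functional as F
from transformers import AutoTokenizer, AutoModel


@torch.no_grad()
def class_text_embeddings(class_names, plm_name="bert-base-uncased"):
    """Encode prompted class descriptions with a frozen pre-trained LM."""
    tokenizer = AutoTokenizer.from_pretrained(plm_name)
    plm = AutoModel.from_pretrained(plm_name).eval()
    prompts = [f"a medical image of {c}" for c in class_names]  # illustrative prompt
    batch = tokenizer(prompts, padding=True, truncation=True, return_tensors="pt")
    hidden = plm(**batch).last_hidden_state                     # (C, T, H)
    mask = batch["attention_mask"].unsqueeze(-1).float()
    emb = (hidden * mask).sum(dim=1) / mask.sum(dim=1)          # mean pooling -> (C, H)
    return F.normalize(emb, dim=-1)


class FrozenSemanticClassifier(nn.Module):
    """Linear classifier whose weights are the PLM class embeddings, kept frozen."""
    def __init__(self, class_embeddings, scale=10.0):
        super().__init__()
        self.register_buffer("weight", class_embeddings)   # (C, H), never updated
        self.scale = scale

    def forward(self, image_features):                      # (B, H)
        feats = F.normalize(image_features, dim=-1)
        return self.scale * feats @ self.weight.t()          # cosine logits (B, C)


def local_training_step(image_encoder, classifier, images, labels, optimizer):
    """One client-side step: only the image encoder receives gradients."""
    logits = classifier(image_encoder(images))
    loss = F.cross_entropy(logits, labels)  # stand-in for the expected-CE upper bound
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
    return loss.item()

In the federated setting described in the abstract, the server would distribute the same text embeddings to all clients, so every local model aligns to identical semantic anchors and the classifier cannot drift with a client's heterogeneous label distribution.
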
Pages: 685-695
Page count: 11