Domain Adaptation Without Source Data

Cited by: 83

Authors:

Kim Y. [1]

Cho D. [2]

Han K. [3]

Panda P. [1]

Hong S. [3]

Affiliations:

[1] The Department of Electrical Engineering, Yale University, New Haven, CT 06520

[2] The Department of Electronics Engineering, Chungnam National University, Daejeon

[3] The Department of Electrical and Computer Engineering, Inha University, Incheon

Source:

IEEE Transactions on Artificial Intelligence | 2021 / Vol. 2 / No. 6

Funding:

National Research Foundation, Singapore;

Keywords:

Class prototypes; data privacy; pseudolabels; self-entropy; source data free domain adaptation (SFDA);

DOI:

10.1109/TAI.2021.3110179

Abstract:

Domain adaptation assumes that samples from both the source and target domains are freely accessible during the training phase. However, this assumption rarely holds in the real world and can raise data privacy issues, especially when the source-domain label is a sensitive attribute that can serve as an identifier. To avoid accessing source data that could contain sensitive information, we introduce source data free domain adaptation (SFDA). Our key idea is to leverage a pretrained model from the source domain and progressively update the target model in a self-learning manner. We observe that target samples with lower self-entropy, as measured by the pretrained source model, are more likely to be classified correctly. Based on this observation, we select the reliable samples using the self-entropy criterion and define them as class prototypes. We then assign a pseudolabel to every target sample based on its similarity score with the class prototypes. To reduce uncertainty in the pseudolabeling process, we further propose point-to-set distance-based filtering, which requires no tunable hyperparameters. Finally, we train the target model on the filtered pseudolabels, with regularization from the pretrained source model. Surprisingly, without direct use of labeled source samples, our SFDA outperforms conventional domain adaptation methods on benchmark datasets.
Impact Statement: This study addresses the data privacy issue in unsupervised domain adaptation. With our privacy-preserving domain adaptation, various stakeholders, including enterprises and government organizations, need not worry about privacy issues with their labeled source datasets. Furthermore, the proposed data-free approach can have a positive social impact, especially for large-scale datasets. As the size of data across various fields has been scaling up, it has become nearly impossible for individual researchers to directly use large-scale data during training. For this reason, a trend of sharing pretrained models, e.g., EfficientNet and BERT, led by global enterprises with vast resources, has been emerging. From this viewpoint, our approach enables more people to participate in the domain adaptation field, specifically when the source data are large scale and contain sensitive attributes. © 2021 IEEE.
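
The prototype selection and pseudolabeling steps described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the `per_class` parameter, the use of cosine similarity as the similarity score, and the toy features and probabilities are all assumptions made for illustration. The point-to-set distance-based filtering and the training step are omitted.

```python
import math

def self_entropy(probs):
    """Shannon entropy of one softmax output (lower means more confident)."""
    return -sum(p * math.log(p) for p in probs if p > 0.0)

def cosine(u, v):
    """Cosine similarity, used here as an assumed similarity score."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm

def select_prototypes(features, probs, per_class=1):
    """For each predicted class, keep the features with the lowest self-entropy.

    `per_class` is a hypothetical knob for this sketch, not a value from the paper.
    """
    by_class = {}
    for feat, p in zip(features, probs):
        label = max(range(len(p)), key=p.__getitem__)  # argmax of the source model output
        by_class.setdefault(label, []).append((self_entropy(p), feat))
    return {
        c: [f for _, f in sorted(items, key=lambda t: t[0])[:per_class]]
        for c, items in by_class.items()
    }

def pseudolabel(feature, prototypes):
    """Assign the class whose prototypes are most similar on average."""
    scores = {
        c: sum(cosine(feature, q) for q in protos) / len(protos)
        for c, protos in prototypes.items()
    }
    return max(scores, key=scores.get)

# Toy target features and source-model softmax outputs (made-up values).
features = [[1.0, 0.0], [0.9, 0.1], [0.0, 1.0], [0.2, 0.8]]
probs = [[0.95, 0.05], [0.6, 0.4], [0.1, 0.9], [0.45, 0.55]]

prototypes = select_prototypes(features, probs)
labels = [pseudolabel(f, prototypes) for f in features]
print(prototypes)
print(labels)
```

In this toy run, the most confident sample of each predicted class becomes its prototype, and the remaining samples are labeled by their nearest prototype.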

Pages: 508-518

Number of pages: 10

