Domain Adaptation Without Source Data

被引:83
|
作者
Kim Y. [1 ]
Cho D. [2 ]
Han K. [3 ]
Panda P. [1 ]
Hong S. [3 ]
机构
[1] The Department of Electrical Engineering, Yale University, New Haven, 06520, CT
[2] The Department of Electronics Engineering, Chungnam National University, Daejeon
[3] The Department of Electrical and Computer Engineering, Inha University, Incheon
来源
IEEE Transactions on Artificial Intelligence | 2021年 / 2卷 / 06期
基金
新加坡国家研究基金会;
关键词
Class prototypes; data privacy; pseudolabels; self-entropy; source data free domain adaptation (SFDA);
D O I
10.1109/TAI.2021.3110179
中图分类号
学科分类号
摘要
Domain adaptation assumes that samples from source and target domains are freely accessible during a training phase. However, such an assumption is rarely plausible in the real world and possibly causes data privacy issues, especially when the label of the source domain can be a sensitive attribute as an identifier. To avoid accessing source data that could contain sensitive information, we introduce source data free domain adaptation (SFDA). Our key idea is to leverage a pretrained model from the source domain and progressively update the target model in a self-learning manner. We observe that target samples with lower self-entropy measured by the pretrained source model are more likely to be classified correctly. From this, we select the reliable samples with the self-entropy criterion and define these as class prototypes. We then assign pseudolabels for every target sample based on the similarity score with class prototypes. We further propose point-to-set distance-based filtering, which does not require any tunable hyperparameters to reduce uncertainty from the pseudolabeling process. Finally, we train the target model with the filtered pseudolabels with regularization from the pretrained source model. Surprisingly, without direct usage of labeled source samples, our SFDA outperforms conventional domain adaptation methods on benchmark datasets. Impact Statement-This study addresses the data privacy issue, especially in unsupervised domain adaptation. Based on our privacy-preserving domain adaptation, various stakeholders, including enterprises and government organizations, can be free of concern about privacy issues with their labeled source dataset. Furthermore, the proposed data-free approach can contribute to creating a positive social impact, especially in large-scale datasets. Recently, since the size of data across various fields has been scaling up, it is almost incapable for individual researchers to directly utilize a large scale of data during training. For this reason, a new social trend of sharing pretrained models, e.g., EfficientNet and BERT, led by global enterprises with their huge amount of resources has been rising up. From this viewpoint, our approach thus enables more people to participate in the domain adaptation field specifically when the source data are large scale and contain sensitive attributes. © 2021 IEEE.
引用
收藏
页码:508 / 518
页数:10
相关论文
共 50 条
  • [41] Semi-Supervised Domain Adaptation with Source Label Adaptation
    Yu, Yu-Chu
    Lin, Hsuan-Tien
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 24100 - 24109
  • [42] A Free Lunch for Unsupervised Domain Adaptive Object Detection without Source Data
    Li, Xianfeng
    Chen, Weijie
    Xie, Di
    Yang, Shicai
    Yuan, Peng
    Pu, Shiliang
    Zhuang, Yueting
    THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 8474 - 8481
  • [43] A domain adaptation method for bearing fault diagnosis using multiple incomplete source data
    Qibin Wang
    Yuanbing Xu
    Shengkang Yang
    Jiantao Chang
    Jingang Zhang
    Xianguang Kong
    Journal of Intelligent Manufacturing, 2024, 35 : 777 - 791
  • [44] A domain adaptation method for bearing fault diagnosis using multiple incomplete source data
    Wang, Qibin
    Xu, Yuanbing
    Yang, Shengkang
    Chang, Jiantao
    Zhang, Jingang
    Kong, Xianguang
    JOURNAL OF INTELLIGENT MANUFACTURING, 2024, 35 (02) : 777 - 791
  • [45] Adaptive Contrastive Learning with Label Consistency for Source Data Free Unsupervised Domain Adaptation
    Zhao, Xuejun
    Stanislawski, Rafal
    Gardoni, Paolo
    Sulowicz, Maciej
    Glowacz, Adam
    Krolczyk, Grzegorz
    Li, Zhixiong
    SENSORS, 2022, 22 (11)
  • [46] Semantic consistency learning on manifold for source data-free unsupervised domain adaptation
    Tang, Song
    Zou, Yan
    Song, Zihao
    Lyu, Jianzhi
    Chen, Lijuan
    Ye, Mao
    Zhong, Shouming
    Zhang, Jianwei
    NEURAL NETWORKS, 2022, 152 : 467 - 478
  • [47] Source-free domain adaptation with unrestricted source hypothesis
    He, Jiujun
    Wu, Liang
    Tao, Chaofan
    Lv, Fengmao
    PATTERN RECOGNITION, 2024, 149
  • [48] Adversarial Source Generation for Source-Free Domain Adaptation
    Cui, Chaoran
    Meng, Fan'an
    Zhang, Chunyun
    Liu, Ziyi
    Zhu, Lei
    Gong, Shuai
    Lin, Xue
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (06) : 4887 - 4898
  • [49] Source-free domain adaptation with unrestricted source hypothesis
    He, Jiujun
    Wu, Liang
    Tao, Chaofan
    Lv, Fengmao
    Pattern Recognition, 2024, 149
  • [50] Source bias reduction for source-free domain adaptation
    Tian, Liang
    Ye, Mao
    Zhou, Lihua
    Wang, Zhenbin
    SIGNAL IMAGE AND VIDEO PROCESSING, 2024, 18 (SUPPL 1) : 883 - 893