Domain Adaptation Without Source Data

被引:83
|
作者
Kim Y. [1 ]
Cho D. [2 ]
Han K. [3 ]
Panda P. [1 ]
Hong S. [3 ]
机构
[1] The Department of Electrical Engineering, Yale University, New Haven, 06520, CT
[2] The Department of Electronics Engineering, Chungnam National University, Daejeon
[3] The Department of Electrical and Computer Engineering, Inha University, Incheon
来源
IEEE Transactions on Artificial Intelligence | 2021年 / 2卷 / 06期
基金
新加坡国家研究基金会;
关键词
Class prototypes; data privacy; pseudolabels; self-entropy; source data free domain adaptation (SFDA);
D O I
10.1109/TAI.2021.3110179
中图分类号
学科分类号
摘要
Domain adaptation assumes that samples from source and target domains are freely accessible during a training phase. However, such an assumption is rarely plausible in the real world and possibly causes data privacy issues, especially when the label of the source domain can be a sensitive attribute as an identifier. To avoid accessing source data that could contain sensitive information, we introduce source data free domain adaptation (SFDA). Our key idea is to leverage a pretrained model from the source domain and progressively update the target model in a self-learning manner. We observe that target samples with lower self-entropy measured by the pretrained source model are more likely to be classified correctly. From this, we select the reliable samples with the self-entropy criterion and define these as class prototypes. We then assign pseudolabels for every target sample based on the similarity score with class prototypes. We further propose point-to-set distance-based filtering, which does not require any tunable hyperparameters to reduce uncertainty from the pseudolabeling process. Finally, we train the target model with the filtered pseudolabels with regularization from the pretrained source model. Surprisingly, without direct usage of labeled source samples, our SFDA outperforms conventional domain adaptation methods on benchmark datasets. Impact Statement-This study addresses the data privacy issue, especially in unsupervised domain adaptation. Based on our privacy-preserving domain adaptation, various stakeholders, including enterprises and government organizations, can be free of concern about privacy issues with their labeled source dataset. Furthermore, the proposed data-free approach can contribute to creating a positive social impact, especially in large-scale datasets. Recently, since the size of data across various fields has been scaling up, it is almost incapable for individual researchers to directly utilize a large scale of data during training. For this reason, a new social trend of sharing pretrained models, e.g., EfficientNet and BERT, led by global enterprises with their huge amount of resources has been rising up. From this viewpoint, our approach thus enables more people to participate in the domain adaptation field specifically when the source data are large scale and contain sensitive attributes. © 2021 IEEE.
引用
收藏
页码:508 / 518
页数:10
相关论文
共 50 条
  • [21] Cross Domain Pulmonary Nodule Detection Without Source Data
    Xu, Rui
    Luo, Yong
    Xu, Yan
    ADVANCES IN ARTIFICIAL INTELLIGENCE, AI 2023, PT I, 2024, 14471 : 153 - 164
  • [22] Data-Driven Approach to Multiple-Source Domain Adaptation
    Stojanov, Petar
    Gong, Mingming
    Carbonell, Jaime G.
    Zhang, Kun
    22ND INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 89, 2019, 89
  • [23] Multi-source domain adaptation of GPR data for IED detection
    Mehmet Oturak
    Seniha Esen Yuksel
    Sefa Kucuk
    Signal, Image and Video Processing, 2023, 17 : 1831 - 1839
  • [24] Multi-source domain adaptation of GPR data for IED detection
    Oturak, Mehmet
    Yuksel, Seniha Esen
    Kucuk, Sefa
    SIGNAL IMAGE AND VIDEO PROCESSING, 2023, 17 (05) : 1831 - 1839
  • [25] Multi-Source Domain Adaptation Using Ambient Sensor Data
    Dridi, Jawher
    Amayri, Manar
    Bouguila, Nizar
    APPLIED ARTIFICIAL INTELLIGENCE, 2024, 38 (01)
  • [26] Source Data-Free Unsupervised Domain Adaptation for Semantic Segmentation
    Ye, Mucong
    Zhang, Jing
    Ouyang, Jingpeng
    Yuan, Ding
    PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 2233 - 2242
  • [27] Unsupervised domain adaptation without source domain training samples - a maximum margin clustering based approach
    Saha, Sudipan
    Banerjee, Biplab
    Merchant, Shabbir N.
    TENTH INDIAN CONFERENCE ON COMPUTER VISION, GRAPHICS AND IMAGE PROCESSING (ICVGIP 2016), 2016,
  • [28] Partial Domain Adaptation Without Domain Alignment
    Li, Weikai
    Chen, Songcan
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (07) : 8787 - 8797
  • [29] DACH: Domain Adaptation Without Domain Information
    Cai, Ruichu
    Li, Jiahao
    Zhang, Zhenjie
    Yang, Xiaoyan
    Hao, Zhifeng
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2020, 31 (12) : 5055 - 5067
  • [30] VDM-DA: Virtual Domain Modeling for Source Data-Free Domain Adaptation
    Tian, Jiayi
    Zhang, Jing
    Li, Wen
    Xu, Dong
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (06) : 3749 - 3760