Domain-Invariant Feature Progressive Distillation with Adversarial Adaptive Augmentation for Low-Resource Cross-Domain NER

被引:2
|
作者
Zhang, Tao [1 ]
Xia, Congying [1 ]
Liu, Zhiwei [1 ]
Zhao, Shu [2 ]
Peng, Hao [3 ]
Yu, Philip [1 ]
机构
[1] Univ Illinois, Dept Comp Sci, 851 South Morgan St, Chicago, IL 60607 USA
[2] Anhui Univ, Sch Comp Sci & Technol, 111 Jiulong Rd, Hefei 230601, Anhui, Peoples R China
[3] Beihang Univ, Beijing Adv Innovat Ctr Big Data & Brain Comp, 37 Xue Yuan Rd, Beijing 100191, Peoples R China
基金
北京市自然科学基金; 国家重点研发计划;
关键词
NER; adversarial augmentation; cross-domain; domain adaptation; low-resource; knowledge distillation;
D O I
10.1145/3570502
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Considering the expensive annotation in Named Entity Recognition (NER), Cross-domain NER enables NER in low-resource target domains with few or without labeled data, by transferring the knowledge of high-resource domains. However, the discrepancy between different domains causes the domain shift problem and hampers the performance of cross-domain NER in low-resource scenarios. In this article, we first propose an adversarial adaptive augmentation, where we integrate the adversarial strategy into a multi-task learner to augment and qualify domain adaptive data. We extract domain-invariant features of the adaptive data to bridge the cross-domain gap and alleviate the label-sparsity problem simultaneously. Therefore, another important component in this article is the progressive domain-invariant feature distillation framework. A multi-grained MMD (Maximum Mean Discrepancy) approach in the framework to extract the multi-level domain invariant features and enable knowledge transfer across domains through the adversarial adaptive data. Advanced Knowledge Distillation (KD) schema processes progressively domain adaptation through the powerful pre-trained language models and multi-level domain invariant features. Extensive comparative experiments over four English and two Chinese benchmarks show the importance of adversarial augmentation and effective adaptation from high-resource domains to low-resource target domains. Comparison with two vanilla and four latest baselines indicates the state-of-the-art performance and superiority confronted with both zero-resource and minimal-resource scenarios.
引用
收藏
页数:21
相关论文
共 50 条
  • [21] Low-Resource Adversarial Domain Adaptation for Cross-modality Nucleus Detection
    Xing, Fuyong
    Cornish, Toby C.
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2022, PT VII, 2022, 13437 : 639 - 649
  • [22] Graph-Diffusion-Based Domain-Invariant Representation Learning for Cross-Domain Facial Expression Recognition
    Wang, Run
    Song, Peng
    Zheng, Wenming
    IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS, 2024, 11 (03) : 4163 - 4174
  • [23] Domain Generalization and Feature Fusion for Cross-domain Imperceptible Adversarial Attack Detection
    Li, Yi
    Angelov, Plamen
    Suri, Neeraj
    2023 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN, 2023,
  • [24] Adaptive Adversarial Contrastive Learning for Cross-Domain Recommendation
    Hsu, Chi-Wei
    Chen, Chiao-Ting
    Huang, Szu-Hao
    ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2024, 18 (03)
  • [25] Multiscale capsule networks with attention mechanisms based on domain-invariant properties for cross-domain lifetime prediction
    Shang, Zhiwu
    Feng, Zehua
    DIGITAL SIGNAL PROCESSING, 2024, 146
  • [26] Domain adaptation based on domain-invariant and class-distinguishable feature learning using multiple adversarial networks
    Fan, Cangning
    Liu, Peng
    Xiao, Ting
    Zhao, Wei
    Tang, Xianglong
    NEUROCOMPUTING, 2020, 411 : 178 - 192
  • [27] Cross-Domain Palmprint Recognition via Regularized Adversarial Domain Adaptive Hashing
    Du, Xuefeng
    Zhong, Dexing
    Shao, Huikai
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2021, 31 (06) : 2372 - 2385
  • [29] Progressive Transfer Learning and Adversarial Domain Adaptation for Cross-Domain Skin Disease Classification
    Gu, Yanyang
    Ge, Zongyuan
    Bonnington, C. Paul
    Zhou, Jun
    IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2020, 24 (05) : 1379 - 1393
  • [30] Feature transfer based adversarial domain adaptation method for cross-domain road extraction
    Wang, Shuyang
    Mu, Xiaodong
    He, Hao
    Yang, Dongfang
    Zhao, Peng
    GEOCARTO INTERNATIONAL, 2022, 37 (02) : 445 - 455