Domain-Invariant Feature Progressive Distillation with Adversarial Adaptive Augmentation for Low-Resource Cross-Domain NER

被引:2
|
作者
Zhang, Tao [1 ]
Xia, Congying [1 ]
Liu, Zhiwei [1 ]
Zhao, Shu [2 ]
Peng, Hao [3 ]
Yu, Philip [1 ]
机构
[1] Univ Illinois, Dept Comp Sci, 851 South Morgan St, Chicago, IL 60607 USA
[2] Anhui Univ, Sch Comp Sci & Technol, 111 Jiulong Rd, Hefei 230601, Anhui, Peoples R China
[3] Beihang Univ, Beijing Adv Innovat Ctr Big Data & Brain Comp, 37 Xue Yuan Rd, Beijing 100191, Peoples R China
基金
北京市自然科学基金; 国家重点研发计划;
关键词
NER; adversarial augmentation; cross-domain; domain adaptation; low-resource; knowledge distillation;
D O I
10.1145/3570502
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Considering the expensive annotation in Named Entity Recognition (NER), Cross-domain NER enables NER in low-resource target domains with few or without labeled data, by transferring the knowledge of high-resource domains. However, the discrepancy between different domains causes the domain shift problem and hampers the performance of cross-domain NER in low-resource scenarios. In this article, we first propose an adversarial adaptive augmentation, where we integrate the adversarial strategy into a multi-task learner to augment and qualify domain adaptive data. We extract domain-invariant features of the adaptive data to bridge the cross-domain gap and alleviate the label-sparsity problem simultaneously. Therefore, another important component in this article is the progressive domain-invariant feature distillation framework. A multi-grained MMD (Maximum Mean Discrepancy) approach in the framework to extract the multi-level domain invariant features and enable knowledge transfer across domains through the adversarial adaptive data. Advanced Knowledge Distillation (KD) schema processes progressively domain adaptation through the powerful pre-trained language models and multi-level domain invariant features. Extensive comparative experiments over four English and two Chinese benchmarks show the importance of adversarial augmentation and effective adaptation from high-resource domains to low-resource target domains. Comparison with two vanilla and four latest baselines indicates the state-of-the-art performance and superiority confronted with both zero-resource and minimal-resource scenarios.
引用
收藏
页数:21
相关论文
共 50 条
  • [31] Explicit Invariant Feature Induced Cross-Domain Crowd Counting
    Cai, Yiqing
    Chen, Lianggangxu
    Guan, Haoyue
    Lin, Shaohui
    Lu, Changhong
    Wang, Changbo
    He, Gaoqi
    THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 1, 2023, : 259 - 267
  • [32] Cross-domain Image Localization by Adaptive Feature Fusion
    Bhowmik, Neelanjan
    Weng, Li
    Gouet-Brunet, Valerie
    Soheilian, Bahman
    2017 JOINT URBAN REMOTE SENSING EVENT (JURSE), 2017,
  • [33] Structure and Label Constrained Data Augmentation for Cross-domain Few-shot NER
    Zhang, Jingyi
    Zhang, Ying
    Chen, Yufeng
    Xu, Jinan
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS - EMNLP 2023, 2023, : 518 - 530
  • [34] Semantic invariant cross-domain image generation with generative adversarial networks
    Mao, Xiaofeng
    Wang, Shuhui
    Zheng, Liying
    Huang, Qingming
    NEUROCOMPUTING, 2018, 293 : 55 - 63
  • [35] Cross-domain intelligent diagnostics for rotating machinery using domain adaptive and adversarial networks
    Hu, Kui
    Cheng, Yiwei
    Wu, Jun
    Zhu, Haiping
    JOURNAL OF INDUSTRIAL INFORMATION INTEGRATION, 2024, 42
  • [36] Cross-Domain Adversarial Feature Learning for Sketch Re-identification
    Pang, Lu
    Wang, Yaowei
    Song, Yi-Zhe
    Huang, Tiejun
    Tian, Yonghong
    PROCEEDINGS OF THE 2018 ACM MULTIMEDIA CONFERENCE (MM'18), 2018, : 609 - 617
  • [37] Cross-domain Semantic Feature Learning via Adversarial Adaptation Networks
    Li, Rui
    Cao, Wenming
    Qian, Sheng
    Wong, Hau-San
    Wu, Si
    2018 24TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2018, : 37 - 42
  • [38] Cross-Domain Human Parsing via Adversarial Feature and Label Adaptation
    Liu, Si
    Sun, Yao
    Zhu, Defa
    Ren, Guanghui
    Chen, Yu
    Feng, Jiashi
    Han, Jizhong
    THIRTY-SECOND AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTIETH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / EIGHTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2018, : 7146 - 7153
  • [39] Cross-Domain Few-Shot Classification via Adversarial Task Augmentation
    Wang, Haoqing
    Deng, Zhi-Hong
    PROCEEDINGS OF THE THIRTIETH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2021, 2021, : 1075 - 1081
  • [40] Cross-Domain Adaptive Learning Model Based on Feature Separation
    Li X.
    Li Z.
    Wei J.
    Yang Y.
    Wang H.
    Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2022, 59 (01): : 105 - 117