Learning to Augment for Data-Scarce Domain BERT Knowledge Distillation

被引:0
|
作者
Feng, Lingyun [1 ]
Qiu, Minghui [2 ]
Li, Yaliang [2 ]
Zheng, Hai-Tao [1 ]
Shen, Ying [3 ]
机构
[1] Tsinghua Univ, Beijing, Peoples R China
[2] Alibaba Grp, Hangzhou, Peoples R China
[3] Sun Yat Sen Univ, Guangzhou, Peoples R China
基金
中国国家自然科学基金;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Despite pre-trained language models such as BERT have achieved appealing performance in a wide range of natural language processing tasks, they are computationally expensive to be deployed in real-time applications. A typical method is to adopt knowledge distillation to compress these large pre-trained models (teacher models) to small student models. However, for a target domain with scarce training data, the teacher can hardly pass useful knowledge to the student, which yields performance degradation for the student models. To tackle this problem, we propose a method to learn to augment for data-scarce domain BERT knowledge distillation, by learning a cross-domain manipulation scheme that automatically augments the target with the help of resource-rich source domains. Specifically, the proposed method generates samples acquired from a stationary distribution near the target data and adopts a reinforced selector to automatically refine the augmentation strategy according to the performance of the student. Extensive experiments demonstrate that the proposed method significantly outperforms state-of-the-art baselines on four different tasks, and for the data-scarce domains, the compressed student models even perform better than the original large teacher model, with much fewer parameters (only -13.3%) when only a few labeled examples available.
引用
收藏
页码:7422 / 7430
页数:9
相关论文
共 50 条
  • [31] Hydrological impacts of climate change on a data-scarce Greek catchment
    Venetsanou, P.
    Anagnostopoulou, C.
    Loukas, A.
    Voudouris, K.
    THEORETICAL AND APPLIED CLIMATOLOGY, 2020, 140 (3-4) : 1017 - 1030
  • [32] Representation Learning and Knowledge Distillation for Lightweight Domain Adaptation
    Bin Shah, Sayed Rafay
    Putty, Shreyas Subhash
    Schwung, Andreas
    2024 IEEE CONFERENCE ON ARTIFICIAL INTELLIGENCE, CAI 2024, 2024, : 1202 - 1207
  • [33] Transfer Learning in Landslide Susceptibility Mapping: Bridging Data-Rich and Data-Scarce Regions in the Northwestern Himalayas
    Singh, Ankit
    Dhiman, Nitesh
    Shukla, Dericks Praise
    IGARSS 2024-2024 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, IGARSS 2024, 2024, : 3253 - 3256
  • [34] Integrated hydrodynamic and machine learning models for compound flooding prediction in a data-scarce estuarine delta
    Sampurno, Joko
    Vallaeys, Valentin
    Ardianto, Randy
    Hanert, Emmanuel
    NONLINEAR PROCESSES IN GEOPHYSICS, 2022, 29 (03) : 301 - 315
  • [35] Development of a Distributed Physics-Informed Deep Learning Hydrological Model for Data-Scarce Regions
    Zhong, Liangjin
    Lei, Huimin
    Yang, Jingjing
    WATER RESOURCES RESEARCH, 2024, 60 (06)
  • [36] Enhancing the performance of runoff prediction in data-scarce hydrological domains using advanced transfer learning
    Chen, Songliang
    Mao, Qinglin
    Feng, Youcan
    Li, Hongyan
    Ma, Donghe
    Zhao, Yilian
    Liu, Junhui
    Cheng, Hui
    RESOURCES ENVIRONMENT AND SUSTAINABILITY, 2024, 18
  • [37] An instream ecological flow method for data-scarce regulated rivers
    Liu, Changming
    Zhao, Changsen
    Xia, Jun
    Sun, Changlei
    Wang, Rui
    Liu, Tao
    JOURNAL OF HYDROLOGY, 2011, 398 (1-2) : 17 - 25
  • [38] Hydrological Modeling in Data-Scarce Catchments: The Kilombero Floodplain in Tanzania
    Naeschen, Kristian
    Diekkrueger, Bernd
    Leemhuis, Constanze
    Steinbach, Stefanie
    Seregina, Larisa S.
    Thonfeld, Frank
    van der Linden, Roderick
    WATER, 2018, 10 (05)
  • [39] Comparing conceptual and super ensemble deep learning models for streamflow simulation in data-scarce catchments
    Wegayehu, Eyob Betru
    Muluneh, Fiseha Behulu
    JOURNAL OF HYDROLOGY-REGIONAL STUDIES, 2024, 52
  • [40] Analyses of groundwater level in a data-scarce region based on assessed precipitation products and machine learning
    El-Azhari, Ahmed
    Karaoui, Ismail
    Brahim, Yassine Ait
    Azhar, Mohamed
    Chehbouni, Abdelghani
    Bouchaou, Lhoussaine
    GROUNDWATER FOR SUSTAINABLE DEVELOPMENT, 2024, 26