Robust scientific text classification using prompt tuning based on data augmentation with L2 regularization

Cited by: 7
Authors
Shi, Shijun [1]
Hu, Kai [1]
Xie, Jie [2,3]
Guo, Ya [1]
Wu, Huayi [4]
Affiliations
[1] Jiangnan Univ, Key Lab Adv Proc Control Light Ind, Minist Educ, Wuxi 214122, Peoples R China
[2] Nanjing Normal Univ, Sch Comp & Elect Informat, Nanjing 210023, Peoples R China
[3] Nanjing Normal Univ, Sch Artificial Intelligence, Nanjing 210023, Peoples R China
[4] Wuhan Univ, State Key Lab Informat Engn Surveying Mapping & Re, Wuhan 430079, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Scientific text classification; Pre-training model; Prompt tuning; Data augmentation; Pairwise training; L2 regularization
DOI
10.1016/j.ipm.2023.103531
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Recently, prompt tuning, which incorporates prompts into the input of a pre-trained language model (such as BERT or GPT), has shown promise in improving the performance of language models when annotated data are limited. However, semantically equivalent templates do not necessarily yield equivalent prompt effects, and prompt tuning often exhibits unstable performance, a problem that is more severe in the scientific domain. To address this challenge, we propose to enhance prompt tuning using data augmentation with L2 regularization. Namely, pairwise training is performed on each pair of original and transformed data. Our experiments on two scientific text datasets (ACL-ARC and SciCite) demonstrate that our proposed method significantly improves both accuracy and robustness. Using 1000 of the 1688 samples in the ACL-ARC training set, our method achieved an F1 score 3.33% higher than the same model trained on all 1688 samples. On the SciCite dataset, our method surpassed the same model even with the labeled data reduced by over 93%. Our method also proves highly robust, reaching F1 scores 1% to 8% higher than those of models without our method after the Probability Weighted Word Saliency attack.
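The pairwise training idea mentioned in the abstract can be illustrated with a small sketch. The abstract does not give the exact objective, so everything here is an assumption made for illustration: the function names (`softmax`, `cross_entropy`, `pairwise_l2_loss`), the weight `lam`, and the choice to apply the L2 penalty to the difference between the predicted class distributions of the original and the augmented sample. It is not the authors' implementation.

```python
import math


def softmax(logits):
    """Convert raw class scores into a probability distribution."""
    m = max(logits)  # subtract the max for numerical stability
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def cross_entropy(probs, label):
    """Negative log-likelihood of the gold label."""
    return -math.log(probs[label])


def pairwise_l2_loss(logits_orig, logits_aug, label, lam=1.0):
    """Hypothetical pairwise objective (an assumption, not the paper's formula).

    Sums the classification loss on the original sample and on its
    augmented counterpart, then adds an L2 penalty that pulls the two
    predicted distributions towards each other.
    """
    p_orig = softmax(logits_orig)
    p_aug = softmax(logits_aug)
    ce = cross_entropy(p_orig, label) + cross_entropy(p_aug, label)
    l2 = sum((a - b) ** 2 for a, b in zip(p_orig, p_aug))
    return ce + lam * l2


# Example: scores for three classes on an original and a transformed
# sentence that share the gold label 0.
loss = pairwise_l2_loss([2.0, 0.5, -1.0], [1.2, 0.9, -0.4], label=0, lam=0.5)
print(f"pairwise loss: {loss:.4f}")
```

Under this assumed formulation, the penalty vanishes when the model predicts the same distribution for both members of a pair, so it only penalizes predictions that change under the augmentation.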
Pages: 19
Related papers
50 in total
  • [11] TextANN: An Improved Text Classification Model Based on Data Augmentation
    Li, Hong
    Yang, Xiaosheng
    Yang, Guoqing
    Ouyang, Xiaogang
    Chen, Yu
    Wang, Xueqing
    2018 INTERNATIONAL CONFERENCE ON CLOUD COMPUTING, BIG DATA AND BLOCKCHAIN (ICCBB 2018), 2018, : 160 - 163
  • [12] GDA: Grammar-based Data Augmentation for Text Classification using Slot Information
    Hahn, Joonghyuk
    Cheon, Hyunjoon
    Orwig, Elizabeth
    Kim, Su-Hyeon
    Ko, Sang-Ki
    Han, Yo-Sub
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS - EMNLP 2023, 2023, : 7291 - 7306
  • [13] Medical text classification based on the discriminative pre-training model and prompt-tuning
    Wang, Yu
    Wang, Yuan
    Peng, Zhenwan
    Zhang, Feifan
    Zhou, Luyao
    Yang, Fei
    DIGITAL HEALTH, 2023, 9
  • [14] Joint Classification of Hyperspectral Image and LiDAR Data Based on Spectral Prompt Tuning
    Kong, Yi
    Cheng, Yuhu
    Chen, Yang
    Wang, Xuesong
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62
  • [15] RoPDA: Robust Prompt-Based Data Augmentation for Low-Resource Named Entity Recognition
    Song, Sihan
    Shen, Furao
    Zhao, Jian
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 17, 2024, : 19017 - 19025
  • [16] 3-D inversion of magnetic data based on the L1–L2 norm regularization
    Mitsuru Utsugi
    Earth, Planets and Space, 71
  • [17] Shared features of L2 writing: Intergroup homogeneity and text classification
    Crossley, Scott A.
    McNamara, Danielle S.
    JOURNAL OF SECOND LANGUAGE WRITING, 2011, 20 (04) : 271 - 285
  • [18] Combined l2 data and gradient fitting in conjunction with l1 regularization
    Didas, Stephan
    Setzer, Simon
    Steidl, Gabriele
    ADVANCES IN COMPUTATIONAL MATHEMATICS, 2009, 30 (01) : 79 - 99
  • [19] Learning Robust Scene Classification Model with Data Augmentation Based on Xception
    Chen, Haiyan
    Yang, Yu
    Zhang, Suning
    5TH ANNUAL INTERNATIONAL CONFERENCE ON INFORMATION SYSTEM AND ARTIFICIAL INTELLIGENCE (ISAI2020), 2020, 1575
  • [20] Novel Robust Augmentation Approach Based on Sensing Features for Data Classification
    Alajmi, Masoud M.
    Awedat, Khalfalla A.
    IEEE ACCESS, 2021, 9 : 127559 - 127564