Robust scientific text classification using prompt tuning based on data augmentation with L2 regularization

Cited by: 7
Authors
Shi, Shijun [1]
Hu, Kai [1]
Xie, Jie [2, 3]
Guo, Ya [1]
Wu, Huayi [4]
Affiliations
[1] Jiangnan Univ, Key Lab Adv Proc Control Light Ind, Minist Educ, Wuxi 214122, Peoples R China
[2] Nanjing Normal Univ, Sch Comp & Elect Informat, Nanjing 210023, Peoples R China
[3] Nanjing Normal Univ, Sch Artificial Intelligence, Nanjing 210023, Peoples R China
[4] Wuhan Univ, State Key Lab Informat Engn Surveying Mapping & Re, Wuhan 430079, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Scientific text classification; Pre-training model; Prompt tuning; Data augmentation; Pairwise training; L2 regularization
DOI
10.1016/j.ipm.2023.103531
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Recently, prompt tuning, which incorporates prompts into the input of a pre-trained language model (such as BERT or GPT), has shown promise in improving the performance of language models when annotated data is limited. However, semantically equivalent templates do not necessarily produce equivalent results, and prompt tuning often exhibits unstable performance, a problem that is more severe in the scientific domain. To address this challenge, we propose to enhance prompt tuning using data augmentation with L2 regularization. Specifically, pairwise training is performed on each pair of original and transformed data. Our experiments on two scientific text datasets (ACL-ARC and SciCite) demonstrate that the proposed method significantly improves both accuracy and robustness. Using 1000 of the 1688 samples in the ACL-ARC training set, our method achieved an F1 score 3.33% higher than the same model trained on all 1688 samples. On the SciCite dataset, our method surpassed the same model while using over 93% less labeled data. Our method also proves highly robust: after the Probability Weighted Word Saliency attack, it reaches F1 scores 1% to 8% higher than models trained without it.
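The pairwise training idea in the abstract can be illustrated with a minimal sketch. The abstract does not give the loss function, so everything here is an assumption: the function names, the weight `lam`, and in particular the choice to apply the L2 penalty to the difference between the model's outputs for the original and the augmented input. It is not the authors' implementation.

```python
import math


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def cross_entropy(logits, label):
    """Negative log-likelihood of the true class index `label`."""
    return -math.log(softmax(logits)[label])


def pairwise_l2_loss(logits_orig, logits_aug, label, lam=1.0):
    """Hypothetical pairwise objective (an assumption, not the paper's formula).

    Sums the classification loss on the original and the augmented input,
    then adds an L2 penalty on the gap between the two output vectors so
    that the model is pushed to predict consistently for both.
    """
    ce = cross_entropy(logits_orig, label) + cross_entropy(logits_aug, label)
    l2 = sum((a - b) ** 2 for a, b in zip(logits_orig, logits_aug))
    return ce + lam * l2


if __name__ == "__main__":
    # Made-up logits for a two-class example; class 0 is the true label.
    print(pairwise_l2_loss([2.0, 0.0], [1.0, 0.0], label=0, lam=1.0))
```

In this sketch the penalty vanishes when both inputs produce identical outputs, so only disagreement between the original and the transformed sample is penalized.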
Pages: 19