Robust scientific text classification using prompt tuning based on data augmentation with L2 regularization

Times cited: 7
Authors
Shi, Shijun [1]
Hu, Kai [1]
Xie, Jie [2, 3]
Guo, Ya [1]
Wu, Huayi [4]
Affiliations
[1] Jiangnan Univ, Key Lab Adv Proc Control Light Ind, Minist Educ, Wuxi 214122, Peoples R China
[2] Nanjing Normal Univ, Sch Comp & Elect Informat, Nanjing 210023, Peoples R China
[3] Nanjing Normal Univ, Sch Artificial Intelligence, Nanjing 210023, Peoples R China
[4] Wuhan Univ, State Key Lab Informat Engn Surveying Mapping & Re, Wuhan 430079, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Scientific text classification; Pre-training model; Prompt tuning; Data augmentation; Pairwise training; L2 regularization;
DOI
10.1016/j.ipm.2023.103531
Chinese Library Classification
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
Recently, prompt tuning, which incorporates prompts into the input of a pre-trained language model (such as BERT or GPT), has shown promise in improving language model performance when annotated data are limited. However, semantically equivalent templates do not necessarily produce equivalent results, and prompt tuning often exhibits unstable performance, a problem that is more severe in the scientific domain. To address this challenge, we propose to enhance prompt tuning using data augmentation with L2 regularization. Specifically, pairwise training is performed on each pair of original and transformed data. Our experiments on two scientific text datasets (ACL-ARC and SciCite) demonstrate that the proposed method significantly improves both accuracy and robustness. Using 1000 of the 1688 samples in the ACL-ARC training set, our method achieved an F1 score 3.33% higher than the same model trained on all 1688 samples. On the SciCite dataset, our method surpassed the same model with the labeled data reduced by over 93%. Our method also proves highly robust, reaching F1 scores 1% to 8% higher than those of models without our method after the Probability Weighted Word Saliency attack.
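As a rough illustration of pairwise training on original and transformed data, the sketch below combines a classification loss on both inputs with a squared L2 penalty on the gap between their predicted distributions. This is only one plausible reading: the abstract does not state where the L2 term is applied, and the function names, the weight `lam`, and the choice to penalize output probabilities are all assumptions made for illustration, not the authors' formulation.

```python
import math

def softmax(logits):
    """Convert raw scores to probabilities (numerically stable)."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]

def cross_entropy(probs, label):
    """Negative log-likelihood of the gold label."""
    return -math.log(probs[label])

def pairwise_l2_loss(logits_orig, logits_aug, label, lam=1.0):
    """Hypothetical pairwise objective (an assumption, not the paper's loss):
    cross-entropy on the original and the augmented input, plus lam times
    the squared L2 distance between the two predicted distributions."""
    p_orig = softmax(logits_orig)
    p_aug = softmax(logits_aug)
    ce = cross_entropy(p_orig, label) + cross_entropy(p_aug, label)
    l2 = sum((a - b) ** 2 for a, b in zip(p_orig, p_aug))
    return ce + lam * l2

# Example: model scores for one sample and its augmented counterpart.
loss = pairwise_l2_loss([2.0, 0.5, -1.0], [1.5, 0.8, -0.7], label=0, lam=1.0)
```

Under this reading, the penalty vanishes when the model predicts the same distribution for both members of a pair, so it pushes predictions to stay stable under the transformation.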
Pages: 19