Rule-based adversarial sample generation for text classification

Authors
Nai Zhou
Nianmin Yao
Jian Zhao
Yanan Zhang
Affiliations
[1] Dalian University of Technology, School of Computer Science
[2] Dalian University of Technology, School of Automotive Engineering
[3] Automotive Data of China Co. Ltd
[4] Ningbo Institute of Dalian University of Technology
Keywords
Adversarial examples; Text classification; Rule-based generator; Sentence matrix representation
DOI
Not available
Abstract
Modern neural networks achieve strong performance in text classification, but they are sensitive to adversarial examples. Existing studies usually generate adversarial examples with synonym replacement or token insertion strategies. These strategies focus on producing semantically similar adversarial examples, but they neglect the richness (diversity) of the generated examples. To expand the richness of adversarial samples, we propose a simple Rule-based Adversarial sample Generator (RAG), which generates adversarial samples by controlling the size of the perturbation added to the sentence matrix representation. Concretely, we introduce two methods to control the size of the added perturbation: i) controlling the number of word replacements in a sentence (RAG(R)); ii) controlling the size of the offset value added to the sentence matrix representation (RAG(A)). With RAG, we obtain numerous adversarial samples that make the model more robust to adversarial noise, thereby improving its generalization ability. Compared with the BERT and BiLSTM baselines, experiments show that our method reduces the error rate by an average of 18% on four standard training datasets. In low-training-data scenarios in particular, the overall average accuracy increases by 12%. Extensive experimental results demonstrate that our method achieves excellent classification performance on the standard training datasets and also performs strongly on few-shot text classification.
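The two perturbation controls named in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not say how replacement words are chosen or how the offset is drawn, so the random choice from a candidate pool, the uniform offset in [-epsilon, epsilon], and the function names `perturb_by_replacement` and `perturb_by_offset` are all assumptions made for illustration. A sentence is represented as a list of word vectors (one row per word).

```python
import random


def perturb_by_replacement(matrix, candidate_vectors, num_replacements, rng):
    """RAG(R)-style sketch: replace a controlled number of rows (word vectors).

    Assumption: replacements are drawn at random from `candidate_vectors`;
    the abstract does not specify the selection rule.
    """
    n = len(matrix)
    k = min(num_replacements, n)           # never replace more words than exist
    positions = rng.sample(range(n), k)    # distinct word positions to replace
    perturbed = [row[:] for row in matrix]  # copy so the input is not mutated
    for pos in positions:
        perturbed[pos] = list(rng.choice(candidate_vectors))
    return perturbed, sorted(positions)


def perturb_by_offset(matrix, epsilon, rng):
    """RAG(A)-style sketch: add an offset of bounded size to every entry.

    Assumption: the offset is uniform in [-epsilon, epsilon].
    """
    return [[x + rng.uniform(-epsilon, epsilon) for x in row] for row in matrix]


rng = random.Random(0)
sentence = [[0.1, 0.2], [0.3, 0.4], [0.5, 0.6]]  # 3 words, 2-dim vectors
candidates = [[1.0, 1.0], [-1.0, -1.0]]          # hypothetical replacement pool

replaced, positions = perturb_by_replacement(sentence, candidates, 1, rng)
shifted = perturb_by_offset(sentence, 0.05, rng)
```

In this sketch, `num_replacements` and `epsilon` play the role of the perturbation-size controls: larger values yield samples further from the original sentence matrix, and each original sentence can produce many distinct perturbed copies for training.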
Pages: 10575-10586
Page count: 11