A semi-supervised method to generate a persian dataset for suggestion classification

被引:0
|
作者
Safari, Leila [1 ]
Mohammady, Zanyar [1 ]
机构
[1] Univ Zanjan, Dept Comp Engn, Zanjan 4537138791, Iran
关键词
Automatic classification of suggestions; Annotator; Neural networks; Pre-trained language model; Transformers;
D O I
10.1007/s10579-023-09688-7
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Suggestion mining has become a popular subject in the field of natural language processing (NLP) that is useful in areas like a service/product improvement. The purpose of this study is to provide an automated machine learning (ML) based approach to extract suggestions from Persian text. In this research, first, a novel two-step semi-supervised method has been proposed to generate a Persian dataset called ParsSugg, which is then used in the automatic classification of the user's suggestions. The first step is manual labeling of data based on a proposed guideline, followed by a data augmentation phase. In the second step, using pre-trained Persian Bidirectional Encoder Representations from Transformers (ParsBERT) as a classifier and the data from the previous step, more data were labeled. The performance of various ML models, including Support Vector Machine (SVM), Random Forest (RF), Convolutional Neural Networks (CNN), Long Short Term Memory (LSTM), and the ParsBERT language model has been examined on the generated dataset. The F-score value of 97.27 for ParsBERT and about 94.5 for SVM and CNN classifiers were obtained for the suggestion class which is a promising result as the first research on suggestion classification on Persian texts. Also, the proposed guideline can be used for other NLP tasks, and the generated dataset can be used in other suggestion classification tasks.
引用
收藏
页码:839 / 858
页数:20
相关论文
共 50 条
  • [21] The incremental image classification method based on semi-supervised learning
    Wu, Weiwen
    Wang, Zhiyan
    Liang, Peng
    Xu, Xiaowei
    International Journal of Digital Content Technology and its Applications, 2012, 6 (19) : 305 - 314
  • [22] Semi-Supervised Modulation Classification via an Ensemble SigMatch Method
    Wang, Hanlin
    Yang, Shuyuan
    Feng, Zhixi
    Huang, Bincheng
    IEEE INTERNET OF THINGS JOURNAL, 2024, 11 (20): : 32985 - 32997
  • [23] Semi-supervised time series classification method for quantum computing
    Yarkoni, Sheir
    Kleshchonok, Andrii
    Dzerin, Yury
    Neukart, Florian
    Hilbert, Marc
    QUANTUM MACHINE INTELLIGENCE, 2021, 3 (01)
  • [24] A Parameter Estimation Method for Graph Based Semi-Supervised Classification
    Xu Jiazhen
    Chen Xinmeng
    Zhou Zheng
    Huang Xuejuan
    2008 4TH INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS, NETWORKING AND MOBILE COMPUTING, VOLS 1-31, 2008, : 11105 - 11108
  • [25] Fault Classification in Transmission Network with Semi-supervised Learning Method
    Li, Siyan
    Zhang, Yuhang
    Yang, Fan
    Qiu, Robert C.
    2021 IEEE PES INNOVATIVE SMART GRID TECHNOLOGY EUROPE (ISGT EUROPE 2021), 2021, : 345 - 350
  • [26] A Semi-supervised Classification Method of Parasites Using Contrastive Learning
    Ren, Yanni
    Jiang, Hao
    Zhu, Huilin
    Tian, Yanling
    Hu, Jinglu
    IEEJ TRANSACTIONS ON ELECTRICAL AND ELECTRONIC ENGINEERING, 2022, 17 (03) : 445 - 453
  • [27] Semi-supervised classification method for hyperspectral remote sensing images
    Gomez-Chova, L
    Calpe, J
    Camps-Valls, G
    Martín, JD
    Soria, E
    Vila, J
    Alonso-Chorda, L
    Moreno, J
    IGARSS 2003: IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, VOLS I - VII, PROCEEDINGS: LEARNING FROM EARTH'S SHAPES AND SIZES, 2003, : 1776 - 1778
  • [28] Active deep learning method for semi-supervised sentiment classification
    Zhou, Shusen
    Chen, Qingcai
    Wang, Xiaolong
    NEUROCOMPUTING, 2013, 120 : 536 - 546
  • [29] Text Classification Method Based On Semi-Supervised Transfer Learning
    Yu, Xiaosheng
    Zhang, Hehuan
    Li, Jing
    2021 21ST INTERNATIONAL CONFERENCE ON SOFTWARE QUALITY, RELIABILITY AND SECURITY COMPANION (QRS-C 2021), 2021, : 388 - 394
  • [30] A semi-supervised classification method based on transduction of labeled data
    Sun, SL
    Zhang, CS
    Lu, NJ
    Xiao, F
    2004 IEEE CONFERENCE ON CYBERNETICS AND INTELLIGENT SYSTEMS, VOLS 1 AND 2, 2004, : 1128 - 1132