Zero-sample text classification algorithm based on BERT and graph convolutional neural network

被引:0
|
作者
Qiao Y. [1 ]
Li Y. [1 ]
Zhou L. [2 ]
Shang X. [2 ]
机构
[1] School of Computer Science and Software Engineering, Southwest Petroleum University, Sichuan, Chengdu
[2] PetroChina Changqing Oilfield Company Oil Production Plant NO.7, Shaanxi, Xi'an
关键词
Attention mechanism; Baseline model; BERT model; Graph convolutional neural network; Text classification;
D O I
10.2478/amns-2024-1560
中图分类号
学科分类号
摘要
In this study, we undertake a comprehensive examination of zero-shot text classification and its associated implications. We propose the adoption of the BERT model as a method for text feature representation. Subsequently, we utilize the Pointwise Mutual Information (PMI) metric to adjust the weight values within a graph convolutional neural network, thereby facilitating the construction of a text graph. Additionally, we incorporate an attention mechanism to transform this text graph, enabling it to represent the output labels of zero-shot text classification effectively. The experimental environment is set up, and the comparison and ablation experiments of the text classification model based on BERT and graph convolutional neural network with the baseline models are carried out in several different types of datasets, and the parameter settings of λ are adjusted according to the experimental results, and the convergence of the BERT model is compared to test the robustness of the model performance and the classification effect. When λ was set to 0.60, the model achieved the best results in each dataset. When the task is set to 5-way-5-shot, the convergence rate of the model for the Snippets dataset using the penultimate layer of features can reach 74%-80% of the training accuracy at the 5,000th step. The training accuracy gradually flattens out in the first 10,000 steps, and the model achieves classification accuracy in all four learning scenarios, with good stability. © 2024 Ying Qiao et al., published by Sciendo.
引用
收藏
相关论文
共 50 条
  • [31] A Quantum Spatial Graph Convolutional Network for Text Classification
    Shah, Syed Mustajar Ahmad
    Ge, Hongwei
    Haider, Sami Ahmed
    Irshad, Muhammad
    Noman, Sohail M.
    Arshad, Jehangir
    Ahmad, Asfandeyar
    Younas, Talha
    COMPUTER SYSTEMS SCIENCE AND ENGINEERING, 2021, 36 (02): : 369 - 382
  • [32] Circulant Tensor Graph Convolutional Network for Text Classification
    Xu, Xuran
    Zhang, Tong
    Xu, Chunyan
    Cui, Zhen
    Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2022, 13188 LNCS : 32 - 46
  • [33] Topic-aware cosine graph convolutional neural network for short text classification
    Min C.
    Chu Y.
    Lin H.
    Wang B.
    Yang L.
    Xu B.
    Soft Computing, 2024, 28 (13-14) : 8119 - 8132
  • [34] A Convolutional Neural Network and Graph Convolutional Network Based Framework for Classification of Breast Histopathological Images
    Gao, Zhiyang
    Lu, Zhiyang
    Wang, Jun
    Ying, Shihui
    Shi, Jun
    IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2022, 26 (07) : 3163 - 3173
  • [35] Text Sentiment Analysis based on BERT and Convolutional Neural Networks
    Huang, P.
    Zhu, H. J.
    Zheng, L.
    Wang, Y.
    2021 5TH INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE PROCESSING AND INFORMATION RETRIEVAL, NLPIR 2021, 2021, : 1 - 7
  • [36] Hyperspectral Image Classification Based on Fusion of Convolutional Neural Network and Graph Network
    Gao, Luyao
    Xiao, Shulin
    Hu, Changhong
    Yan, Yang
    APPLIED SCIENCES-BASEL, 2023, 13 (12):
  • [37] Review of Graph Neural Network in Text Classification
    Malekzadeh, Masoud
    Hajibabaee, Parisa
    Heidari, Maryam
    Zad, Samira
    Uzuner, Ozlem
    Jones, James H. Jr Jr
    2021 IEEE 12TH ANNUAL UBIQUITOUS COMPUTING, ELECTRONICS & MOBILE COMMUNICATION CONFERENCE (UEMCON), 2021, : 84 - 91
  • [38] A deep graph convolutional neural network architecture for graph classification
    Zhou, Yuchen
    Huo, Hongtao
    Hou, Zhiwen
    Bu, Fanliang
    PLOS ONE, 2023, 18 (03):
  • [39] Text Feature Extraction and Classification Based on Convolutional Neural Network (CNN)
    Zhang, Taohong
    Li, Cunfang
    Cao, Nuan
    Ma, Rui
    Zhang, ShaoHua
    Ma, Nan
    DATA SCIENCE, PT 1, 2017, 727 : 472 - 485
  • [40] A morpheme sequence and convolutional neural network based Kazakh text classification
    Parhat, Sardar
    Ting, Gao
    Ablimit, Mijit
    Hamdulla, Askar
    2019 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2019, : 1903 - 1906