Hierarchical text classification using CNNs with local approaches

被引:0
|
作者
Krendzelak M. [1 ]
Jakab F. [1 ]
机构
[1] Technical University of Košice, Faculty of Electrical Engineering and Informatics, Department of Computers and Informatics, Letná 9, Košice
关键词
Convolutional neural network; Hierarchical text classification; Local topdown approach;
D O I
10.31577/CAI_2020_5_907
中图分类号
学科分类号
摘要
In this paper, we discuss the application of convolutional neural networks (CNNs) for hierarchical text classification using local top-down approaches. We present experimental results implementing a local classification per node approach, a local classification per parent node approach, and a local classification per level approach. A 20Newsgroup hierarchical training dataset with more than 20 categories and three hierarchical levels was used to train the models. The experiments involved several variations of hyperparameters settings such as batch size, embedding size, and number of available examples from the training dataset, including two variation of CNN model text embedding such as static (stat) and random (rand). The results demonstrated that our proposed use of CNNs outperformed at CNN baseline model and both the at and hierarchical support vector machine (SVM) and logistic regression (LR) baseline models. In particular, hierarchical text classification with CNN-stat models using local per parent node and local per level approaches achieved compelling results and outperformed the former and latter state-of-the-art models. However, using CNN with local per node approach for hierarchical text classification underperformed and achieved worse results. Furthermore, we performed a detailed comparison between the proposed hierarchical local approaches with CNNs. The results indicated that the hierarchical local classification per level approach using the CNN model with static text embedding achieved the best results, surpassing the at SVM and LR baseline models by 7% and 13 %, surpassing the at CNN baseline by 5 %, and surpassing the h-SVM and h-LR models by 5% and 10 %, respectively. © 2021 Slovak Academy of Sciences. All rights reserved.
引用
收藏
页码:907 / 924
页数:17
相关论文
共 50 条
  • [21] Exploring Hierarchical Multi-Label Text Classification Models using Attention-Based Approaches for Vietnamese language
    Lam, Van
    Quach, Khoi
    Nguyen, Long
    Dinh, Dien
    PROCEEDINGS OF 2023 7TH INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE PROCESSING AND INFORMATION RETRIEVAL, NLPIR 2023, 2023, : 38 - 43
  • [22] Feature Selection for Text Classification Using Machine Learning Approaches
    Thirumoorthy, K.
    Muneeswaran, K.
    NATIONAL ACADEMY SCIENCE LETTERS-INDIA, 2022, 45 (01): : 51 - 56
  • [23] Feature Selection for Text Classification Using Machine Learning Approaches
    K. Thirumoorthy
    K. Muneeswaran
    National Academy Science Letters, 2022, 45 : 51 - 56
  • [24] Hierarchical Label Generation for Text Classification
    Kwon, Jingun
    Kamigaito, Hidetaka
    Song, Young-In
    Okumura, Manabu
    17TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EACL 2023, 2023, : 625 - 632
  • [25] Naive approach for hierarchical text classification
    Wang, Mingwen
    Lu, Xu
    Zhang, Huawei
    Luo, Yuansheng
    Journal of Computational Information Systems, 2007, 3 (04): : 1591 - 1598
  • [26] Hierarchical text classification methods and their specification
    Sun, AX
    Lim, EP
    Ng, WK
    COOPERATIVE INTERNET COMPUTING, 2003, 729 : 236 - 256
  • [27] Context Recognition for Hierarchical Text Classification
    Liu, Rey-Long
    JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE AND TECHNOLOGY, 2009, 60 (04): : 803 - 813
  • [28] Hierarchical Interpretation of Neural Text Classification
    Yan, Hanqi
    Gui, Lin
    He, Yulan
    COMPUTATIONAL LINGUISTICS, 2022, 48 (04) : 987 - 1020
  • [29] Hierarchical Text Classification Incremental Learning
    Song, Shengli
    Qiao, Xiaofei
    Chen, Ping
    NEURAL INFORMATION PROCESSING, PT 1, PROCEEDINGS, 2009, 5863 : 247 - 258
  • [30] A fast algorithm for hierarchical text classification
    Chuang, WT
    Tiyyagura, A
    Yang, J
    Giuffrida, G
    DATA WAREHOUSING AND KNOWLEDGE DISCOVERY, PROCEEDINGS, 2000, 1874 : 409 - 418