Exploring Hierarchical Multi-Label Text Classification Models using Attention-Based Approaches for Vietnamese language

被引:0
|
作者
Lam, Van [1 ,2 ]
Quach, Khoi [1 ,2 ]
Nguyen, Long [1 ,2 ]
Dinh, Dien [1 ,2 ]
机构
[1] Univ Sci Ho Chi Minh City, Fac Informat Technol, Ho Chi Minh City, Vietnam
[2] Vietnam Natl Univ, Ho Chi Minh City, Vietnam
来源
PROCEEDINGS OF 2023 7TH INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE PROCESSING AND INFORMATION RETRIEVAL, NLPIR 2023 | 2023年
关键词
Hierarchical Attention-based Recurrent Neural Network; Word Embedding; Vietnamese articles;
D O I
10.1145/3639233.3639244
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The Hierarchical Attention-based Recurrent Neural Network (HARNN) is a system designed to categorize documents efficiently, taking into account both the content of the texts and their hierarchical category structure. This system is comprised of three primary components: the Document Representation Layer (DRL), which is used for semantic encoding, the Hierarchical Attention-based Recurrent Layer (HARL), that models dependencies between different hierarchical levels, and the Hybrid Predicting Layer (HPL), which is responsible for accurate category predictions. In this research, we put HARNN to the test, using a dataset of Vietnamese articles from VnExpress. We then contrast the performance of four different word embeddings (Word2Vec, FastText, PhoBERT, and BERT multilingual). Additionally, we introduce a domain-based approach for the HARNN model to compare the accuracy with the original manner. Experimental findings indicate that HARNN performs effectively in the context of Vietnamese language and that our domain-based approach can be advantageous in specific domains HMTC task.
引用
收藏
页码:38 / 43
页数:6
相关论文
共 50 条
  • [31] Label-Related Adaptive Graph Construction Based on Attention for Multi-label Text Classification
    Zhou, Xiwen
    Xie, Xiaopeng
    Zhao, Chenlong
    Yao, Lei
    Li, Zhaoxia
    Zhang, Yong
    ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, PT IV, ICIC 2024, 2024, 14878 : 197 - 208
  • [32] Multi-label Text Classification of German Language Medical Documents
    Spat, Stephan
    Cadonna, Bruno
    Rakovac, Ivo
    Guetl, Christian
    Leitner, Hubert
    Stark, Guenther
    Beck, Peter
    MEDINFO 2007: PROCEEDINGS OF THE 12TH WORLD CONGRESS ON HEALTH (MEDICAL) INFORMATICS, PTS 1 AND 2: BUILDING SUSTAINABLE HEALTH SYSTEMS, 2007, 129 : 1460 - +
  • [33] Multi-Label Text Classification Based on DistilBERT and Label Correlation
    Wang, Xuyang
    Geng, Liuqing
    Zhang, Xin
    Computer Engineering and Applications, 2024, 60 (23) : 168 - 175
  • [34] HMATC: Hierarchical multi-label Arabic text classification model using machine learning
    Aljedani, Nawal
    Alotaibi, Reem
    Taileb, Mounira
    EGYPTIAN INFORMATICS JOURNAL, 2021, 22 (03) : 225 - 237
  • [35] Multi-Label Text Classification Model Based on Multi-Level Constraint Augmentation and Label Association Attention
    Wei, Xiao
    Huang, Jianbao
    Zhao, Rui
    Yu, Hang
    Xu, Zheng
    ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2024, 23 (01)
  • [36] Large Scale Multi-label Text Classification of a Hierarchical Dataset using Rocchio algorithm
    Sowmya, B. J.
    Chetan
    Srinivasa, K. G.
    2016 INTERNATIONAL CONFERENCE ON COMPUTATION SYSTEM AND INFORMATION TECHNOLOGY FOR SUSTAINABLE SOLUTIONS (CSITSS), 2016, : 291 - 296
  • [37] An R-Transformer_BiLSTM Model Based on Attention for Multi-label Text Classification
    Yan, Yaoyao
    Liu, Fang'ai
    Zhuang, Xuqiang
    Ju, Jie
    NEURAL PROCESSING LETTERS, 2023, 55 (02) : 1293 - 1316
  • [38] An R-Transformer_BiLSTM Model Based on Attention for Multi-label Text Classification
    Yaoyao Yan
    Fang’ai Liu
    Xuqiang Zhuang
    Jie Ju
    Neural Processing Letters, 2023, 55 : 1293 - 1316
  • [39] Cognitive structure learning model for hierarchical multi-label text classification
    Wang, Boyan
    Hu, Xuegang
    Li, Peipei
    Yu, Philip S.
    KNOWLEDGE-BASED SYSTEMS, 2021, 218
  • [40] Hierarchical Sequence-to-Sequence Model for Multi-Label Text Classification
    Yang, Zhenyu
    Liu, Guojing
    IEEE ACCESS, 2019, 7 : 153012 - 153020