Exploring Hierarchical Multi-Label Text Classification Models using Attention-Based Approaches for Vietnamese language

Cited by: 0
Authors
Lam, Van [1 ,2 ]
Quach, Khoi [1 ,2 ]
Nguyen, Long [1 ,2 ]
Dinh, Dien [1 ,2 ]
Affiliations
[1] Univ Sci Ho Chi Minh City, Fac Informat Technol, Ho Chi Minh City, Vietnam
[2] Vietnam Natl Univ, Ho Chi Minh City, Vietnam
Source
PROCEEDINGS OF 2023 7TH INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE PROCESSING AND INFORMATION RETRIEVAL, NLPIR 2023 | 2023
Keywords
Hierarchical Attention-based Recurrent Neural Network; Word Embedding; Vietnamese articles;
DOI
10.1145/3639233.3639244
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
The Hierarchical Attention-based Recurrent Neural Network (HARNN) is a system designed to categorize documents efficiently, taking into account both the content of the texts and their hierarchical category structure. It comprises three primary components: the Document Representation Layer (DRL), which performs semantic encoding; the Hierarchical Attention-based Recurrent Layer (HARL), which models dependencies between hierarchical levels; and the Hybrid Predicting Layer (HPL), which is responsible for accurate category predictions. In this research, we evaluate HARNN on a dataset of Vietnamese articles from VnExpress and compare the performance of four word embeddings (Word2Vec, FastText, PhoBERT, and multilingual BERT). Additionally, we introduce a domain-based approach for the HARNN model and compare its accuracy with that of the original approach. Experimental findings indicate that HARNN performs effectively on Vietnamese text and that our domain-based approach can be advantageous for domain-specific hierarchical multi-label text classification (HMTC) tasks.
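As a rough illustration of what a hybrid predicting step could look like, the sketch below blends per-level ("local") category scores with a single flat ("global") score vector and then thresholds the result. This is only a minimal sketch under stated assumptions: the abstract does not give the combination rule, so the weighted average, the `alpha` weight, the 0.5 threshold, the function names, and the example category names are all hypothetical and not taken from this record.

```python
from typing import List


def hybrid_predict(
    local_scores: List[List[float]],
    global_scores: List[float],
    alpha: float = 0.5,
) -> List[float]:
    """Blend per-level scores with flat scores (assumed weighted average)."""
    # Flatten the per-level scores into one vector, level by level.
    flat_local = [score for level in local_scores for score in level]
    if len(flat_local) != len(global_scores):
        raise ValueError("local and global scores must cover the same categories")
    # alpha weights the global view; (1 - alpha) weights the local view.
    return [(1 - alpha) * l + alpha * g for l, g in zip(flat_local, global_scores)]


def select_labels(
    scores: List[float], labels: List[str], threshold: float = 0.5
) -> List[str]:
    """Keep every category whose blended score reaches the threshold."""
    return [label for label, score in zip(labels, scores) if score >= threshold]


# Hypothetical two-level hierarchy: level 1 categories, then level 2 categories.
labels = ["Sports", "Business", "Football", "Stocks"]
local = [[0.9, 0.2], [0.8, 0.1]]   # one score list per hierarchy level
global_ = [0.7, 0.4, 0.6, 0.3]     # one score per category, all levels at once

blended = hybrid_predict(local, global_, alpha=0.5)
predicted = select_labels(blended, labels)
print(predicted)
```

Because the task is multi-label, thresholding each category independently lets a document receive labels at several levels of the hierarchy at once, rather than a single best category.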
Pages: 38-43
Number of pages: 6
Related papers
50 in total
  • [1] Hierarchical Multi-label Text Classification: An Attention-based Recurrent Network Approach
    Huang, Wei
    Chen, Enhong
    Liu, Qi
    Chen, Yuying
    Huang, Zai
    Liu, Yang
    Zhao, Zhou
    Zhang, Dan
    Wang, Shijin
    PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT (CIKM '19), 2019, : 1051 - 1060
  • [2] MAGNET: Multi-Label Text Classification using Attention-based Graph Neural Network
    Pal, Ankit
    Selvakumar, Muru
    Sankarasubbu, Malaikannan
    ICAART: PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON AGENTS AND ARTIFICIAL INTELLIGENCE, VOL 2, 2020, : 494 - 505
  • [3] InPHYNet: Leveraging attention-based multitask recurrent networks for multi-label physics text classification
    Udandarao, Vishaal
    Agarwal, Abhishek
    Gupta, Anubha
    Chakraborty, Tanmoy
    KNOWLEDGE-BASED SYSTEMS, 2021, 211
  • [4] Multi-label text classification using multinomial models
    Vilar, D
    Castro, MJ
    Sanchis, E
    ADVANCES IN NATURAL LANGUAGE PROCESSING, 2004, 3230 : 220 - 230
  • [5] All is attention for multi-label text classification
    Liu, Zhi
    Huang, Yunjie
    Xia, Xincheng
    Zhang, Yihao
    KNOWLEDGE AND INFORMATION SYSTEMS, 2025, 67 (02) : 1249 - 1270
  • [6] On Exploring Attention-based Explanation for Transformer Models in Text Classification
    Liu, Shengzhong
    Le, Franck
    Chakraborty, Supriyo
    Abdelzaher, Tarek
    2021 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2021, : 1193 - 1203
  • [7] LA-HCN: Label-based Attention for Hierarchical Multi-label Text Classification Neural Network
    Zhang, Xinyi
    Xu, Jiahao
    Soh, Charlie
    Chen, Lihui
    EXPERT SYSTEMS WITH APPLICATIONS, 2022, 187
  • [8] Multi-label Aspect Classification on Question-Answering Text with Contextualized Attention-Based Neural Network
    Wu, Hanqian
    Zhang, Shangbin
    Wang, Jingjing
    Liu, Mumu
    Li, Shoushan
    CHINESE COMPUTATIONAL LINGUISTICS, CCL 2019, 2019, 11856 : 479 - 491
  • [9] Research of multi-label text classification based on label attention and correlation networks
    Yuan, Ling
    Xu, Xinyi
    Sun, Ping
    Yu, Hai ping
    Wei, Yin Zhen
    Zhou, Jun jie
    PLOS ONE, 2024, 19 (09):
  • [10] A Label-Specific Attention-Based Network with Regularized Loss for Multi-label Classification
    Luo, Xiangyang
    Ran, Xiangying
    Sun, Wei
    Xu, Yunlai
    Wang, Chongjun
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2019: DEEP LEARNING, PT II, 2019, 11728 : 731 - 742