Topic-aware hierarchical multi-attention network for text classification

被引:4
|
作者
Jiang, Ye [1 ]
Wang, Yimin [1 ]
机构
[1] Qingdao Univ Sci & Technol, Sch Informat Sci & Technol, Qingdao, Peoples R China
关键词
Text classification; Topic model; Attention mechanism; Natural language processing; LDA;
D O I
10.1007/s13042-022-01734-0
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Neural networks, primarily recurrent and convolutional Neural networks, have been proven successful in text classification. However, convolutional models could be limited when classification tasks are determined by long-range semantic dependency. While the recurrent ones can capture long-range dependency, the sequential architecture of which could constrain the training speed. Meanwhile, traditional networks encode the entire document in a single pass, which omits the hierarchical structure of the document. To address the above issues, this study presents T-HMAN, a Topic-aware Hierarchical Multiple Attention Network for text classification. A multi-head self-attention coupled with convolutional filters is developed to capture long-range dependency via integrating the convolution features from each attention head. Meanwhile, T-HMAN combines topic distributions generated by Latent Dirichlet Allocation (LDA) with sentence-level and document-level inputs respectively in a hierarchical architecture. The proposed model surpasses the accuracies of the current state-of-the-art hierarchical models on five publicly accessible datasets. The ablation study demonstrates that the involvement of multiple attention mechanisms brings significant improvement. The current topic distributions are fixed vectors generated by LDA, the topic distributions will be parameterized and updated simultaneously with the model weights in future work.
引用
收藏
页码:1863 / 1875
页数:13
相关论文
共 50 条
  • [21] Topic-Aware Multi-turn Dialogue Modeling
    Xu, Yi
    Zhao, Hai
    Zhang, Zhuosheng
    THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 14176 - 14184
  • [22] Text-Aware Recommendation Model Based on Multi-attention Neural Networks
    Qiu, Gang
    Yu, Xiaoli
    Jiang, Liping
    Ma, Baoying
    KNOWLEDGE SCIENCE, ENGINEERING AND MANAGEMENT, PT I, 2021, 12815 : 590 - 603
  • [23] Multi-attention aggregation network for remote sensing scene classification
    Wang, Xin
    Li, Yingying
    Shi, Aiye
    Zhou, Huiyu
    JOURNAL OF APPLIED REMOTE SENSING, 2023, 17 (04)
  • [24] T-BERTSum: Topic-Aware Text Summarization Based on BERT
    Ma, Tinghuai
    Pan, Qian
    Rong, Huan
    Qian, Yurong
    Tian, Yuan
    Al-Nabhan, Najla
    IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS, 2022, 9 (03): : 879 - 890
  • [25] Aspect-based sentiment classification with multi-attention network
    Xu, Qiannan
    Zhu, Li
    Dai, Tao
    Yan, Chengbing
    NEUROCOMPUTING, 2020, 388 : 135 - 143
  • [26] Multi-Attention Ghost Residual Fusion Network for Image Classification
    Jia, Xiaofen
    Du, Shengjie
    Guo, Yongcun
    Huang, Yourui
    Zhao, Baiting
    IEEE ACCESS, 2021, 9 : 81421 - 81431
  • [27] Hierarchical multi-label text classification of tourism resources using a label-aware dual graph attention network
    Cheng, Quan
    Shi, Wenwan
    INFORMATION PROCESSING & MANAGEMENT, 2025, 62 (01)
  • [28] Multi-attention mechanism based on gate recurrent unit for English text classification
    Liu, Haiying
    EAI ENDORSED TRANSACTIONS ON SCALABLE INFORMATION SYSTEMS, 2022, 9 (04):
  • [29] Dissecting Twitter Discussion Threads with Topic-aware Network Visualization
    Babvey, Pouria
    Lipizzi, Carlo
    Ramirez-Marquez, Jose Emmanuel
    2019 6TH INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE AND COMPUTATIONAL INTELLIGENCE (CSCI 2019), 2019, : 1359 - 1364
  • [30] Topic-Aware Physical Activity Propagation in a Health Social Network
    Phan, Nhathai
    Ebrahimi, Javid
    Kil, Dave
    Piniewski, Brigitte
    Dou, Dejing
    IEEE INTELLIGENT SYSTEMS, 2016, 31 (01) : 5 - 14