Topic-aware hierarchical multi-attention network for text classification

被引:4
|
作者
Jiang, Ye [1 ]
Wang, Yimin [1 ]
机构
[1] Qingdao Univ Sci & Technol, Sch Informat Sci & Technol, Qingdao, Peoples R China
关键词
Text classification; Topic model; Attention mechanism; Natural language processing; LDA;
D O I
10.1007/s13042-022-01734-0
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Neural networks, primarily recurrent and convolutional Neural networks, have been proven successful in text classification. However, convolutional models could be limited when classification tasks are determined by long-range semantic dependency. While the recurrent ones can capture long-range dependency, the sequential architecture of which could constrain the training speed. Meanwhile, traditional networks encode the entire document in a single pass, which omits the hierarchical structure of the document. To address the above issues, this study presents T-HMAN, a Topic-aware Hierarchical Multiple Attention Network for text classification. A multi-head self-attention coupled with convolutional filters is developed to capture long-range dependency via integrating the convolution features from each attention head. Meanwhile, T-HMAN combines topic distributions generated by Latent Dirichlet Allocation (LDA) with sentence-level and document-level inputs respectively in a hierarchical architecture. The proposed model surpasses the accuracies of the current state-of-the-art hierarchical models on five publicly accessible datasets. The ablation study demonstrates that the involvement of multiple attention mechanisms brings significant improvement. The current topic distributions are fixed vectors generated by LDA, the topic distributions will be parameterized and updated simultaneously with the model weights in future work.
引用
收藏
页码:1863 / 1875
页数:13
相关论文
共 50 条
  • [41] Multi-Attention Network for Sentiment Analysis
    Du, Tingting
    Huang, Yunyin
    Wu, Xian
    Chang, Huiyou
    PROCEEDINGS OF THE 2018 2ND INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE PROCESSING AND INFORMATION RETRIEVAL (NLPIR 2018), 2018, : 49 - 54
  • [42] Multi-Attention Network for Stereo Matching
    Yang, Xiaowei
    He, Lin
    Zhao, Yong
    Sang, Haiwei
    Yang, Zuliu
    Cheng, Xianjing
    IEEE ACCESS, 2020, 8 : 113371 - 113382
  • [43] Hierarchical Multi-Granularity Attention- Based Hybrid Neural Network for Text Classification
    Liu Z.
    Lu C.
    Huang H.
    Lyu S.
    Tao Z.
    IEEE Access, 2020, 8 : 149362 - 149371
  • [44] Hierarchical Multi-label Text Classification: An Attention-based Recurrent Network Approach
    Huang, Wei
    Chen, Enhong
    Liu, Qi
    Chen, Yuying
    Huang, Zai
    Liu, Yang
    Zhao, Zhou
    Zhang, Dan
    Wang, Shijin
    PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT (CIKM '19), 2019, : 1051 - 1060
  • [45] Feature Fusion Text Classification Model Combining CNN and BiGRU with Multi-Attention Mechanism
    Zhang, Jingren
    Liu, Fang'ai
    Xu, Weizhi
    Yu, Hui
    FUTURE INTERNET, 2019, 11 (11):
  • [46] TOPIC-AWARE DIALOGUE GENERATION WITH TWO-HOP BASED GRAPH ATTENTION
    Zhou, Shijie
    Rong, Wenge
    Zhang, Jianfei
    Wang, Yanmeng
    Shi, Libin
    Xiong, Zhang
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 7428 - 7432
  • [47] Multi-turn dialogue comprehension from a topic-aware perspective
    Ma, Xinbei
    Xu, Yi
    Zhao, Hai
    Zhang, Zhuosheng
    NEUROCOMPUTING, 2024, 578
  • [48] Double-Branch Multi-Attention Mechanism Network for Hyperspectral Image Classification
    Ma, Wenping
    Yang, Qifan
    Wu, Yue
    Zhao, Wei
    Zhang, Xiangrong
    REMOTE SENSING, 2019, 11 (11)
  • [49] MARec: A multi-attention aware paper recommendation method
    Wang, Jie
    Zhou, Jingya
    Wu, Zhen
    Sun, Xigang
    EXPERT SYSTEMS WITH APPLICATIONS, 2023, 232
  • [50] Multi-attention Fusion for Multimodal Sentiment Classification
    Li, Guangmin
    Zeng, Xin
    Chen, Chi
    Zhou, Long
    PROCEEDINGS OF 2024 ACM ICMR WORKSHOP ON MULTIMODAL VIDEO RETRIEVAL, ICMR-MVR 2024, 2024, : 1 - 7