Topic-aware hierarchical multi-attention network for text classification

被引:4
|
作者
Jiang, Ye [1 ]
Wang, Yimin [1 ]
机构
[1] Qingdao Univ Sci & Technol, Sch Informat Sci & Technol, Qingdao, Peoples R China
关键词
Text classification; Topic model; Attention mechanism; Natural language processing; LDA;
D O I
10.1007/s13042-022-01734-0
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Neural networks, primarily recurrent and convolutional Neural networks, have been proven successful in text classification. However, convolutional models could be limited when classification tasks are determined by long-range semantic dependency. While the recurrent ones can capture long-range dependency, the sequential architecture of which could constrain the training speed. Meanwhile, traditional networks encode the entire document in a single pass, which omits the hierarchical structure of the document. To address the above issues, this study presents T-HMAN, a Topic-aware Hierarchical Multiple Attention Network for text classification. A multi-head self-attention coupled with convolutional filters is developed to capture long-range dependency via integrating the convolution features from each attention head. Meanwhile, T-HMAN combines topic distributions generated by Latent Dirichlet Allocation (LDA) with sentence-level and document-level inputs respectively in a hierarchical architecture. The proposed model surpasses the accuracies of the current state-of-the-art hierarchical models on five publicly accessible datasets. The ablation study demonstrates that the involvement of multiple attention mechanisms brings significant improvement. The current topic distributions are fixed vectors generated by LDA, the topic distributions will be parameterized and updated simultaneously with the model weights in future work.
引用
收藏
页码:1863 / 1875
页数:13
相关论文
共 50 条
  • [31] Topic-aware Heterogeneous Graph Neural Network for Link Prediction
    Xu, Siyong
    Yang, Cheng
    Shi, Chuan
    Fang, Yuan
    Guo, Yuxin
    Yang, Tianchi
    Zhang, Luhao
    Hu, Maodi
    PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT, CIKM 2021, 2021, : 2261 - 2270
  • [32] Topic-aware Intention Network for Explainable Recommendation with Knowledge Enhancement
    Li, Qiming
    Zhang, Zhao
    Zhuang, Fuzhen
    Xu, Yongjun
    Li, Chao
    ACM TRANSACTIONS ON INFORMATION SYSTEMS, 2023, 41 (04)
  • [33] Fusion of ConvLSTM and Multi-Attention Mechanism Network for Hyperspectral Image Classification
    Tang Ting
    Xin, Pan
    Luo Xiao-ling
    Gao Xiao-jing
    SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43 (08) : 2608 - 2616
  • [34] TNERec: Topic-aware Network Embedding for Scientific Collaborator Recommendation
    Kong, Xiangjie
    Mao, Mengyi
    Liu, Jiaying
    Xu, Bo
    Huang, Runhe
    Jin, Qun
    2018 IEEE SMARTWORLD, UBIQUITOUS INTELLIGENCE & COMPUTING, ADVANCED & TRUSTED COMPUTING, SCALABLE COMPUTING & COMMUNICATIONS, CLOUD & BIG DATA COMPUTING, INTERNET OF PEOPLE AND SMART CITY INNOVATION (SMARTWORLD/SCALCOM/UIC/ATC/CBDCOM/IOP/SCI), 2018, : 1007 - 1014
  • [35] Topic-aware Masked Attentive Network for Information Cascade Prediction
    Tai, Yu
    Yang, Hongwei
    He, Hui
    Wu, Xinglong
    Shao, Yuanming
    Zhang, Weizhe
    Sangaiah, Arun Kumar
    ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2024, 23 (08)
  • [36] Multi-task Hierarchical Cross-Attention Network for Multi-label Text Classification
    Lu, Junyu
    Zhang, Hao
    Shen, Zhexu
    Shi, Kaiyuan
    Yang, Liang
    Xu, Bo
    Zhang, Shaowu
    Lin, Hongfei
    NATURAL LANGUAGE PROCESSING AND CHINESE COMPUTING, NLPCC 2022, PT II, 2022, 13552 : 156 - 167
  • [37] Hierarchical Multi-Attention Transfer for Knowledge Distillation
    Gou, Jianping
    Sun, Liyuan
    Yu, Baosheng
    Wan, Shaohua
    Tao, Dacheng
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2024, 20 (02)
  • [38] MATNet: A Combining Multi-Attention and Transformer Network for Hyperspectral Image Classification
    Zhang, Bo
    Chen, Yaxiong
    Rong, Yi
    Xiong, Shengwu
    Lu, Xiaoqiang
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61
  • [39] Comparative relation mining of online reviews: a hierarchical multi-attention network model
    Gao, Song
    Wang, Hongwei
    Liu, Jiaqi
    Zhu, Yuanjun
    Tang, Ou
    INTERNATIONAL JOURNAL OF MOBILE COMMUNICATIONS, 2023, 22 (02) : 212 - 236
  • [40] TITA: A Two-stage Interaction and Topic-Aware Text Matching Model
    Sun, Xingwu
    Cui, Yanling
    Tang, Hongyin
    Zhu, Qiuyu
    Zhang, Fuzheng
    Jin, Beihong
    2021 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL-HLT 2021), 2021, : 5431 - 5440