Topic-aware hierarchical multi-attention network for text classification

被引：4

作者：

Jiang, Ye ^{[1
]}

Wang, Yimin ^{[1
]}

机构：

[1] Qingdao Univ Sci & Technol, Sch Informat Sci & Technol, Qingdao, Peoples R China

来源：

INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS | 2023年 / 14卷 / 05期

关键词：

Text classification; Topic model; Attention mechanism; Natural language processing; LDA;

D O I：

10.1007/s13042-022-01734-0

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Neural networks, primarily recurrent and convolutional Neural networks, have been proven successful in text classification. However, convolutional models could be limited when classification tasks are determined by long-range semantic dependency. While the recurrent ones can capture long-range dependency, the sequential architecture of which could constrain the training speed. Meanwhile, traditional networks encode the entire document in a single pass, which omits the hierarchical structure of the document. To address the above issues, this study presents T-HMAN, a Topic-aware Hierarchical Multiple Attention Network for text classification. A multi-head self-attention coupled with convolutional filters is developed to capture long-range dependency via integrating the convolution features from each attention head. Meanwhile, T-HMAN combines topic distributions generated by Latent Dirichlet Allocation (LDA) with sentence-level and document-level inputs respectively in a hierarchical architecture. The proposed model surpasses the accuracies of the current state-of-the-art hierarchical models on five publicly accessible datasets. The ablation study demonstrates that the involvement of multiple attention mechanisms brings significant improvement. The current topic distributions are fixed vectors generated by LDA, the topic distributions will be parameterized and updated simultaneously with the model weights in future work.

引用

页码：1863 / 1875

页数：13

共 50 条

[21] Topic-Aware Multi-turn Dialogue Modeling
Xu, Yi
Zhao, Hai
Zhang, Zhuosheng
THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 14176 - 14184
[22] Text-Aware Recommendation Model Based on Multi-attention Neural Networks
Qiu, Gang
Yu, Xiaoli
Jiang, Liping
Ma, Baoying
KNOWLEDGE SCIENCE, ENGINEERING AND MANAGEMENT, PT I, 2021, 12815 : 590 - 603
[23] Multi-attention aggregation network for remote sensing scene classification
Wang, Xin
Li, Yingying
Shi, Aiye
Zhou, Huiyu
JOURNAL OF APPLIED REMOTE SENSING, 2023, 17 (04)
[24] T-BERTSum: Topic-Aware Text Summarization Based on BERT
Ma, Tinghuai
Pan, Qian
Rong, Huan
Qian, Yurong
Tian, Yuan
Al-Nabhan, Najla
IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS, 2022, 9 (03): : 879 - 890
[25] Aspect-based sentiment classification with multi-attention network
Xu, Qiannan
Zhu, Li
Dai, Tao
Yan, Chengbing
NEUROCOMPUTING, 2020, 388 : 135 - 143
[26] Multi-Attention Ghost Residual Fusion Network for Image Classification
Jia, Xiaofen
Du, Shengjie
Guo, Yongcun
Huang, Yourui
Zhao, Baiting
IEEE ACCESS, 2021, 9 : 81421 - 81431
[27] Hierarchical multi-label text classification of tourism resources using a label-aware dual graph attention network
Cheng, Quan
Shi, Wenwan
INFORMATION PROCESSING & MANAGEMENT, 2025, 62 (01)
[28] Multi-attention mechanism based on gate recurrent unit for English text classification
Liu, Haiying
EAI ENDORSED TRANSACTIONS ON SCALABLE INFORMATION SYSTEMS, 2022, 9 (04):
[29] Dissecting Twitter Discussion Threads with Topic-aware Network Visualization
Babvey, Pouria
Lipizzi, Carlo
Ramirez-Marquez, Jose Emmanuel
2019 6TH INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE AND COMPUTATIONAL INTELLIGENCE (CSCI 2019), 2019, : 1359 - 1364
[30] Topic-Aware Physical Activity Propagation in a Health Social Network
Phan, Nhathai
Ebrahimi, Javid
Kil, Dave
Piniewski, Brigitte
Dou, Dejing
IEEE INTELLIGENT SYSTEMS, 2016, 31 (01) : 5 - 14

← 1 2 3 4 5 →