Hierarchical Multi-Granularity Attention- Based Hybrid Neural Network for Text Classification

被引:6
|
作者
Liu Z. [1 ]
Lu C. [2 ]
Huang H. [2 ]
Lyu S. [2 ]
Tao Z. [3 ,4 ]
机构
[1] School of Information Management for Law, China University of Political Science and Law, Beijing
[2] School of Computer Science, University of Science and Technology of China, Hefei
[3] Division of Life Sciences and Medicine, First Affiliated Hospital of USTC, University of Science and Technology of China, Hefei
[4] Anhui Provincial Cancer Hospital, Hefei
关键词
Attention mechanism; convolutional neural network; multichannel; text classification;
D O I
10.1109/ACCESS.2020.3016727
中图分类号
学科分类号
摘要
Neural network-based approaches have become the driven forces for Natural Language Processing (NLP) tasks. Conventionally, there are two mainstream neural architectures for NLP tasks: the recurrent neural network (RNN) and the convolution neural network (ConvNet). RNNs are good at modeling long-term dependencies over input texts, but preclude parallel computation. ConvNets do not have memory capability and it has to model sequential data as un-ordered features. Therefore, ConvNets fail to learn sequential dependencies over the input texts, but it is able to carry out high-efficient parallel computation. As each neural architecture, such as RNN and ConvNets, has its own pro and con, integration of different architectures is assumed to be able to enrich the semantic representation of texts, thus enhance the performance of NLP tasks. However, few investigation explores the reconciliation of these seemingly incompatible architectures. To address this issue, we propose a hybrid architecture based on a novel hierarchical multi-granularity attention mechanism, named Multi-granularity Attention-based Hybrid Neural Network (MahNN). The attention mechanism is to assign different weights to different parts of the input sequence to increase the computation efficiency and performance of neural models. In MahNN, two types of attentions are introduced: the syntactical attention and the semantical attention. The syntactical attention computes the importance of the syntactic elements (such as words or sentence) at the lower symbolic level and the semantical attention is used to compute the importance of the embedded space dimension corresponding to the upper latent semantics. We adopt the text classification as an exemplifying way to illustrate the ability of MahNN to understand texts. The experimental results on a variety of datasets demonstrate that MahNN outperforms most of the state-of-the-arts for text classification. © 2013 IEEE.
引用
收藏
页码:149362 / 149371
页数:9
相关论文
共 50 条
  • [1] Multi-granularity Hierarchical Attention Siamese Network for Visual Tracking
    Chen, Xing
    Zhang, Xiang
    Tan, Huibin
    Lan, Long
    Luo, Zhigang
    Huang, Xuhui
    2018 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2018,
  • [2] Image classification based on multi-granularity convolutional Neural network model
    Wu, Xiaogang
    Tanprasert, Thitipong
    Jing, Wang
    2022 19TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER SCIENCE AND SOFTWARE ENGINEERING (JCSSE 2022), 2022,
  • [3] Deconfounded hierarchical multi-granularity classification
    Zhao, Ziyu
    Gan, Leilei
    Shen, Tao
    Kuang, Kun
    Wu, Fei
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2024, 248
  • [4] Hierarchical multi-granularity classification based on bidirectional knowledge transfer
    Jiang, Juan
    Yang, Jingmin
    Zhang, Wenjie
    Zhang, Hongbin
    MULTIMEDIA SYSTEMS, 2024, 30 (04)
  • [5] Pose-Guided Multi-Granularity Attention Network for Text-Based Person Search
    Jing, Ya
    Si, Chenyang
    Wang, Junbo
    Wang, Wei
    Wang, Liang
    Tan, Tieniu
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 11189 - 11196
  • [6] Label Relation Graphs Enhanced Hierarchical Residual Network for Hierarchical Multi-Granularity Classification
    Chen, Jingzhou
    Wang, Peng
    Liu, Jian
    Qian, Yuntao
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 4848 - 4857
  • [7] Hierarchical Multi-Granularity Interaction Graph Convolutional Network for Long Document Classification
    Liu, Tengfei
    Hu, Yongli
    Gao, Junbin
    Sun, Yanfeng
    Yin, Baocai
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2024, 32 : 1762 - 1775
  • [8] Multi-granularity sequence generation for hierarchical image classification
    Liu, Xinda
    Wang, Lili
    COMPUTATIONAL VISUAL MEDIA, 2024, 10 (02) : 243 - 260
  • [9] Multi-granularity sequence generation for hierarchical image classification
    Xinda Liu
    Lili Wang
    Computational Visual Media, 2024, 10 : 243 - 260
  • [10] A Multi-Granularity Semantic Extraction Method for Text Classification
    Li, Min
    Liu, Zeyu
    Li, Gang
    Han, Delong
    ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, PT XIII, ICIC 2024, 2024, 14874 : 224 - 236