Deep Pyramid Convolutional Neural Networks for Text Categorization

被引:499
|
作者
Johnson, Rie [1 ]
Zhang, Tong [2 ]
机构
[1] RJ Res Consulting, Tarrytown, NY 10591 USA
[2] Tencent AI Lab, Shenzhen, Peoples R China
关键词
D O I
10.18653/v1/P17-1052
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
This paper proposes a low-complexity word-level deep convolutional neural network (CNN) architecture for text categorization that can efficiently represent long-range associations in text. In the literature, several deep and complex neural networks have been proposed for this task, assuming availability of relatively large amounts of training data. However, the associated computational complexity increases as the networks go deeper, which poses serious challenges in practical applications. Moreover, it was shown recently that shallow word-level CNNs are more accurate and much faster than the state-of-the-art very deep nets such as character-level CNNs even in the setting of large training data. Motivated by these findings, we carefully studied deepening of word-level CNNs to capture global representations of text, and found a simple network architecture with which the best accuracy can be obtained by increasing the network depth without increasing computational cost by much. We call it deep pyramid CNN. The proposed model with 15 weight layers outperforms the previous best models on six benchmark datasets for sentiment classification and topic categorization.
引用
收藏
页码:562 / 570
页数:9
相关论文
共 50 条
  • [21] A hybrid method based on estimation of distribution algorithms to train convolutional neural networks for text categorization
    Grabiel Toledano-Lopez, Orlando
    Madera, Julio
    Gonzalez, Hector
    Simon-Cuevas, Alfredo
    PATTERN RECOGNITION LETTERS, 2022, 160 : 105 - 111
  • [22] Semantic Clustering and Convolutional Neural Network for Short Text Categorization
    Wang, Peng
    Xu, Jiaming
    Xu, Bo
    Liu, Cheng-Lin
    Zhang, Heng
    Wang, Fangyuan
    Hao, Hongwei
    PROCEEDINGS OF THE 53RD ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL) AND THE 7TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (IJCNLP), VOL 2, 2015, : 352 - 357
  • [23] Text normalization with convolutional neural networks
    Yolchuyeva, Sevinj
    Nemeth, Geza
    Gyires-Toth, Balint
    INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2018, 21 (03) : 589 - 600
  • [24] Convolutional Neural Networks for Text Hashing
    Xu, Jiaming
    Wang, Peng
    Tian, Guanhua
    Xu, Bo
    Zhao, Jun
    Wang, Fangyuan
    Hao, Hongwei
    PROCEEDINGS OF THE TWENTY-FOURTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE (IJCAI), 2015, : 1369 - 1375
  • [25] MapReduce-Based Convolutional Neural Network for Text Categorization
    Ferjani, Eman
    Hidri, Adel
    Sassi Hidri, Minyar
    Frihida, Ali
    COMPUTATIONAL COLLECTIVE INTELLIGENCE, PT II, 2019, 11684 : 155 - 166
  • [26] Text detection with convolutional neural networks
    Delakis, Manolis
    Garcia, Christophe
    VISAPP 2008: PROCEEDINGS OF THE THIRD INTERNATIONAL CONFERENCE ON COMPUTER VISION THEORY AND APPLICATIONS, VOL 2, 2008, : 290 - 294
  • [27] Deep Convolutional Neural Networks
    Gonzalez, Rafael C.
    IEEE SIGNAL PROCESSING MAGAZINE, 2018, 35 (06) : 79 - 87
  • [28] Ligature Recognition in Urdu Caption Text using Deep Convolutional Neural Networks
    Hayat, Umar
    Aatif, Muhammad
    Zeeshan, Osama
    Siddiqi, Imran
    2018 14TH INTERNATIONAL CONFERENCE ON EMERGING TECHNOLOGIES (ICET), 2018,
  • [29] Hybrid photonic deep convolutional residual spiking neural networks for text classification
    Zhang, Yahui
    Xiang, Shuiying
    Jiang, Shuqing
    Han, Yanan
    Guo, Xingxing
    Zheng, Ling
    Shi, Yuechun
    Hao, Yue
    OPTICS EXPRESS, 2023, 31 (17): : 28489 - 28502
  • [30] Deep Convolutional Neural Networks for Text Localisation in Figures From Biomedical Literature
    Almakky, Ibrahim
    Palade, Vasile
    Ruiz-Garcia, Ariel
    2019 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2019,