Deep Pyramid Convolutional Neural Networks for Text Categorization

被引:499
|
作者
Johnson, Rie [1 ]
Zhang, Tong [2 ]
机构
[1] RJ Res Consulting, Tarrytown, NY 10591 USA
[2] Tencent AI Lab, Shenzhen, Peoples R China
关键词
D O I
10.18653/v1/P17-1052
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
This paper proposes a low-complexity word-level deep convolutional neural network (CNN) architecture for text categorization that can efficiently represent long-range associations in text. In the literature, several deep and complex neural networks have been proposed for this task, assuming availability of relatively large amounts of training data. However, the associated computational complexity increases as the networks go deeper, which poses serious challenges in practical applications. Moreover, it was shown recently that shallow word-level CNNs are more accurate and much faster than the state-of-the-art very deep nets such as character-level CNNs even in the setting of large training data. Motivated by these findings, we carefully studied deepening of word-level CNNs to capture global representations of text, and found a simple network architecture with which the best accuracy can be obtained by increasing the network depth without increasing computational cost by much. We call it deep pyramid CNN. The proposed model with 15 weight layers outperforms the previous best models on six benchmark datasets for sentiment classification and topic categorization.
引用
收藏
页码:562 / 570
页数:9
相关论文
共 50 条
  • [31] Deep Neural Models and Retrofitting for Arabic Text Categorization
    El-Alami, Fatima-Zahra
    El Alaoui, Said Ouatik
    En-Nahnahi, Noureddine
    INTERNATIONAL JOURNAL OF INTELLIGENT INFORMATION TECHNOLOGIES, 2020, 16 (02) : 74 - 86
  • [32] Chinese Text Categorization Based on Deep Belief Networks
    Song, Jia
    Qin, Sijun
    Zhang, Pengzhou
    2016 IEEE/ACIS 15TH INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION SCIENCE (ICIS), 2016, : 1123 - 1127
  • [33] Deep Pyramid Convolutional Neural Network Integrated with Self-attention Mechanism and Highway Network for Text Classification
    Li, Xuewei
    Ning, Hongyun
    4TH INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE APPLICATIONS AND TECHNOLOGIES (AIAAT 2020), 2020, 1642
  • [34] Very Deep Convolutional Networks for Text Classification
    Conneau, Alexis
    Schwenk, Holger
    Le Cun, Yann
    Barrault, Loic
    15TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (EACL 2017), VOL 1: LONG PAPERS, 2017, : 1107 - 1116
  • [35] Reading Text in the Wild with Convolutional Neural Networks
    Max Jaderberg
    Karen Simonyan
    Andrea Vedaldi
    Andrew Zisserman
    International Journal of Computer Vision, 2016, 116 : 1 - 20
  • [36] Reading Text in the Wild with Convolutional Neural Networks
    Jaderberg, Max
    Simonyan, Karen
    Vedaldi, Andrea
    Zisserman, Andrew
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2016, 116 (01) : 1 - 20
  • [37] On the Interpretation of Convolutional Neural Networks for Text Classification
    Xu, Jincheng
    Du, Qingfeng
    ECAI 2020: 24TH EUROPEAN CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2020, 325 : 2252 - 2259
  • [38] Convolutional Neural Networks for Financial Text Regression
    Dereli, Nesat
    Saraclar, Murat
    57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019:): STUDENT RESEARCH WORKSHOP, 2019, : 331 - 337
  • [39] Convolutional Recurrent Neural Networks for Text Classification
    Lyu, Shengfei
    Liu, Jiaqi
    JOURNAL OF DATABASE MANAGEMENT, 2021, 32 (04) : 65 - 82
  • [40] Recurrent Convolutional Neural Networks for Text Classification
    Lai, Siwei
    Xu, Liheng
    Liu, Kang
    Zhao, Jun
    PROCEEDINGS OF THE TWENTY-NINTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2015, : 2267 - 2273