Deep learning- and word embedding-based heterogeneous classifier ensembles for text classification

Cited by: 22
Authors
Kilimci Z.H. [1]
Akyokus S. [2]
Affiliations
[1] Computer Engineering Department, Dogus University, Istanbul
[2] Computer Engineering Department, İstanbul Medipol University, Istanbul
Keywords
All Open Access; Gold;
DOI
10.1155/2018/7130146
Abstract
Ensemble learning, deep learning, and effective document representation methods are currently among the most common ways to improve the overall accuracy of a text classification/categorization system. Ensemble learning raises the overall accuracy of a classification system by combining multiple classifiers. Deep learning-based methods provide better results in many applications than conventional machine learning algorithms. Word embeddings represent words learned from a corpus as vectors, so that words with similar meanings have similar representations. In this study, we use different document representations that benefit from word embeddings, together with an ensemble of base classifiers, for text classification. The ensemble of base classifiers includes traditional machine learning algorithms such as naïve Bayes, support vector machine, and random forest, and a deep learning-based convolutional network classifier. We analysed the classification accuracy of different document representations by employing an ensemble of classifiers on eight different datasets. Experimental results demonstrate that using heterogeneous ensembles together with deep learning methods and word embeddings enhances text classification performance. Copyright © 2018 Zeynep H. Kilimci and Selim Akyokus.
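The two ideas the abstract combines, a word-embedding-based document representation and a combination of predictions from different base classifiers, can be illustrated with a minimal sketch. This is not the authors' implementation: the toy vectors, the averaging scheme, the majority-vote rule, and the names `TOY_EMBEDDINGS`, `average_embedding`, and `majority_vote` are all assumptions made for illustration, and the abstract does not state which representation or combination rule the study uses.

```python
from collections import Counter
from typing import Dict, List, Sequence

# Hypothetical 2-dimensional word vectors; real embeddings would be
# learned from a corpus and have far more dimensions.
TOY_EMBEDDINGS: Dict[str, List[float]] = {
    "goal": [1.0, 0.0],
    "match": [0.8, 0.2],
    "stock": [0.0, 1.0],
    "market": [0.2, 0.8],
}


def average_embedding(
    tokens: Sequence[str], embeddings: Dict[str, List[float]], dim: int
) -> List[float]:
    """Represent a document as the mean of the vectors of its known words."""
    known = [embeddings[t] for t in tokens if t in embeddings]
    if not known:
        # No known word: fall back to the zero vector.
        return [0.0] * dim
    return [sum(vec[i] for vec in known) / len(known) for i in range(dim)]


def majority_vote(predictions: Sequence[str]) -> str:
    """Combine the labels predicted by several base classifiers.

    Ties are broken in favour of the label that appears first.
    """
    if not predictions:
        raise ValueError("at least one prediction is required")
    counts = Counter(predictions)
    best = max(counts.values())
    for label in predictions:
        if counts[label] == best:
            return label
    raise AssertionError("unreachable")


# Assumed outputs of three different base classifiers for one document.
doc_vector = average_embedding(["goal", "match", "unknown"], TOY_EMBEDDINGS, 2)
final_label = majority_vote(["sports", "finance", "sports"])
```

In a heterogeneous ensemble, each entry passed to `majority_vote` would come from a different kind of model (for example naïve Bayes, a support vector machine, a random forest, and a neural network) trained on the same document vectors.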
Related papers
50 in total
  • [21] Deep Learning Algorithms Based Text Classifier
    Venkataraman, Arthi
    PROCEEDINGS OF THE 2016 2ND INTERNATIONAL CONFERENCE ON APPLIED AND THEORETICAL COMPUTING AND COMMUNICATION TECHNOLOGY (ICATCCT), 2016, : 220 - 224
  • [22] A Text Classifier Using Weighted Average Word Embedding
    Elsaadawy, AbdAllah
    Torki, Marwan
    El-Makky, Nagwa
    2018 PROCEEDINGS OF THE INTERNATIONAL JAPAN-AFRICA CONFERENCE ON ELECTRONICS, COMMUNICATIONS, AND COMPUTATIONS (JAC-ECC 2018), 2018, : 151 - 154
  • [23] Heterogeneous IoT Intrusion Detection Based on Fusion Word Embedding Deep Transfer Learning
    Chen, Di
    Zhang, Fengbin
    Zhang, Xinpeng
    IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2023, 19 (08) : 9183 - 9193
  • [24] WEDL-NIDS: Improving Network Intrusion Detection Using Word Embedding-Based Deep Learning Method
    Cui, Jianjing
    Long, Jun
    Min, Erxue
    Mao, Yugang
    MODELING DECISIONS FOR ARTIFICIAL INTELLIGENCE (MDAI 2018), 2018, 11144 : 283 - 295
  • [25] Text Sentiment Polarity Classification Method Based on Word Embedding
    Sun, Xiaojie
    Du, Menghao
    Shi, Hua
    Huang, Wenming
    PROCEEDINGS OF THE 2018 2ND INTERNATIONAL CONFERENCE ON ALGORITHMS, COMPUTING AND SYSTEMS (ICACS 2018), 2018, : 99 - 104
  • [26] Chinese Text Classification Method Based on BERT Word Embedding
    Wang, Ziniu
    Huang, Zhilin
    Gao, Jianling
    2020 5TH INTERNATIONAL CONFERENCE ON MATHEMATICS AND ARTIFICIAL INTELLIGENCE (ICMAI 2020), 2020, : 66 - 71
  • [27] An embedding-based text classification approach for understanding micro-geographic housing dynamics
    Nilsson, Isabelle
    Delmelle, Elizabeth C.
    INTERNATIONAL JOURNAL OF GEOGRAPHICAL INFORMATION SCIENCE, 2023, 37 (12) : 2487 - 2513
  • [28] Few-Shot Transfer Learning for Text Classification With Lightweight Word Embedding Based Models
    Pan, Chongyu
    Huang, Jian
    Gong, Jianxing
    Yuan, Xingsheng
    IEEE ACCESS, 2019, 7 : 53296 - 53304
  • [29] Exploring the effectiveness of word embedding based deep learning model for improving email classification
    Asudani, Deepak Suresh
    Nagwani, Naresh Kumar
    Singh, Pradeep
    DATA TECHNOLOGIES AND APPLICATIONS, 2022, 56 (04) : 483 - 505
  • [30] An Embedding-Based Topic Model for Document Classification
    Seifollahi, Sattar
    Piccardi, Massimo
    Jolfaei, Alireza
    ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2021, 20 (03)