Satire identification in Turkish news articles based on ensemble of classifiers

被引:34
|
作者
Onan, Aytug [1 ]
Tocoglu, Mansur Alp [2 ]
机构
[1] Izmir Katip Celebi Univ, Fac Engn & Architecture, Dept Comp Engn, Izmir, Turkey
[2] Manisa Celal Bayar Univ, Fac Technol, Dept Software Engn, Manisa, Turkey
关键词
Satire detection; figurative language; machine learning; classifier ensembles;
D O I
10.3906/elk-1907-11
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Social media and microblogging platforms generally contain elements of figurative and nonliteral language, including satire. The identification of figurative language is a fundamental task for sentiment analysis. It will not be possible to obtain sentiment analysis methods with high classification accuracy if elements of figurative language have not been properly identified. Satirical text is a kind of figurative language, in which irony and humor have been utilized to ridicule or criticize an event or entity. Satirical news is a pervasive issue on social media platforms, which can be deceptive and harmful. This paper presents an ensemble scheme for satirical news identification in Turkish news articles. In the presented scheme, linguistic and psychological feature sets have been utilized to extract the feature sets (i.e. linguistic, psychological, personal, spoken categories, and punctuation). In the classification phase, accuracy rates of five supervised learning algorithms (i.e. naive Bayes algorithm, logistic regression, support vector machines, random forest, and k-nearest neighbor algorithm) with three widely utilized ensemble methods (i.e. AdaBoost, bagging, and random subspace) have been considered. Based on the results, we concluded that the random forest algorithm yielded the highest performance, with a classification accuracy of 96.92% for satire detection in Turkish. For deep learning-based architectures, we have achieved classification accuracy of 97.72% with the recurrent neural network architecture with attention mechanism.
引用
收藏
页码:1086 / 1106
页数:21
相关论文
共 50 条
  • [21] Text Authorship Identification Based On Ensemble Learning and Genetic Algorithm Combination in Turkish Text
    Gullu, Merve
    Polat, Huseyin
    JOURNAL OF POLYTECHNIC-POLITEKNIK DERGISI, 2022, 25 (03): : 1287 - 1297
  • [22] Dialect Identification Using Spectral and Prosodic Features on Single and Ensemble Classifiers
    Nagaratna B. Chittaragi
    Ambareesh Prakash
    Shashidhar G. Koolagudi
    Arabian Journal for Science and Engineering, 2018, 43 : 4289 - 4302
  • [23] Dialect Identification Using Spectral and Prosodic Features on Single and Ensemble Classifiers
    Chittaragi, Nagaratna B.
    Prakash, Ambareesh
    Koolagudi, Shashidhar G.
    ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING, 2018, 43 (08) : 4289 - 4302
  • [24] Classifiers selection for ensemble learning based on accuracy and diversity
    Yang, Liying
    CEIS 2011, 2011, 15
  • [25] Sea Water Pollution Assessment Based On Ensemble of Classifiers
    Zeng, Bin
    Luo, Zhaohui
    Wei, Jun
    ICNC 2008: FOURTH INTERNATIONAL CONFERENCE ON NATURAL COMPUTATION, VOL 1, PROCEEDINGS, 2008, : 241 - 245
  • [26] Algorithm of Ensemble of Multi-classifiers Based on Roughness
    Zhang Jun
    Li Peng
    INTERNATIONAL CONFERENCE OF CHINA COMMUNICATION (ICCC2010), 2010, : 289 - 290
  • [27] Selective Ensemble Based on Transformation of Classifiers Used SPCA
    Xiong, Lin
    Mao, Shasha
    Jiao, Licheng
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2015, 29 (01)
  • [28] Identification of clickbait news articles using SBERT and correlation matrix
    Jyoti Prakash Supriya
    Gunjan Singh
    Social Network Analysis and Mining, 13
  • [29] Cost Complexity-Based Pruning of Ensemble Classifiers
    Prodromidis, Andreas L.
    Stolfo, Salvatore J.
    Knowledge and Information Systems, 2001, Springer Science and Business Media Deutschland GmbH (03) : 449 - 469
  • [30] An intrusion detection scheme based on the ensemble of discriminant classifiers
    Bhati, Bhoopesh Singh
    Rai, C. S.
    Balamurugan, B.
    Al-Turjman, Fadi
    COMPUTERS & ELECTRICAL ENGINEERING, 2020, 86