Classification of Exaggerated News Headlines

Authors
Rangata, Mapitsi Roseline [1]
Sefara, Tshephisho Joseph [1]
Affiliations
[1] CSIR, Pretoria, South Africa
Keywords
Classification; News headlines; Machine learning; Natural language processing; Exaggerated News;
DOI
10.1007/978-3-031-53731-8_20
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
The amount of data online is increasing as companies publish news articles daily. The headlines of these articles are often exaggerated to attract readers, and because news companies compete with one another, writing appealing, exaggerated headlines is one way to win readers. Some exaggerated headlines contain misleading information. This paper therefore applies machine learning and natural language processing methods to detect exaggerated news headlines in the South African context. Logistic regression, decision trees, support vector machines (SVM), and XGBoost are trained as binary classifiers on labelled news headlines. XGBoost and SVM each obtained an accuracy of 70%. The models were also evaluated with the F-measure, on which decision trees obtained 56%, followed by SVM with 53%. Classifying exaggerated news headlines is a difficult task, so we oversampled the data to obtain balanced labels, which improved the performance of the models: SVM obtained an accuracy of 84%, followed by logistic regression, XGBoost, and decision trees with 78%, 72%, and 71%, respectively.
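The abstract reports both accuracy and F-measure and mentions oversampling to balance the labels. The sketch below illustrates those three ideas in plain Python. It is not the authors' pipeline: the toy headlines, the label encoding (1 = exaggerated, 0 = not exaggerated), the function names, and the choice of random duplication as the oversampling method are all assumptions made for illustration, since the abstract does not say which oversampling technique was used.

```python
import random
from collections import Counter


def random_oversample(texts, labels, seed=0):
    """Duplicate randomly chosen examples of each smaller class until
    every class has as many examples as the largest class.
    (Assumed technique; the abstract does not name the method used.)"""
    rng = random.Random(seed)
    counts = Counter(labels)
    target = max(counts.values())
    out_texts, out_labels = list(texts), list(labels)
    for cls, n in counts.items():
        pool = [t for t, y in zip(texts, labels) if y == cls]
        for _ in range(target - n):
            out_texts.append(rng.choice(pool))
            out_labels.append(cls)
    return out_texts, out_labels


def accuracy(y_true, y_pred):
    """Fraction of predictions that match the true labels."""
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)


def f1_score(y_true, y_pred, positive=1):
    """Harmonic mean of precision and recall for the positive class."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)
    if tp == 0:
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


# Hypothetical toy data: 6 ordinary headlines (0) and 2 exaggerated ones (1).
headlines = [
    "Council approves new budget",
    "Rain expected over the weekend",
    "University opens new library",
    "Fuel price changes announced",
    "Local team wins league match",
    "Road repairs begin on Monday",
    "You will never believe what this minister did",
    "Shocking secret that will change everything",
]
labels = [0, 0, 0, 0, 0, 0, 1, 1]

# A classifier that always predicts the majority class looks acceptable
# on accuracy but finds no exaggerated headlines at all.
majority_pred = [0] * len(labels)
baseline_accuracy = accuracy(labels, majority_pred)
baseline_f1 = f1_score(labels, majority_pred)

# Balance the classes before training any model.
balanced_headlines, balanced_labels = random_oversample(headlines, labels)
balanced_counts = Counter(balanced_labels)
```

On imbalanced labels, a high accuracy can coexist with a low F-measure, which is one reason to report both and to rebalance the training data. In practice, oversampling is normally applied to the training split only, so that duplicated examples do not leak into the evaluation set.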
Pages: 248-260
Number of pages: 13