Classification of Exaggerated News Headlines

被引:0
|
作者
Rangata, Mapitsi Roseline [1 ]
Sefara, Tshephisho Joseph [1 ]
机构
[1] CSIR, Pretoria, South Africa
关键词
Classification; News headlines; Machine learning; Natural language processing; Exaggerated News;
D O I
10.1007/978-3-031-53731-8_20
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The amount of data online is increasing as companies generate news articles daily. These news articles contain headlines that have a level of exaggeration aimed to win the readers. In addition, these companies are competing against one another; hence creating appealing and exaggerated news headlines is one of the options to win the readers. Some of the exaggerated headlines contain some level of misleading information. Hence, this paper aims to apply machine learning methods and natural language processing to detect and identify exaggerated news headlines in South African context. Machine learning models such as logistic regression, decision trees, support vector machines, and XGBoost are trained on data that contain labelled news headlines as binary classification. The models produced good results, with XGboost and SVM obtaining 70% in terms of accuracy. Furthermore, the F measure was used to evaluate the models and decision trees obtained 56% followed by SVM with 53%. The classification of exaggerated news headlines is a difficult task. Therefore, we oversampled the data to obtain balanced labels. The performance of the models was increased. SVM obtained 84% followed by logistic regression, XGBoost, and decision trees with accuracy of 78%, 72% and 71%, respectively.
引用
收藏
页码:248 / 260
页数:13
相关论文
共 50 条
  • [21] The Effects of Subtle Misinformation in News Headlines
    Ecker, Ullrich K. H.
    Lewandowsky, Stephan
    Chang, Ee Pin
    Pillai, Rekha
    JOURNAL OF EXPERIMENTAL PSYCHOLOGY-APPLIED, 2014, 20 (04) : 323 - 335
  • [22] WITNESS POETRY + NEWS HEADLINES OF THE DAY
    GUSTAFSON, R
    MALAHAT REVIEW, 1979, (51): : 8 - 16
  • [23] Recommendation on Keyword Combination of News Headlines
    Weng, Sung-Shun
    Wu, Jing-Yi
    2018 5TH INTERNATIONAL CONFERENCE ON SYSTEMS AND INFORMATICS (ICSAI), 2018, : 1146 - 1151
  • [24] Learning to Determine the Quality of News Headlines
    Omidvar, Amin
    Pourmodheji, Hossein
    An, Aijun
    Edall, Gordon
    ICAART: PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON AGENTS AND ARTIFICIAL INTELLIGENCE, VOL 1, 2020, : 401 - 409
  • [25] Analysis on Features of News Headlines Translation
    庞丹笛
    英语广场(学术研究), 2012, (05) : 29 - 32
  • [26] Personal Deixis in English News Headlines
    杨康
    校园英语, 2019, (37) : 237 - 237
  • [27] On the Strategies for Translating English News Headlines
    Wang Chao
    校园英语, 2016, (08) : 240 - 240
  • [28] Generating Representative Headlines for News Stories
    Gu, Xiaotao
    Mao, Yuning
    Han, Jiawei
    Liu, Jialu
    Yu, Hongkun
    Wu, You
    Yu, Cong
    Finnie, Daniel
    Zhai, Jiaqi
    Zukoski, Nicholas
    WEB CONFERENCE 2020: PROCEEDINGS OF THE WORLD WIDE WEB CONFERENCE (WWW 2020), 2020, : 1773 - 1784
  • [29] Intonation trends in news headlines in Catalan
    Font-Rotches, Dolors
    Paloma, David
    CIRCULO DE LINGUISTICA APLICADA A LA COMUNICACION, 2012, (51): : 50 - 81
  • [30] Human Body Metaphor in News Headlines
    Yang, Bingbing
    Wang, Zhimin
    CHINESE LEXICAL SEMANTICS, CLSW 2021, PT II, 2022, 13250 : 3 - 17