Classifying Fake News Articles Using Natural Language Processing to Identify In-Article Attribution as a Supervised Learning Estimator

被引:0
|
作者
Traylor, Terry [1 ]
Straub, Jeremy [2 ]
Gurmeet [2 ]
Snell, Nicholas [2 ]
机构
[1] US Marine Corps, Fargo, ND 58103 USA
[2] North Dakota State Univ, Dept Comp Sci, Fargo, ND 58105 USA
关键词
component; Fake News; Machine Learning; Natural Language Processing; Attribution Classification; Influence Mining;
D O I
10.1109/ICSC.2019.00086
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Intentionally deceptive content presented under the guise of legitimate journalism is a worldwide information accuracy and integrity problem that affects opinion forming, decision making, and voting patterns. Most so-called 'fake news' is initially distributed over social media conduits like Facebook and Twitter and later finds its way onto mainstream media platforms such as traditional television and radio news. The fake news stories that are initially seeded over social media platforms share key linguistic characteristics such as making excessive use of unsubstantiated hyperbole and non-attributed quoted content. In this paper, the results of a fake news identification study that documents the performance of a fake news classifier are presented. The Textblob, Natural Language, and SciPy Toolkits were used to develop a novel fake news detector that uses quoted attribution in a Bayesian machine learning system as a key feature to estimate the likelihood that a news article is fake. The resultant process precision is 63.333% effective at assessing the likelihood that an article with quotes is fake. This process is called influence mining and this novel technique is presented as a method that can be used to enable fake news and even propaganda detection. In this paper, the research process, technical analysis, technical linguistics work, and classifier performance and results are presented. The paper concludes with a discussion of how the current system will evolve into an influence mining system.
引用
收藏
页码:445 / 449
页数:5
相关论文
共 43 条
  • [31] Improvement of a Machine Learning Model Using a Sentiment Analysis Algorithm to Detect Fake News: A Case Study of Health and Medical Articles on Thai Language Websites
    Atchariyachanvanich, Kanokwan
    Saengkhunthod, Chotipong
    Kerdnoonwong, Parischaya
    Chanlekha, Hutchatai
    Cooharojananone, Nagul
    JOURNAL OF CASES ON INFORMATION TECHNOLOGY, 2024, 26 (01)
  • [32] Using Natural Language Processing and Machine Learning to Identify Internal Medicine-Pediatrics Residency Values in Applications
    Drum, Benjamin
    Shi, Jianlin
    Peterson, Bennet
    Lamb, Sara
    Hurdle, John F.
    Gradick, Casey
    ACADEMIC MEDICINE, 2023, 98 (11) : 1278 - 1282
  • [33] Stylometric Fake News Detection Based on Natural Language Processing Using Named Entity Recognition: In-Domain and Cross-Domain Analysis
    Tsai, Chih-Ming
    ELECTRONICS, 2023, 12 (17)
  • [34] A New Method to Identify Short-Text Authors Using Combinations of Machine Learning and Natural Language Processing Techniques
    Vijayakumar, Biveeken
    Fuad, Muhammad Marwan Muhammad
    KNOWLEDGE-BASED AND INTELLIGENT INFORMATION & ENGINEERING SYSTEMS (KES 2019), 2019, 159 : 428 - 436
  • [35] NATURAL LANGUAGE PROCESSING BASED MACHINE LEARNING MODEL USING CARDIAC MRI REPORTS TO IDENTIFY HYPERTROPHIC CARDIOMYOPATHY PATIENTS
    Sundaram, Divaakar Siva Baala
    Arunachalam, Shivaram P.
    Damani, Devanshi N.
    Farahani, Nasibeh Z.
    Enayati, Moein
    Pasupathy, Kalyan S.
    Arruda-Olson, Adelaide M.
    PROCEEDINGS OF THE 2021 DESIGN OF MEDICAL DEVICES CONFERENCE (DMD2021), 2021,
  • [36] Automatized spatio-temporal detection of drought impacts from newspaper articles using natural language processing and machine learning
    Sodoge, Jan
    Kuhlicke, Christian
    de Brito, Mariana Madruga
    WEATHER AND CLIMATE EXTREMES, 2023, 41
  • [37] Classifying social determinants of health from unstructured electronic health records using deep learning-based natural language processing
    Han, Sifei
    Zhang, Robert F.
    Shi, Lingyun
    Richie, Russell
    Liu, Haixia
    Tseng, Andrew
    Quan, Wei
    Ryan, Neal
    Brent, David
    Tsui, Fuchiang R.
    JOURNAL OF BIOMEDICAL INFORMATICS, 2022, 127
  • [38] The Influence of Fake News on Social Media: Analysis and Verification of Web Content during the COVID-19 Pandemic by Advanced Machine Learning Methods and Natural Language Processing
    Nistor, Andreea
    Zadobrischi, Eduard
    SUSTAINABILITY, 2022, 14 (17)
  • [39] Using deep learning-based natural language processing to identify reasons for statin nonuse in patients with atherosclerotic cardiovascular disease
    Ashish Sarraju
    Jean Coquet
    Alban Zammit
    Antonia Chan
    Summer Ngo
    Tina Hernandez-Boussard
    Fatima Rodriguez
    Communications Medicine, 2
  • [40] Automating incidental findings in radiology reports using natural language processing and machine learning to identify and classify pulmonary nodules.
    French, Christi
    Makowski, Maciek
    Terker, Samantha
    Clark, Paul Alexander
    JOURNAL OF CLINICAL ONCOLOGY, 2019, 37 (15)