Classifying Fake News Articles Using Natural Language Processing to Identify In-Article Attribution as a Supervised Learning Estimator

被引:0
|
作者
Traylor, Terry [1 ]
Straub, Jeremy [2 ]
Gurmeet [2 ]
Snell, Nicholas [2 ]
机构
[1] US Marine Corps, Fargo, ND 58103 USA
[2] North Dakota State Univ, Dept Comp Sci, Fargo, ND 58105 USA
关键词
component; Fake News; Machine Learning; Natural Language Processing; Attribution Classification; Influence Mining;
D O I
10.1109/ICSC.2019.00086
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Intentionally deceptive content presented under the guise of legitimate journalism is a worldwide information accuracy and integrity problem that affects opinion forming, decision making, and voting patterns. Most so-called 'fake news' is initially distributed over social media conduits like Facebook and Twitter and later finds its way onto mainstream media platforms such as traditional television and radio news. The fake news stories that are initially seeded over social media platforms share key linguistic characteristics such as making excessive use of unsubstantiated hyperbole and non-attributed quoted content. In this paper, the results of a fake news identification study that documents the performance of a fake news classifier are presented. The Textblob, Natural Language, and SciPy Toolkits were used to develop a novel fake news detector that uses quoted attribution in a Bayesian machine learning system as a key feature to estimate the likelihood that a news article is fake. The resultant process precision is 63.333% effective at assessing the likelihood that an article with quotes is fake. This process is called influence mining and this novel technique is presented as a method that can be used to enable fake news and even propaganda detection. In this paper, the research process, technical analysis, technical linguistics work, and classifier performance and results are presented. The paper concludes with a discussion of how the current system will evolve into an influence mining system.
引用
收藏
页码:445 / 449
页数:5
相关论文
共 43 条
  • [41] Using deep learning-based natural language processing to identify reasons for statin nonuse in patients with atherosclerotic cardiovascular disease
    Sarraju, Ashish
    Coquet, Jean
    Zammit, Alban
    Chan, Antonia
    Ngo, Summer
    Hernandez-Boussard, Tina
    Rodriguez, Fatima
    COMMUNICATIONS MEDICINE, 2022, 2 (01):
  • [42] Predictive article recommendation using natural language processing and machine learning to support evidence updates in domain-specific knowledge graphs
    Sharma, Bhuvan
    Willis, Van C.
    Huettner, Claudia S.
    Beaty, Kirk
    Snowdon, Jane L.
    Xue, Shang
    South, Brett R.
    Jackson, Gretchen P.
    Weeraratne, Dilhan
    Michelini, Vanessa
    JAMIA OPEN, 2020, 3 (03) : 332 - 337
  • [43] Detecting intimate partner violence circumstance for suicide: development and validation of a tool using natural language processing and supervised machine learning in the National Violent Death Reporting System
    Kafka, Julie M.
    Fliss, Mike D.
    Trangenstein, Pamela J.
    Reyes, Luz McNaughton
    Pence, Brian W.
    Moracco, Kathryn E.
    INJURY PREVENTION, 2023, 29 (02) : 134 - 141