Detecting science-based health disinformation: a stylometric machine learning approach

被引:1
|
作者
Williams, Jason A. [1 ]
Aleroud, Ahmed [1 ]
Zimmerman, Danielle [1 ]
机构
[1] Augusta Univ, Sch Comp & Cyber Sci, Augusta, GA 30192 USA
来源
关键词
Health disinformation; COVID-19; Machine learning; Science; Human behavior; MISINFORMATION; READABILITY;
D O I
10.1007/s42001-023-00213-y
中图分类号
O1 [数学]; C [社会科学总论];
学科分类号
03 ; 0303 ; 0701 ; 070101 ;
摘要
The COVID-19 pandemic showed that misleading scientific health information has become widespread and is challenging to counteract. Some of this disinformation comes from modification of medical research results. This paper investigates how humans create health disinformation through controlled changes of text from abstracts of peer-reviewed COVID-19 research papers. We also developed a machine learning model that used statement embeddings, readability, and text quality features to create datasets that contain falsified scientific statements. We then created machine learning classification models to identify statements containing disinformation. Our results reveal the importance of readability metrics and information quality features in identifying which statements were falsified. We show that text embeddings and semantic similarity do not yield a high detection rate of true/falsified statements compared to using information quality and readability features.
引用
收藏
页码:817 / 843
页数:27
相关论文
共 50 条
  • [21] Adult Science-Based Learning The Intersection of Digital, Science, and Information Literacies
    Bliss, Angela Collier
    ADULT LEARNING, 2019, 30 (03) : 128 - 137
  • [22] Science-Based Communication Strategy for a Federal Health Agency
    Weber, Mark A.
    Backer, Thomas E.
    SCIENCE COMMUNICATION, 2013, 35 (05) : 667 - 677
  • [23] FDA reform: The need for a sound science-based approach
    Cady, J
    FOOD AND DRUG LAW JOURNAL, 1996, 51 (03): : 407 - 412
  • [24] SCIENCE-BASED HEALTH MANAGEMENT PLANNING FOR GREAT APES
    Travis, D. A.
    Lonsdorf, E. V.
    Gillespie, T. R.
    Lipende, I.
    Raphael, J.
    Terio, K. A.
    Murray, C. M.
    Mjungu, D.
    Collins, A.
    Parsons, M. B.
    Wolf, T.
    Singer, R.
    Hahn, B. H.
    Wilson, M. L.
    Pusey, A. E.
    AMERICAN JOURNAL OF PRIMATOLOGY, 2014, 76 : 39 - 39
  • [25] Detecting Refactoring Commits in Machine Learning Python']Python Projects: A Machine Learning-Based Approach
    Noei, Shayan
    Li, Heng
    Zou, Ying
    ACM TRANSACTIONS ON SOFTWARE ENGINEERING AND METHODOLOGY, 2025, 34 (03)
  • [26] Venture funding for science-based African health innovation
    Masum, Hassan
    Chakma, Justin
    Simiyu, Ken
    Ronoh, Wesley
    Daar, Abdallah S.
    Singer, Peter A.
    BMC INTERNATIONAL HEALTH AND HUMAN RIGHTS, 2010, 10
  • [27] A Multidisciplinary, Science-Based Approach to the Economics of Climate Change
    Carlin, Alan
    INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH, 2011, 8 (04): : 985 - 1031
  • [28] A Social Science-based Approach to Explanations for (Game) AI
    Volz, Vanessa
    Majchrzak, Kevin
    Preuss, Mike
    PROCEEDINGS OF THE 2018 IEEE CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND GAMES (CIG'18), 2018, : 474 - 481
  • [29] Detecting Encrypted Traffic: A Machine Learning Approach
    Cha, Seunghun
    Kim, Hyoungshick
    INFORMATION SECURITY APPLICATIONS, WISA 2016, 2017, 10144 : 54 - 65
  • [30] Learning Science-Based Healthy Lifestyles Knowledge in Physical Education
    Deng, Anqi
    Wang, Yubing
    Deng, Yangyang
    Chen, Ang
    Zhang, Tan
    Schweighardt, Ray
    RESEARCH QUARTERLY FOR EXERCISE AND SPORT, 2019, 90 : A133 - A134