Detecting science-based health disinformation: a stylometric machine learning approach

Cited by: 1
Authors
Williams, Jason A. [1]
Aleroud, Ahmed [1]
Zimmerman, Danielle [1]
Affiliations
[1] Augusta Univ, Sch Comp & Cyber Sci, Augusta, GA 30192 USA
Keywords
Health disinformation; COVID-19; Machine learning; Science; Human behavior; Misinformation; Readability
DOI
10.1007/s42001-023-00213-y
Chinese Library Classification
O1 [Mathematics]; C [Social Sciences, General]
Discipline classification codes
03; 0303; 0701; 070101
Abstract
The COVID-19 pandemic showed that misleading scientific health information has become widespread and is challenging to counteract. Some of this disinformation comes from modification of medical research results. This paper investigates how humans create health disinformation through controlled changes of text from abstracts of peer-reviewed COVID-19 research papers. We also developed a machine learning model that used statement embeddings, readability, and text quality features to create datasets that contain falsified scientific statements. We then created machine learning classification models to identify statements containing disinformation. Our results reveal the importance of readability metrics and information quality features in identifying which statements were falsified. We show that text embeddings and semantic similarity do not yield a high detection rate of true/falsified statements compared to using information quality and readability features.
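As a rough illustration of the kind of readability and stylometric features the abstract refers to, the sketch below computes a few of them for a single statement. It is not the authors' feature set or pipeline: the function names `count_syllables` and `stylometric_features` are hypothetical, the syllable counter is a crude vowel-group heuristic, and the only published formula used is the standard Flesch Reading Ease score.

```python
import re


def count_syllables(word: str) -> int:
    """Crude heuristic (an assumption): one syllable per run of vowels, minimum 1."""
    groups = re.findall(r"[aeiouy]+", word.lower())
    return max(1, len(groups))


def stylometric_features(text: str) -> dict:
    """Return a small, illustrative set of readability features for one statement."""
    # Naive sentence split on ., !, ? (decimals and abbreviations will over-split).
    sentences = [s for s in re.split(r"[.!?]+", text) if s.strip()]
    words = re.findall(r"[A-Za-z']+", text)
    if not sentences or not words:
        raise ValueError("text must contain at least one word")

    syllables = sum(count_syllables(w) for w in words)
    words_per_sentence = len(words) / len(sentences)
    syllables_per_word = syllables / len(words)

    return {
        "words_per_sentence": words_per_sentence,
        "syllables_per_word": syllables_per_word,
        # Lexical diversity: distinct lowercase words over total words.
        "type_token_ratio": len({w.lower() for w in words}) / len(words),
        # Flesch Reading Ease: 206.835 - 1.015 * (words/sentence) - 84.6 * (syllables/word)
        "flesch_reading_ease": 206.835
        - 1.015 * words_per_sentence
        - 84.6 * syllables_per_word,
    }


if __name__ == "__main__":
    features = stylometric_features("The cat sat. The dog ran.")
    for name, value in features.items():
        print(f"{name}: {value:.3f}")
```

Feature vectors like this could then be passed to any standard classifier to separate original statements from falsified ones; which classifier and which features the paper actually used is not stated in this record.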
Pages: 817-843
Number of pages: 27