Synthetic Disinformation Attacks on Automated Fact Verification Systems

Authors
Du, Yibing [1 ]
Bosselut, Antoine [2 ]
Manning, Christopher D. [1 ]
Affiliations
[1] Stanford Univ, Stanford, CA 94305 USA
[2] Ecole Polytech Fed Lausanne, Lausanne, Switzerland
Keywords
DOI
Not available
Chinese Library Classification
TP18 [Artificial Intelligence Theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Automated fact-checking is a needed technology to curtail the spread of online misinformation. One current framework for such solutions proposes to verify claims by retrieving supporting or refuting evidence from related textual sources. However, the realistic use cases for fact-checkers will require verifying claims against evidence sources that could be affected by the same misinformation. Furthermore, the development of modern NLP tools that can produce coherent, fabricated content would allow malicious actors to systematically generate adversarial disinformation for fact-checkers. In this work, we explore the sensitivity of automated fact-checkers to synthetic adversarial evidence in two simulated settings: ADVERSARIAL ADDITION, where we fabricate documents and add them to the evidence repository available to the fact-checking system, and ADVERSARIAL MODIFICATION, where existing evidence source documents in the repository are automatically altered. Our study across multiple models on three benchmarks demonstrates that these systems suffer significant performance drops against these attacks. Finally, we discuss the growing threat of modern NLG systems as generators of disinformation in the context of the challenges they pose to automated fact-checkers.
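The two simulated settings named in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the word-overlap retriever, the toy repository, and the names `retrieve`, `adversarial_addition`, `adversarial_modification`, and `rewrite` are all hypothetical and chosen only for illustration. The sketch assumes an evidence repository is a list of strings and shows how each setting can change which evidence a retriever surfaces for a claim.

```python
from typing import Callable, List


def retrieve(claim: str, repo: List[str], k: int = 1) -> List[str]:
    """Toy retriever (assumption): rank documents by word overlap with the claim."""
    claim_words = set(claim.lower().split())
    ranked = sorted(
        repo,
        key=lambda doc: len(claim_words & set(doc.lower().split())),
        reverse=True,
    )
    return ranked[:k]


def adversarial_addition(repo: List[str], fabricated: List[str]) -> List[str]:
    """Setting 1: fabricated documents are added to the evidence repository."""
    return list(repo) + list(fabricated)


def adversarial_modification(repo: List[str], rewrite: Callable[[str], str]) -> List[str]:
    """Setting 2: existing documents in the repository are automatically altered."""
    return [rewrite(doc) for doc in repo]


def rewrite(doc: str) -> str:
    """Hypothetical stand-in for an automatic editing model."""
    return doc.replace("paris", "berlin")


# Toy data, invented for this illustration only.
repo = [
    "the eiffel tower is located in paris",
    "mount everest is the highest mountain on earth",
]
claim = "the eiffel tower is located in berlin"
fabricated = ["the eiffel tower is located in berlin since 1889"]

clean_evidence = retrieve(claim, repo)
added_evidence = retrieve(claim, adversarial_addition(repo, fabricated))
modified_evidence = retrieve(claim, adversarial_modification(repo, rewrite))

print("clean:   ", clean_evidence)
print("addition:", added_evidence)
print("modified:", modified_evidence)
```

In this toy run the clean repository surfaces a document that contradicts the false claim, while both attacked repositories surface text that agrees with it. A real system would use a learned retriever and verifier, so the sketch only conveys the shape of the two settings, not their measured effect.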
Pages: 10581-10589
Page count: 9