Synthetic Disinformation Attacks on Automated Fact Verification Systems

被引:0
|
作者
Du, Yibing [1 ]
Bosselut, Antoine [2 ]
Manning, Christopher D. [1 ]
机构
[1] Stanford Univ, Stanford, CA 94305 USA
[2] Ecole Polytech Fed Lausanne, Lausanne, Switzerland
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Automated fact-checking is a needed technology to curtail the spread of online misinformation. One current framework for such solutions proposes to verify claims by retrieving supporting or refuting evidence from related textual sources. However, the realistic use cases for fact-checkers will require verifying claims against evidence sources that could be affected by the same misinformation. Furthermore, the development of modern NLP tools that can produce coherent, fabricated content would allow malicious actors to systematically generate adversarial disinformation for fact-checkers. In this work, we explore the sensitivity of automated fact-checkers to synthetic adversarial evidence in two simulated settings: ADVERSARIAL ADDITION, where we fabricate documents and add them to the evidence repository available to the fact-checking system, and ADVERSARIAL MODIFICATION, where existing evidence source documents in the repository are automatically altered. Our study across multiple models on three benchmarks demonstrates that these systems suffer significant performance drops against these attacks. Finally, we discuss the growing threat of modern NLG systems as generators of disinformation in the context of the challenges they pose to automated fact-checkers.
引用
收藏
页码:10581 / 10589
页数:9
相关论文
共 50 条
  • [11] Automated Compositional Verification of Interlocking Systems
    Haxthausen, Anne E.
    Fantechi, Alessandro
    Gori, Gloria
    Mikkelsen, Oli Karason
    Petersen, Sofie-Amalie
    RELIABILITY, SAFETY, AND SECURITY OF RAILWAY SYSTEMS, RSSRAIL 2023, 2023, 14198 : 146 - 164
  • [12] Safety Verification of Automated Driving Systems
    Kianfar, Roozbeh
    Falcone, Paolo
    Fredriksson, Jonas
    IEEE INTELLIGENT TRANSPORTATION SYSTEMS MAGAZINE, 2013, 5 (04) : 73 - 86
  • [13] Automated verification of programs and Web systems
    ter Beek, Maurice H.
    Lisitsa, Alexei
    Nemytykh, Andrei P.
    Ravara, Antonio
    JOURNAL OF LOGICAL AND ALGEBRAIC METHODS IN PROGRAMMING, 2016, 85 (05) : 653 - 654
  • [14] Automated Verification Techniques for Probabilistic Systems
    Forejt, Vojtech
    Kwiatkowska, Marta
    Norman, Gethin
    Parker, David
    FORMAL METHODS FOR ETERNAL NETWORKED SOFTWARE SYSTEMS, SFM 2011, 2011, 6659 : 53 - 113
  • [15] Credible, Unreliable or Leaked?: Evidence Verification for Enhanced Automated Fact-checking
    Chrysidis, Zacharias
    Papadopoulos, Stefanos-Iordanis
    Papadopoulos, Symeon
    Petrantonakis, Panagiotis C.
    PROCEEDINGS OF THE 3RD ACM INTERNATIONAL WORKSHOP ON MULTIMEDIA AI AGAINST DISINFORMATION, MAD 2024, 2024, : 73 - 81
  • [16] Defending Against Adversarial Attacks in Speaker Verification Systems
    Chang, Li-Chi
    Chen, Zesheng
    Chen, Chao
    Wang, Guoping
    Bi, Zhuming
    2021 IEEE INTERNATIONAL PERFORMANCE, COMPUTING, AND COMMUNICATIONS CONFERENCE (IPCCC), 2021,
  • [17] An evaluation of indirect attacks and countermeasures in fingerprint verification systems
    Martinez-Diaz, Marcos
    Fierrez, Julian
    Galbally, Javier
    Ortega-Garcia, Javier
    PATTERN RECOGNITION LETTERS, 2011, 32 (12) : 1643 - 1651
  • [18] Ensemble Adversarial Defenses and Attacks in Speaker Verification Systems
    Chen, Zesheng
    Li, Jack
    Chen, Chao
    IEEE INTERNET OF THINGS JOURNAL, 2024, 11 (20): : 32645 - 32655
  • [19] On the Detection of Adaptive Adversarial Attacks in Speaker Verification Systems
    Chen, Zesheng
    IEEE INTERNET OF THINGS JOURNAL, 2023, 10 (18) : 16271 - 16283
  • [20] Toward Practical Adversarial Attacks on Face Verification Systems
    Kakizaki, Kazuya
    Miyagawa, Taiki
    Singh, Inderjeet
    Sakuma, Jun
    PROCEEDINGS OF THE 20TH INTERNATIONAL CONFERENCE OF THE BIOMETRICS SPECIAL INTEREST GROUP (BIOSIG 2021), 2021, 315