Synthetic Disinformation Attacks on Automated Fact Verification Systems

被引:0
|
作者
Du, Yibing [1 ]
Bosselut, Antoine [2 ]
Manning, Christopher D. [1 ]
机构
[1] Stanford Univ, Stanford, CA 94305 USA
[2] Ecole Polytech Fed Lausanne, Lausanne, Switzerland
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Automated fact-checking is a needed technology to curtail the spread of online misinformation. One current framework for such solutions proposes to verify claims by retrieving supporting or refuting evidence from related textual sources. However, the realistic use cases for fact-checkers will require verifying claims against evidence sources that could be affected by the same misinformation. Furthermore, the development of modern NLP tools that can produce coherent, fabricated content would allow malicious actors to systematically generate adversarial disinformation for fact-checkers. In this work, we explore the sensitivity of automated fact-checkers to synthetic adversarial evidence in two simulated settings: ADVERSARIAL ADDITION, where we fabricate documents and add them to the evidence repository available to the fact-checking system, and ADVERSARIAL MODIFICATION, where existing evidence source documents in the repository are automatically altered. Our study across multiple models on three benchmarks demonstrates that these systems suffer significant performance drops against these attacks. Finally, we discuss the growing threat of modern NLG systems as generators of disinformation in the context of the challenges they pose to automated fact-checkers.
引用
收藏
页码:10581 / 10589
页数:9
相关论文
共 50 条
  • [31] Automated formal verification for flexible manufacturing systems
    E. Carpanzano
    L. Ferrucci
    D. Mandrioli
    M. Mazzolini
    A. Morzenti
    M. Rossi
    Journal of Intelligent Manufacturing, 2014, 25 : 1181 - 1195
  • [32] Automated Crowdturfing Attacks and Defenses in Online Review Systems
    Yao, Yuanshun
    Viswanath, Bimal
    Cryan, Jenna
    Zheng, Haitao
    Zhao, Ben Y.
    CCS'17: PROCEEDINGS OF THE 2017 ACM SIGSAC CONFERENCE ON COMPUTER AND COMMUNICATIONS SECURITY, 2017, : 1143 - 1158
  • [33] VERICA-Verification of Combined Attacks: Automated formal verification of security against simultaneous information leakage and tampering
    Richter-Brockmann J.
    Feldtkeller J.
    Sasdrich P.
    Güneysu T.
    IACR Transactions on Cryptographic Hardware and Embedded Systems, 2022, 2022 (04): : 255 - 284
  • [34] Fact or fake: information, misinformation and disinformation via social media
    Lim, Xin-Jean
    Quach, Sara
    Thaichon, Park
    Cheah, Jun-Hwa
    Ting, Hiram
    JOURNAL OF STRATEGIC MARKETING, 2024, : 659 - 664
  • [35] PCA-Based Adversarial Attacks on Signature Verification Systems
    Jahangir, Maham
    Basa, Azka
    Younis, Muhammad Shahzad
    Shafait, Faisal
    DOCUMENT ANALYSIS AND RECOGNITION-ICDAR 2024, PT II, 2024, 14805 : 364 - 379
  • [36] Countermeasure to Handle Replay Attacks in Practical Speaker Verification Systems
    Paul, Anupanta
    Das, Rohan Kumar
    Sinha, Rohit
    Prasanna, S. R. Mahadeva
    2016 INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND COMMUNICATIONS (SPCOM), 2016,
  • [37] VeriFace: Defending against Adversarial Attacks in Face Verification Systems
    Sayed, Awny
    Kinlany, Sohair
    Zaki, Alaa
    Mahfouz, Ahmed
    CMC-COMPUTERS MATERIALS & CONTINUA, 2023, 76 (03): : 3151 - 3166
  • [38] Quasi-Newton Adversarial Attacks on Speaker Verification Systems
    Goto, Keita
    Inoue, Nakamasa
    2020 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2020, : 527 - 531
  • [39] On the vulnerability of face verification systems to hill-climbing attacks
    Galbally, Javier
    McCool, Chris
    Fierrez, Julian
    Marcel, Sebastien
    Ortega-Garcia, Javier
    PATTERN RECOGNITION, 2010, 43 (03) : 1027 - 1038
  • [40] On the Importance of Delexicalization for Fact Verification
    Suntwal, Sandeep
    Paul, Mithun
    Sharp, Rebecca
    Surdeanu, Mihai
    2019 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING AND THE 9TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (EMNLP-IJCNLP 2019): PROCEEDINGS OF THE CONFERENCE, 2019, : 3413 - 3418