Synthetic Disinformation Attacks on Automated Fact Verification Systems

Citations: 0
Authors
Du, Yibing [1]
Bosselut, Antoine [2]
Manning, Christopher D. [1]
Affiliations
[1] Stanford Univ, Stanford, CA 94305 USA
[2] Ecole Polytech Fed Lausanne, Lausanne, Switzerland
Keywords
DOI
Not available
Chinese Library Classification
TP18 [Theory of artificial intelligence];
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Automated fact-checking is a needed technology to curtail the spread of online misinformation. One current framework for such solutions proposes to verify claims by retrieving supporting or refuting evidence from related textual sources. However, the realistic use cases for fact-checkers will require verifying claims against evidence sources that could be affected by the same misinformation. Furthermore, the development of modern NLP tools that can produce coherent, fabricated content would allow malicious actors to systematically generate adversarial disinformation for fact-checkers. In this work, we explore the sensitivity of automated fact-checkers to synthetic adversarial evidence in two simulated settings: ADVERSARIAL ADDITION, where we fabricate documents and add them to the evidence repository available to the fact-checking system, and ADVERSARIAL MODIFICATION, where existing evidence source documents in the repository are automatically altered. Our study across multiple models on three benchmarks demonstrates that these systems suffer significant performance drops against these attacks. Finally, we discuss the growing threat of modern NLG systems as generators of disinformation in the context of the challenges they pose to automated fact-checkers.
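The two attack settings named in the abstract can be illustrated with a toy sketch. This is not the authors' implementation: the token-overlap retriever, the function names (`retrieve`, `adversarial_addition`, `adversarial_modification`), and the example documents are all hypothetical and chosen only to show how fabricated or altered evidence can displace genuine evidence at retrieval time.

```python
from collections import Counter


def tokenize(text):
    # Lowercase and strip trailing punctuation; a deliberately crude tokenizer.
    return [t.strip(".,").lower() for t in text.split()]


def retrieve(claim, repository, k=1):
    """Rank documents by token overlap with the claim (hypothetical stand-in retriever)."""
    claim_tokens = Counter(tokenize(claim))

    def overlap(doc):
        return sum((claim_tokens & Counter(tokenize(doc))).values())

    return sorted(repository, key=overlap, reverse=True)[:k]


def adversarial_addition(repository, fabricated_docs):
    """ADVERSARIAL ADDITION (toy version): append fabricated documents to the repository."""
    return list(repository) + list(fabricated_docs)


def adversarial_modification(repository, index, altered_text):
    """ADVERSARIAL MODIFICATION (toy version): alter one existing document in the repository."""
    poisoned = list(repository)
    poisoned[index] = altered_text
    return poisoned


# Hypothetical evidence repository and a false claim to verify.
repository = [
    "The Eiffel Tower is located in Paris.",
    "Mount Everest is the tallest mountain on Earth.",
]
claim = "The Eiffel Tower is located in Berlin."

added = adversarial_addition(
    repository, ["Reports confirm the Eiffel Tower is located in Berlin."]
)
modified = adversarial_modification(
    repository, 0, "The Eiffel Tower is located in Berlin."
)

print(retrieve(claim, repository)[0])  # genuine evidence is retrieved
print(retrieve(claim, added)[0])       # fabricated document outranks it
print(retrieve(claim, modified)[0])    # altered document replaces it
```

In this sketch a downstream verifier would see refuting evidence only in the clean repository; under either attack, the top-ranked evidence appears to support the false claim. Real systems use learned retrievers and verifiers, so the sketch shows only the shape of the threat model, not its measured effect.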
Pages: 10581-10589
Page count: 9