Fact-Saboteurs: A Taxonomy of Evidence Manipulation Attacks against Fact-Verification Systems

Authors
Abdelnabi, Sahar [1]
Fritz, Mario [1]
Affiliations
[1] CISPA Helmholtz Center for Information Security, Saarbrücken, Germany
Keywords
DOI
Not available
Chinese Library Classification
TP [Automation Technology, Computer Technology]
Discipline Classification Code
0812
Abstract
Mis- and disinformation are a substantial global threat to our security and safety. To cope with the scale of online misinformation, researchers have been working on automating fact-checking by retrieving relevant evidence and verifying claims against it. However, despite many advances, a comprehensive evaluation of the possible attack vectors against such systems is still lacking. In particular, the automated fact-verification process might be vulnerable to the exact disinformation campaigns it is trying to combat. In this work, we assume an adversary that automatically tampers with the online evidence in order to disrupt the fact-checking model by camouflaging the relevant evidence or planting misleading evidence. We first propose an exploratory taxonomy that spans these two targets and the different threat model dimensions. Guided by this, we design and propose several potential attack methods. We show that it is possible to subtly modify claim-salient snippets in the evidence and to generate diverse and claim-aligned evidence. As a result, we substantially degrade the fact-checking performance under many different permutations of the taxonomy's dimensions. The attacks are also robust against post-hoc modifications of the claim. Our analysis further hints at potential limitations in models' inference when faced with contradicting evidence. We emphasize that these attacks can have harmful implications for the inspectable and human-in-the-loop usage scenarios of such models, and we conclude by discussing challenges and directions for future defenses.
Pages: 6719-6736
Page count: 18