Credible, Unreliable or Leaked?: Evidence Verification for Enhanced Automated Fact-checking

被引:1
|
作者
Chrysidis, Zacharias [1 ]
Papadopoulos, Stefanos-Iordanis [2 ]
Papadopoulos, Symeon [2 ]
Petrantonakis, Panagiotis C. [1 ]
机构
[1] Aristotle Univ Thessaloniki, Dept Elect & Comp Engn, Thessaloniki, Greece
[2] CERTH, Informat Technol Inst, Thessaloniki, Greece
关键词
Deep Learning; Misinformation Detection; Automated Fact-Checking; Evidence Filtering; Information Leakage; INFORMATION;
D O I
10.1145/3643491.3660278
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Automated fact-checking (AFC) is garnering increasing attention by researchers aiming to help fact-checkers combat the increasing spread of misinformation online. While many existing AFC methods incorporate external information from theWeb to help examine the veracity of claims, they often overlook the importance of verifying the source and quality of collected "evidence". One overlooked challenge involves the reliance on "leaked evidence", information gathered directly from fact-checking websites and used to train AFC systems, resulting in an unrealistic setting for early misinformation detection. Similarly, the inclusion of information from unreliable sources can undermine the effectiveness of AFC systems. To address these challenges, we present a comprehensive approach to evidence verification and filtering. We create the "CREDible, Unreliable or LEaked" (CREDULE) dataset, which consists of 91,632 articles classified as Credible, Unreliable and Fact-checked (Leaked). Additionally, we introduce the EVidence VERification Network (EVVER-Net), trained on CREDULE to detect leaked and unreliable evidence in both short and long texts. EVVER-Net can be used to filter evidence collected from theWeb, thus enhancing the robustness of end-to-end AFC systems. We experiment with various language models and show that EVVER-Net can demonstrate impressive performance of up to 91.5% and 94.4% accuracy, while leveraging domain credibility scores along with short or long texts, respectively. Finally, we assess the evidence provided by widely-used fact-checking datasets including LIAR-PLUS, MOCHEG, FACTIFY, NewsCLIPpings+ and VERITE, some of which exhibit concerning rates of leaked and unreliable evidence.
引用
收藏
页码:73 / 81
页数:9
相关论文
共 50 条
  • [1] A Survey on Automated Fact-Checking
    Guo, Zhijiang
    Schlichtkrull, Michael
    Vlachos, Andreas
    TRANSACTIONS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, 2022, 10 : 178 - 206
  • [2] Improving Evidence Retrieval for Automated Explainable Fact-Checking
    Samarinas, Chris
    Hsu, Wynne
    Lee, Mong Li
    2021 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES: DEMONSTRATIONS (NAACL-HLT 2021), 2021, : 84 - 91
  • [3] Automated fact-checking: A survey
    Zeng, Xia
    Abumansour, Amani S.
    Zubiaga, Arkaitz
    LANGUAGE AND LINGUISTICS COMPASS, 2021, 15 (10):
  • [4] Multimodal Automated Fact-Checking: A Survey
    Akhtar, Mubashara
    Schlichtkrull, Michael
    Guo, Zhijiang
    Cocarascu, Oana
    Simperl, Elena
    Vlachos, Andreas
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS - EMNLP 2023, 2023, : 5430 - 5448
  • [5] Automated Fact-Checking of Claims from Wikipedia
    Sathe, Aalok
    Ather, Salar
    Tuan Manh Le
    Perry, Nathan
    Park, Joonsuk
    PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2020), 2020, : 6874 - 6882
  • [6] Automated Fact-Checking for Assisting Human Fact-Checkers
    Nakov, Preslav
    Corney, David
    Hasanain, Maram
    Alam, Firoj
    Elsayed, Tamer
    Barron-Cedeno, Alberto
    Papotti, Paolo
    Shaar, Shaden
    Da San Martino, Giovanni
    PROCEEDINGS OF THE THIRTIETH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2021, 2021, : 4551 - 4558
  • [7] Fact-Checking and Information Verification in the Context of Journalism Education
    Shesterkina, Lyudmla P.
    Lobodenko, Lidiya K.
    Krasavina, Anna, V
    Marfitsyna, Arina R.
    THEORETICAL AND PRACTICAL ISSUES OF JOURNALISM, 2021, 10 (01): : 94 - 108
  • [8] AmbiFC : Fact-Checking Ambiguous Claims with Evidence
    Glockner, Max
    Staliunaite, Ieva
    Thorne, James
    Vallejo, Gisela
    Vlachos, Andreas
    Gurevych, Iryna
    TRANSACTIONS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, 2024, 12 : 1 - 18
  • [9] Automated Fact-Checking in Dialogue: Are Specialized Models Needed?
    Chamoun, Eric
    Saeidi, Marzieh
    Vlachos, Andreas
    2023 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2023), 2023, : 16009 - 16020
  • [10] Explainable Automated Fact-Checking for Public Health Claims
    Kotonya, Neema
    Toni, Francesca
    PROCEEDINGS OF THE 2020 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP), 2020, : 7740 - 7754