Automated Fact-Checking of Claims from Wikipedia

被引:0
|
作者
Sathe, Aalok [1 ]
Ather, Salar [1 ]
Tuan Manh Le [1 ]
Perry, Nathan [2 ]
Park, Joonsuk [1 ]
机构
[1] Univ Richmond, Dept Math & Comp Sci, Richmond, VA 23173 USA
[2] Williams Coll, Dept Comp Sci, Williamstown, MA 01267 USA
关键词
fact-checking; fact-verification; natural language inference; textual entailment; corpus;
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Automated fact checking is becoming increasingly vital as both truthful and fallacious information accumulate online. Research on fact checking has benefited from large-scale datasets such as FEVER and SNLI. However, such datasets suffer from limited applicability due to the synthetic nature of claims and/or evidence written by annotators that differ from real claims and evidence on the internet. To this end, we present WIKIFACTCHECK-ENGLISH, a dataset of 124k+ triples consisting of a claim, context and an evidence document extracted from English Wikipedia articles and citations, as well as 34k+ manually written claims that are refuted by the evidence documents. This is the largest fact checking dataset consisting of real claims and evidence to date; it will allow the development of fact checking systems that can better process claims and evidence in the real world. We also show that for the NLI subtask, a logistic regression system trained using existing and novel features achieves peak accuracy of 68%, providing a competitive baseline for future work. Also, a decomposable attention model trained on SNLI significantly underperforms the models trained on this dataset, suggesting that models trained on manually generated data may not be sufficiently generalizable or suitable for fact checking real-world claims.
引用
收藏
页码:6874 / 6882
页数:9
相关论文
共 50 条
  • [31] FACT-CHECKING IN A POLARIZED WORLD
    Cavus, Gulin
    TURKISH POLICY QUARTERLY, 2020, 19 (03): : 57 - 65
  • [32] The fact-checking dilemma: Fact-checking increases the reputation of the fact-checker but creates perceptions of ideological bias
    Aruguete, Natalia
    Calvo, Ernesto
    Ventura, Tiago
    RESEARCH & POLITICS, 2025, 12 (01)
  • [33] Fact-checking as a deterrent? A conceptual replication of the influence of fact-checking on the sharing of misinformation by political elites
    Ma, Siyuan
    Bergan, Daniel
    Ahn, Suhwoo
    Carnahan, Dustin
    Gimby, Nate
    McGraw, Johnny
    Virtue, Isabel
    HUMAN COMMUNICATION RESEARCH, 2023, 49 (03) : 321 - 338
  • [34] The Chicago Guide to Fact-Checking
    Dar, Mahnaz
    Knapp, Maggie
    Lothrop, Patricia
    Medina, Gary
    Parrott, Kiera
    Pugl, Dave
    Selwyn, Laurie
    Sendaula, Stephanie
    Tench, Robert
    LIBRARY JOURNAL, 2017, 142 (04) : 32 - 32
  • [35] Increasing Demand for Fact-Checking
    Graham, Matthew H.
    Porter, Ethan V.
    POLITICAL COMMUNICATION, 2025, 42 (02) : 325 - 348
  • [36] FACT-CHECKING FISA APPLICATIONS
    Groden, Claire
    NEW YORK UNIVERSITY LAW REVIEW, 2021, 96 (05) : 1634 - 1674
  • [37] Fact-Checking Lawrence of Arabia
    Powell, Eric A.
    ARCHAEOLOGY, 2016, 69 (04) : 12 - 12
  • [39] Unpacking Multimodal Fact-Checking: Features and Engagement of Fact-Checking Videos on Chinese TikTok (Douyin)
    Lu, Yingdan
    Shen, Cuihua
    SOCIAL MEDIA + SOCIETY, 2023, 9 (01):
  • [40] Fact-checking algorithms for the Internet
    Martynov, A. S.
    Voronina, I. E.
    APPLIED MATHEMATICS, COMPUTATIONAL SCIENCE AND MECHANICS: CURRENT PROBLEMS, 2020, 1479