Vandalism Detection in Wikidata

被引:47
|
作者
Heindorf, Stefan [1 ]
Potthast, Martin [2 ]
Stein, Benno [2 ]
Engels, Gregor [1 ]
机构
[1] Univ Paderborn, Paderborn, Germany
[2] Bauhaus Univ Weimar, Weimar, Germany
关键词
Knowledge Base; Vandalism; Data Quality; Trust;
D O I
10.1145/2983323.2983740
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Wikidata is the new, large-scale knowledge base of the Wikimedia Foundation. Its knowledge is increasingly used within Wikipedia itself and various other kinds of information systems, imposing high demands on its integrity. Wikidata can be edited by anyone and, unfortunately, it frequently gets vandalized, exposing all information systems using it to the risk of spreading vandalized and falsified information. In this paper, we present a new machine learning-based approach to detect vandalism in Wikidata. We propose a set of 47 features that exploit both content and context information, and we report on 4 classifiers of increasing effectiveness tailored to this learning task. Our approach is evaluated on the recently published Wikidata Vandalism Corpus WDVC-2015 and it achieves an area under curve value of the receiver operating characteristic, ROCAUC, of 0.991. It significantly outperforms the state of the art represented by the rule-based Wikidata Abuse Filter (0.865 ROCAUC) and a prototypical vandalism detector recently introduced by Wikimedia within the Objective Revision Evaluation Service (0.859 ROCAUC).
引用
收藏
页码:327 / 336
页数:10
相关论文
共 50 条
  • [41] VANDALISM, WHATS THAT
    RODRIGUEZ, S
    THRUST-FOR EDUCATIONAL LEADERSHIP, 1977, 6 (05): : 3 - 3
  • [42] Vandalism in voluntary geographic information From concept to unsupervised anomaly detection
    Quy Thy Truong
    Touya, Guillaume
    de Runz, Cyril
    REVUE INTERNATIONALE DE GEOMATIQUE, 2019, 29 (01): : 31 - 56
  • [43] REVOLUTIONARY VANDALISM
    LANGLOIS, C
    HISTOIRE, 1987, (99): : 8 - 14
  • [44] VANDALISM IN CAPITOL
    不详
    NEW REPUBLIC, 1966, 155 (01) : 10 - 11
  • [45] Verbal vandalism
    Greaves, S
    CHEMISTRY & INDUSTRY, 1999, (09) : 330 - 330
  • [46] VANDALISM PREVENTION
    TORRES, DA
    POLICE CHIEF, 1981, 48 (03): : 21 - 23
  • [47] FIGHTING VANDALISM
    LESLIE, J
    SOUTH DAKOTA FARM & HOME RESEARCH, 1981, 32 (04): : 10 - 12
  • [48] Real-time automatic detection of vandalism behavior in video sequences
    Ghazal, Mohammed
    Vazquez, Carlos
    Amer, Aishy
    2007 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS, VOLS 1-8, 2007, : 2756 - +
  • [49] Space vandalism
    Matcham, J
    NEW SCIENTIST, 1999, 162 (2185) : 54 - 54
  • [50] About Vandalism
    Engelmeier, Hanna
    MERKUR-DEUTSCHE ZEITSCHRIFT FUR EUROPAISCHES DENKEN, 2022, 76 (878): : 99 - 102