WikipEvent: Leveraging Wikipedia Edit History for Event Detection

被引:0
|
作者
Tran, Tuan [1 ]
Ceroni, Andrea [1 ]
Georgescu, Mihai [1 ]
Naini, Kaweh Djafari [1 ]
Fisichella, Marco [1 ]
机构
[1] L3S Res Ctr, Hannover, Germany
关键词
Event Detection; Temporal Retrieval; Wikipedia; Clustering;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Much of existing work in information extraction assumes the static nature of relationships in fixed knowledge bases. However, in collaborative environments such as Wikipedia, information and structures are highly dynamic over time. In this work, we introduce a new method to extract complex event structures from Wikipedia. We propose a new model to represent events by engaging multiple entities, generalizable to an arbitrary language. The evolution of an event is captured effectively based on analyzing the user edits history in Wikipedia. Our work provides a foundation for a novel class of evolution-aware entity-based enrichment algorithms, and considerably increases the quality of entity accessibility and temporal retrieval for Wikipedia. We formalize this problem and introduce an efficient end-to-end platform as a solution. We conduct comprehensive experiments on a real dataset of 1.8 million Wikipedia articles to show the effectiveness of our proposed solution. Our results demonstrate that we are able to achieve a precision of 70% when evaluated using manually annotated data. Finally, we make a comparative analysis of our work with the well established Current Event Portal of Wikipedia and find that our system WikipEvent using Co-References method can be used in a complementary way to deliver new and more information about events.
引用
收藏
页码:90 / 108
页数:19
相关论文
共 50 条
  • [1] WikipEvent: Leveraging wikipedia edit history for event detection
    Tran, Tuan
    Ceroni, Andrea
    Georgescu, Mihai
    Naini, Kaweh Djafari
    Fisichella, Marco
    Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2014, 8787 : 90 - 108
  • [2] Event Detection in Wikipedia Edit History Improved by Documents Web Based Automatic Assessment
    Fisichella, Marco
    Ceroni, Andrea
    BIG DATA AND COGNITIVE COMPUTING, 2021, 5 (03)
  • [3] Detection of Bursty and Significant Keyphrases from Wikipedia edit history
    Chen, Zihang
    Iwaihara, Mizuho
    2019 IEEE INTERNATIONAL CONFERENCE ON BIG DATA AND SMART COMPUTING (BIGCOMP), 2019, : 99 - 102
  • [4] Edit-History Vis: An Interactive Visual Exploration and Analysis on Wikipedia Edit History
    Guo, Yuhan
    Han, Qin
    Lou, Yuke
    Wang, Yiming
    Liu, Can
    Yuan, Xiaoru
    2023 IEEE 16TH PACIFIC VISUALIZATION SYMPOSIUM, PACIFICVIS, 2023, : 157 - 166
  • [5] WHAD: Wikipedia historical attributes dataHistorical structured data extraction and vandalism detection from the Wikipedia edit history
    Enrique Alfonseca
    Guillermo Garrido
    Jean-Yves Delort
    Anselmo Peñas
    Language Resources and Evaluation, 2013, 47 : 1163 - 1190
  • [6] WHAD: Wikipedia historical attributes data Historical structured data extraction and vandalism detection from the Wikipedia edit history
    Alfonseca, Enrique
    Garrido, Guillermo
    Delort, Jean-Yves
    Penas, Anselmo
    LANGUAGE RESOURCES AND EVALUATION, 2013, 47 (04) : 1163 - 1190
  • [7] Learning To Split and Rephrase From Wikipedia Edit History
    Botha, Jan A.
    Faruqui, Manual
    Alex, John
    Baldridge, Jason
    Das, Dipanjan
    2018 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2018), 2018, : 732 - 737
  • [8] Tracking Topics on Revision Graphs of Wikipedia Edit History
    Li, Bonan
    Wu, Jianmin
    Iwaihara, Mizuho
    WEB-AGE INFORMATION MANAGEMENT, WAIM 2014, 2014, 8485 : 204 - 207
  • [9] Fluctuations in Wikipedia access-rate and edit-event data
    Kaempf, Mirko
    Tismer, Sebastian
    Kantelhardt, Jan W.
    Muchnik, Lev
    PHYSICA A-STATISTICAL MECHANICS AND ITS APPLICATIONS, 2012, 391 (23) : 6101 - 6111
  • [10] Quality Evaluation of Wikipedia Articles through Edit History and Editor Groups
    Wang, Se
    Iwaihara, Mizuho
    WEB TECHNOLOGIES AND APPLICATIONS, 2011, 6612 : 188 - 199