Event Detection in Wikipedia Edit History Improved by Documents Web Based Automatic Assessment

Cited by: 4
Authors
Fisichella, Marco [1]
Ceroni, Andrea [2]
Affiliations
[1] Leibniz Univ Hannover, Res Ctr L3S, D-30167 Hannover, Germany
[2] Joblift, D-10437 Berlin, Germany
Keywords
Wikipedia; user edits; event detection; event validation; temporal retrieval; clustering;
DOI
10.3390/bdcc5030034
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Most current work on event extraction assumes that the relationships stored in knowledge bases are static. In collaborative environments such as Wikipedia, however, information and systems are highly dynamic over time. In this work, we introduce a new approach for extracting complex event structures from Wikipedia. We propose a new model that represents an event through multiple entities and that generalizes to an arbitrary language. The evolution of an event is captured by analyzing the user edit history in Wikipedia. Our work provides a basis for a new class of evolution-aware, entity-based enrichment algorithms and can substantially improve the quality of entity accessibility and temporal retrieval for Wikipedia. We formalize the problem and conduct comprehensive experiments on a real dataset of 1.8 million Wikipedia articles to show the effectiveness of the proposed solution. Furthermore, we propose a new automatic event validation method that relies on a supervised model to predict the presence of events in a non-annotated corpus. As the additional document source for event validation, we chose the Web because of its ease of access and wide event coverage. Our results show that the method achieves 70% precision when evaluated on a manually annotated corpus. Finally, we compare our approach with the Current Event Portal of Wikipedia and find that the proposed WikipEvent, together with the Co-References technique, can be used to provide new and additional data on events.
Pages: 23