Detecting ongoing events using contextual word and sentence embeddings

被引:1
|
作者
Maisonnave, Mariano [1 ]
Delbianco, Fernando [1 ]
Tohme, Fernando [1 ]
Maguitman, Ana [1 ]
Milios, Evangelos [2 ]
机构
[1] Univ Nacl Sur, Bahia Blanca, Buenos Aires, Argentina
[2] Dalhousie Univ, Halifax, NS, Canada
关键词
Ongoing Event Detection; Information Extraction; Contextual embeddings; BERT; RNN; CNN; EXTRACTION;
D O I
10.1016/j.eswa.2022.118257
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper introduces the Ongoing Event Detection (OED) task, which is a specific Event Detection task where the goal is to detect ongoing event mentions only, as opposed to historical, future, hypothetical, or other forms or events that are neither fresh nor current. Any application that needs to extract structured information about ongoing events from unstructured texts can take advantage of an OED system. The main contribution of this paper are the following: (1) it introduces the OED task along with a dataset manually labeled for the task; (2) it presents the design and implementation of an RNN model for the task that uses BERT embeddings to define contextual word and contextual sentence embeddings as attributes, which to the best of our knowledge were never used before for detecting ongoing events in news; (3) it presents an extensive empirical evaluation that includes (i) the exploration of different architectures and hyperparameters, (ii) an ablation test to study the impact of each attribute, and (iii) a comparison with a replication of a state-of-the-art model. The results offer several insights into the importance of contextual embeddings and indicate that the proposed approach is effective in the OED task, outperforming the baseline models.
引用
收藏
页数:13
相关论文
共 50 条
  • [31] On Character vs Word Embeddings as Input for English Sentence Classification
    Hammerton, James
    Vintro, Merce
    Kapetanakis, Stelios
    Sama, Michele
    INTELLIGENT SYSTEMS AND APPLICATIONS, VOL 1, 2019, 868 : 550 - 566
  • [32] Joint Model Using Character and Word Embeddings for Detecting Internet Slang Words
    Liu, Yihong
    Seki, Yohei
    TOWARDS OPEN AND TRUSTWORTHY DIGITAL SOCIETIES, ICADL 2021, 2021, 13133 : 18 - 33
  • [33] Joint Model Using Character and Word Embeddings for Detecting Internet Slang Words
    Liu, Yihong
    Seki, Yohei
    Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2021, 13133 LNCS : 18 - 33
  • [34] Semi-supervised Learning of Dialogue Acts Using Sentence Similarity Based on Word Embeddings
    Yang, Xiaohao
    Liu, Jia
    Chen, Zhenfeng
    Wu, Weilan
    2014 INTERNATIONAL CONFERENCE ON AUDIO, LANGUAGE AND IMAGE PROCESSING (ICALIP), VOLS 1-2, 2014, : 882 - 886
  • [35] Hybrid query expansion using lexical resources and word embeddings for sentence retrieval in question answering
    Esposito, Massimo
    Damiano, Ernanuele
    Minutolo, Aniello
    De Pietro, Giuseppe
    Fujita, Hamido
    INFORMATION SCIENCES, 2020, 514 : 88 - 105
  • [36] Unsupervised Word Polysemy Quantification with Multiresolution Grids of Contextual Embeddings
    Xypolopoulos, Christos
    Tixier, Antoine J-P
    Vazirgiannis, Michalis
    16TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (EACL 2021), 2021, : 3391 - 3401
  • [37] Contextual Word Embeddings and Topic Modeling in Healthy Dieting and Obesity
    Yeruva, Vijaya Kumari
    Junaid, Sidrah
    Lee, Yugyung
    JOURNAL OF HEALTHCARE INFORMATICS RESEARCH, 2019, 3 (02) : 159 - 183
  • [38] Contextual Word Embeddings and Topic Modeling in Healthy Dieting and Obesity
    Vijaya Kumari Yeruva
    Sidrah Junaid
    Yugyung Lee
    Journal of Healthcare Informatics Research, 2019, 3 : 159 - 183
  • [39] A Study on the Relevance of Generic Word Embeddings for Sentence Classification in Hepatic Surgery
    Oukelmoun, Achir
    Semmar, Nasredine
    de Chalendar, Gael
    Habran, Enguerrand
    Vibert, Eric
    Goblet, Emma
    Oukelmoun, Mariame
    Allard, Marc-Antoine
    2023 20TH ACS/IEEE INTERNATIONAL CONFERENCE ON COMPUTER SYSTEMS AND APPLICATIONS, AICCSA, 2023,
  • [40] Performance Evaluation of Word and Sentence Embeddings for Finance Headlines Sentiment Analysis
    Mishev, Kostadin
    Gjorgjevikj, Ana
    Stojanov, Riste
    Mishkovski, Igor
    Vodenska, Irena
    Chitkushev, Ljubomir
    Trajanov, Dimitar
    ICT INNOVATIONS 2019: BIG DATA PROCESSING AND MINING, 2019, 1110 : 161 - 172