Event type induction using latent variables with hierarchical relationship analysis

被引:0
|
作者
Yan, Xin [1 ]
Liu, Fangchang [1 ]
Jiang, Lincheng [2 ]
Long, Youlong [1 ]
机构
[1] Wuhan Univ Technol, Sch Comp & Artificial Intelligence, Luoshi Rd 122, Wuhan 430000, Peoples R China
[2] Natl Univ Def Technol, Coll Adv Interdisciplinary Studies, Changsha 410200, Peoples R China
基金
中国国家自然科学基金;
关键词
Event extraction; event type induction; deep learning; latent variable;
D O I
10.3233/WEB-230394
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The conventional approach to event extraction requires predefined event types and their corresponding annotations to train event extractors. However, these prerequisites are often difficult to satisfy in real-world applications. To automatically induct event types, most work has been devoted to clustering event triggers, where a cluster of event triggers is represented as an event type. Some works use trigger semantics, while others use co-occurrence relationships to cluster triggers. However, the clustering results of event triggers obtained by the above work are not sufficiently detailed in describing event types, making it difficult to accurately determine the corresponding event types manually. This paper proposes an open-domain event type induction framework that automatically discovers a set of event types from a given corpus. Unlike previous work on event trigger clustering, this paper takes into consideration the hierarchical relationship of event types to partition the event trigger clusters into event mains and subtypes. The framework employs a latent variable-based neural generation module and a semantic-based clustering module, the former of which obtains event trigger clusters representing the main types of events by jointly projecting the co-occurrence and semantic information of event triggers into a latent space for event type latent variable mining, and the latter of which further divides these event trigger clusters into event subtypes based on semantic information. Finally, experiment results show that, compared with the benchmark model, the ETGen-Clus can improve event type quality scores of 6.23% and 3.11% on the two datasets, respectively.
引用
收藏
页码:611 / 626
页数:16
相关论文
共 50 条
  • [31] High dimensional mediation analysis with latent variables
    Derkach, Andriy
    Pfeiffer, Ruth M.
    Chen, Ting-Huei
    Sampson, Joshua N.
    BIOMETRICS, 2019, 75 (03) : 745 - 756
  • [32] Hierarchical latent variables structure for topic aware multi-turn conversation
    Cui, Fuwei
    Di, Hui
    Huang, Hui
    Ouchi, Kazushige
    Liu, Ze
    Xu, Jinan
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2022, 43 (03) : 3805 - 3814
  • [33] A Bayesian Analysis of Exogeneity in Models with Latent Variables
    Pajor, Anna
    CENTRAL EUROPEAN JOURNAL OF ECONOMIC MODELLING AND ECONOMETRICS, 2011, 3 (02): : 49 - 73
  • [34] Regression analysis of logistic model with latent variables
    Ye, Yuan
    Liu, Zhongchun
    Pan, Deng
    Wu, Yuanshan
    STATISTICS IN MEDICINE, 2023, 42 (06) : 860 - 877
  • [35] A Tutorial in Bayesian Mediation Analysis With Latent Variables
    Miocevic, Milica
    METHODOLOGY-EUROPEAN JOURNAL OF RESEARCH METHODS FOR THE BEHAVIORAL AND SOCIAL SCIENCES, 2019, 15 (04) : 137 - 146
  • [36] Hierarchical latent class models for cluster analysis
    Zhang, NL
    EIGHTEENTH NATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE (AAAI-02)/FOURTEENTH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE (IAAI-02), PROCEEDINGS, 2002, : 230 - 237
  • [37] Hierarchical latent class models for cluster analysis
    Zhang, NL
    JOURNAL OF MACHINE LEARNING RESEARCH, 2004, 5 : 697 - 723
  • [38] Formation of Hierarchical Object Concept Using Hierarchical Latent Dirichlet Allocation
    Ando, Yoshiki
    Nakamura, Tomoaki
    Araki, Takaya
    Nagai, Takayuki
    2013 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2013, : 2272 - 2279
  • [39] LATENT SEMANTIC ANALYSIS IN SOUND EVENT DETECTION
    Mesaros, Annamaria
    Heittola, Toni
    Klapuri, Anssi
    19TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO-2011), 2011, : 1307 - 1311
  • [40] Identification of latent variables in a semantic odor profile database using principal component analysis
    Zarzo, M.
    Stanton, D.
    CHEMICAL SENSES, 2006, 31 (05) : A144 - A144