Empower event detection with bi-directional neural language model

被引:15
|
作者
Zhang, Yunyan [1 ,2 ]
Xu, Guangluan [1 ]
Wang, Yang [1 ]
Liang, Xiao [1 ]
Wang, Lei [1 ]
Huang, Tinglei [1 ]
机构
[1] Chinese Acad Sci, Inst Elect, Beijing 100190, Peoples R China
[2] Univ Chinese Acad Sci, Beijing 100190, Peoples R China
关键词
Information extraction; Event detection; Multi-task learning; Language model;
D O I
10.1016/j.knosys.2019.01.008
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Event detection is an essential and challenging task in Information Extraction (IE). Recent advances in neural networks make it possible to build reliable models without complicated feature engineering. However, data scarcity hinders their further performance. Moreover, training data has been underused since majority of labels in datasets are not event triggers and contribute very little to the training process. In this paper, we propose a novel multi-task learning framework to extract more general patterns from raw data and make better use of the training data. Specifically, we present two paradigms to incorporate neural language model into event detection model on both word and character levels: (1) we use the features extracted by language model as an additional input to event detection model. (2) We use a hard parameter sharing approach between language model and event detection model. The extensive experiments demonstrate the benefits of the proposed multi-task learning framework for event detection. Compared to the previous methods, our method does not rely on any additional supervision but still beats the majority of them and achieves a competitive performance on the ACE 2005 benchmark. (C) 2019 Elsevier B.V. All rights reserved.
引用
收藏
页码:87 / 97
页数:11
相关论文
共 50 条
  • [21] Fake News Detection using Bi-directional LSTM-Recurrent Neural Network
    Bahad, Pritika
    Saxena, Preeti
    Kamal, Raj
    2ND INTERNATIONAL CONFERENCE ON RECENT TRENDS IN ADVANCED COMPUTING ICRTAC -DISRUP - TIV INNOVATION , 2019, 2019, 165 : 74 - 82
  • [22] Bi-Directional Recurrent Attentional Topic Model
    Li, Shuangyin
    Zhang, Yu
    Pan, Rong
    ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2020, 14 (06)
  • [23] The Model and Application of Bi-directional Electrothermal Microactuator
    Shen, Xuejin
    Hu, Yi
    Zhou, Ling
    Wang, Zhenlu
    Chen, Xiaoyang
    INTEGRATED FERROELECTRICS, 2014, 153 (01) : 9 - 22
  • [24] A Bi-Directional Procedural Model for Architectural Design
    Hua, H.
    COMPUTER GRAPHICS FORUM, 2017, 36 (08) : 219 - 231
  • [25] Bi-directional associative memory model with delays
    Fu, Yuli
    Zhao, Yong
    Fan, Zun
    Liao, Xiaoxin
    Huazhong Ligong Daxue Xuebao/Journal Huazhong (Central China) University of Science and Technology, 2000, 28 (07): : 80 - 82
  • [26] A Novel Bi-directional Interrelated Model for Joint Intent Detection and Slot Filling
    E, Haihong
    Niu, Peiqing
    Chen, Zhongfu
    Song, Meina
    57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019), 2019, : 5467 - 5471
  • [27] Socialized Language Model Smoothing via Bi-directional Influence Propagation on Social Networks
    Yan, Rui
    Li, Cheng-Te
    Hsieh, Hsun-Ping
    Hu, Po
    Hu, Xiaohua
    He, Tingting
    PROCEEDINGS OF THE 25TH INTERNATIONAL CONFERENCE ON WORLD WIDE WEB (WWW'16), 2016, : 1395 - 1405
  • [28] Time series prediction by a neural network model based on the bi-directional computation style
    Wakuya, H
    Zurada, JM
    IJCNN 2000: PROCEEDINGS OF THE IEEE-INNS-ENNS INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOL II, 2000, : 225 - 230
  • [29] Analysis of Bi-directional Reranking Model for Uyghur-Chinese Neural Machine Translation
    Zhang X.
    Li X.
    Yang Y.
    Wang L.
    Dong R.
    Beijing Daxue Xuebao (Ziran Kexue Ban)/Acta Scientiarum Naturalium Universitatis Pekinensis, 2020, 56 (01): : 31 - 38
  • [30] Neurstruc Energy:A bi-directional GNN model for energy prediction of neural networks in IoT
    Chaopeng Guo
    Zhaojin Zhong
    Zexin Zhang
    Jie Song
    Digital Communications and Networks, 2024, 10 (02) : 439 - 449