Causal Relation Extraction Based on Graph Attention Networks

被引:0
|
作者
Xu J. [1 ]
Zuo W. [1 ,2 ]
Liang S. [1 ]
Wang Y. [1 ,2 ]
机构
[1] College of Computer Science and Technology, Jilin University, Changchun
[2] Key Laboratory of Symbol Computation and Knowledge Engineering (Jilin University), Ministry of Education, Changchun
基金
中国国家自然科学基金;
关键词
Bidirectional long short-term memory (Bi-LSTM); Causal relation extraction; Graph attention networks (GATs); Sequence labeling; Syntactic dependency graph;
D O I
10.7544/issn1000-1239.2020.20190042
中图分类号
学科分类号
摘要
Causality represents a kind of correlation between cause and effect, where the happening of cause will leads to the happening of effect. As the most important type of relationship between entities, causality plays a vital role in many fields such as automatic reasoning and scenario generation. Therefore, extracting causal relation becomes a basic task in natural language processing and text mining. Different from traditional text classification methods or relation extraction methods, this paper proposes a sequence labeling method to extract causal entity in text and identify direction of causality, without relying on feature engineering or causal background knowledge. The main contributions of this paper can be summarized as follows: 1) we extend syntactic dependency tree to the syntactic dependency graph, adopt graph attention networks in natural language processing, and introduce the concept of S-GAT(graph attention network based on syntactic dependency graph); 2) Bi-LSTM+CRF+S-GAT model for causal extraction is proposed, which generates causal label of each word in sentence based on input word vectors; 3) SemEval data set is modified and extended, and rules are defined to relabel experimental data with an aim of overcoming defects of the original labeling method. Extensive experiments are conducted on the expanded SemEval dataset, which shows that our model achieves 0.064 improvement over state-of-the-art model Bi-LSTM+CRF+self-ATT in terms of prediction accuracy. © 2020, Science Press. All right reserved.
引用
收藏
页码:159 / 174
页数:15
相关论文
共 22 条
  • [1] Radinsky K., Davidovich S., Markovitch S., Learning causality for news events prediction, Proc of the 21st Int Conf on World Wide Web, pp. 909-918, (2012)
  • [2] Girju R., Automatic detection of causal relations for question answering, Proc of the ACL 2003 Workshop on Multilingual Summarization and Question Answering, 12, pp. 76-83, (2003)
  • [3] Hashimoto C., Torisawa K., Kloetzer J., Et al., Toward future scenario generation: Extracting event causality exploiting semantic relation, context, and association features, Proc of the 52nd Annual Meeting of the Association for Computational Linguistics, pp. 987-997, (2014)
  • [4] Velickovic P., Cucurull G., Casanova A., Et al., Graph attention networks, (2017)
  • [5] Garcia D., COATIS, an NLP system to locate expressions of actions connected by causality links, Proc of the Int Conf on Knowledge Engineering and Knowledge Management, pp. 347-352, (1997)
  • [6] Khoo C.S.G., Kornfilt J., Oddy R.N., Et al., Automatic extraction of cause-effect information from newspaper text without knowledge-based inferencing, Literary and Linguistic Computing, 13, 4, pp. 177-186, (1998)
  • [7] de Silva T.N., Xiao Z., Zhao R., Et al., Causal relation identification using convolutional neural networks and knowledge based features, International Journal of Computer and Systems Engineering, 11, 6, pp. 697-702, (2017)
  • [8] Hidey C., McKeown K., Identifying causal relations using parallel Wikipedia articles, Proc of the 54th Annual Meeting of the Association for Computational Linguistics, pp. 1424-1433, (2016)
  • [9] Zhao S., Liu T., Zhao S., Et al., Event causality extraction based on connectives analysis, Neurocomputing, 173, pp. 1943-1950, (2016)
  • [10] Feng C., Kang L., Shi G., Et al., Causality extraction with GAN, Acta Automatica Sinica, 44, 5, pp. 811-818, (2018)