Causal Pattern Representation Learning for Extracting Causality from Literature

被引:2
|
作者
Yang, Jiaoyun [1 ]
Xiong, Hao [1 ]
Zhang, Hongjin [2 ]
Hu, Min [1 ]
An, Ning [1 ]
机构
[1] Hefei Univ Technol, Hefei, Anhui, Peoples R China
[2] Bepsun Eurotech Solut Oy, Helsinki, Finland
基金
中国国家自然科学基金;
关键词
Causality Extraction; Causal Pattern; Representation Learning; Graph Convolution Networks;
D O I
10.1145/3578741.3578787
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Extracting causality from literature has become an important task due to the essential role of causality. Traditional methods use pattern matching to extract causality, requiring domain knowledge and extensive human effort. Recent researches focus on utilizing pre-trained language models due to their success in Natural Language Processing (NLP). However, long sentences in literature hinders the performance of causality extraction. In this paper, we propose to focus on the representation of causal virtual pattern <head_entity, causal_virtual_trigger, tail_entity> and design a Causal Pattern Representation Learning (CPRL) method to tackle this challenge. For the causal_virtual_trigger representation, CPRL applies the attention mechanism on the shortest dependency path between entities to filter irrelevant information. For the head_entity and tail_entity representation, CPRL applies graph convolution networks to encode word dependency on entities. By crawling health-related literature abstracts, we create a new causality extraction dataset, namely HealthCE, with a size of 3479. Experiments on HealthCE demonstrate the effectiveness of our approach over existing causality extraction and general relation extraction baselines on the task of causality extraction.
引用
收藏
页码:229 / 233
页数:5
相关论文
共 50 条
  • [1] Extracting causal relations on HIV drug resistance from literature
    Bui, Quoc-Chinh
    Nuallain, Breanndan O.
    Boucher, Charles A.
    Sloot, Peter M. A.
    BMC BIOINFORMATICS, 2010, 11
  • [2] Extracting causal relations from the literature with word vector mapping
    An, Ning
    Xiao, Yongbo
    Yuan, Jing
    Yang, Jiaoyun
    Alterovitz, Gil
    COMPUTERS IN BIOLOGY AND MEDICINE, 2019, 115
  • [3] Extracting causal relations on HIV drug resistance from literature
    Quoc-Chinh Bui
    Breanndán Ó Nualláin
    Charles A Boucher
    Peter MA Sloot
    BMC Bioinformatics, 11
  • [4] BISCUIT: Causal Representation Learning from Binary Interactions
    Lippe, Phillip
    Magliacane, Sara
    Lowe, Sindy
    Asano, Yuki M.
    Cohen, Taco
    Gavves, Efstratios
    UNCERTAINTY IN ARTIFICIAL INTELLIGENCE, 2023, 216 : 1263 - 1273
  • [5] Learning Causal Semantic Representation from Information Extraction
    Zuo Xin
    Wang LiMin
    Zhou Shuang
    2009 INTERNATIONAL SYMPOSIUM ON INTELLIGENT UBIQUITOUS COMPUTING AND EDUCATION, 2009, : 404 - +
  • [6] Toward Causal Representation Learning
    Schoelkopf, Bernhard
    Locatello, Francesco
    Bauer, Stefan
    Ke, Nan Rosemary
    Kalchbrenner, Nal
    Goyal, Anirudh
    Bengio, Yoshua
    PROCEEDINGS OF THE IEEE, 2021, 109 (05) : 612 - 634
  • [7] Interventional Causal Representation Learning
    Ahuja, Kartik
    Mahajan, Divyat
    Wang, Yixin
    Bengio, Yoshua
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 202, 2023, 202 : 372 - 407
  • [8] Improving Event Causality Identification via Self-Supervised Representation Learning on External Causal Statement
    Zuo, Xinyu
    Cao, Pengfei
    Chen, Yubo
    Liu, Kang
    Zhao, Jun
    Peng, Weihua
    Chen, Yuguang
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL-IJCNLP 2021, 2021, : 2162 - 2172
  • [9] Learning Distortion Invariant Representation for Image Restoration from A Causality Perspective
    Li, Xin
    Li, Bingchen
    Jin, Xin
    Lan, Cuiling
    Chen, Zhibo
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR, 2023, : 1714 - 1724
  • [10] From session causality to causal consistency
    Brzezinski, J
    Sobaniec, C
    Wawrzyniak, D
    12TH EUROMICRO CONFERENCE ON PARALLEL, DISTRIBUTED AND NETWORK-BASED PROCESSING, PROCEEDINGS, 2004, : 152 - 158