Improving Open Information Extraction with Distant Supervision Learning

Cited by: 3
Authors
Han, Jiabao [1]
Wang, Hongzhi [1]
Affiliations
[1] Harbin Inst Technol, Sch Comp Sci & Technol, Harbin 150001, Peoples R China
Keywords
Distant supervision learning; Open information extraction; Neural network; Sequence-to-sequence model
DOI
10.1007/s11063-021-10548-0
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Open information extraction (Open IE), one of the essential applications in Natural Language Processing (NLP), has gained great attention in recent years. As a critical technology for building Knowledge Bases (KBs), it converts unstructured natural language sentences into structured representations, usually expressed as triples. Most conventional Open IE approaches rely on manually pre-defined extraction patterns or learn patterns from labeled training examples, both of which require substantial human effort. They also involve many NLP tools, which leads to error accumulation and propagation. With the rapid development of neural networks, neural models can minimize the error propagation problem, but they are data-hungry in the supervised setting. In particular, they rely on existing Open IE tools to generate training data, which causes data quality issues. In this paper, we employ a distant supervision learning approach to improve the Open IE task. We conduct extensive experiments with two popular sequence-to-sequence models (RNN and Transformer) and a large benchmark data set to demonstrate the performance of our approach.
Pages: 3287-3306
Page count: 20