Terminological paraphrase extraction from scientific literature based on predicate argument tuples

被引:4
|
作者
Choi, Sung-Pil [2 ]
Myaeng, Sung-Hyon [1 ]
机构
[1] Korea Adv Inst Sci & Technol, Div Web Sci & Technol, Taejon 305701, South Korea
[2] Korea Inst Sci & Technol Informat, Dept Software Res, Seoul, South Korea
基金
新加坡国家研究基金会;
关键词
information extraction; paraphrase extraction; predicate argument tuple; technical terms; terminological paraphrase; TEXTUAL ENTAILMENT; QUERY EXPANSION; RETRIEVAL; IMPACT;
D O I
10.1177/0165551512459920
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Terminological paraphrases (TPs) are sentences or phrases that express the concepts of terminologies in a different form. Here we propose an effective way to identify and extract TPs from large-scale scientific literature databases. We propose a novel method for effectively retrieving sentences that contain a given terminological concept based on semantic units called predicate-argument tuples. This method enables effective textual similarity computations and minimized errors based on six TP ranking models. For evaluation, we constructed an evaluation collection for the TP recognition task by extracting TPs from a target literature database using the proposed method. Through the two experiments, we learned that scientific literature contain many TPs that could not have been identified so far. Also, the experimental results showed the potential and extensibility of our proposed methods to extract the TPs.
引用
收藏
页码:593 / 611
页数:19
相关论文
共 50 条
  • [21] Term extraction and correlation analysis based on massive scientific and technical literature
    Zeng W.
    Xu H.
    Zhang J.
    Zeng, Wen (zengw@istic.ac.cn), 1600, Inderscience Publishers, 29, route de Pre-Bois, Case Postale 856, CH-1215 Geneva 15, CH-1215, Switzerland (15): : 248 - 255
  • [22] Review of Knowledge Elements Extraction in Scientific Literature Based on Deep Learning
    Li G.
    Yuan Y.
    Data Analysis and Knowledge Discovery, 2023, 7 (07) : 1 - 17
  • [23] Methodological Challenges for the Comparison of Results of Topic Extraction from Scientific Literature
    Velden, Theresa
    16TH INTERNATIONAL CONFERENCE ON SCIENTOMETRICS & INFORMETRICS (ISSI 2017), 2017, : 1558 - 1568
  • [24] PPaxe: easy extraction of protein occurrence and interactions from the scientific literature
    Castillo-Lara, S.
    Abril, J. F.
    BIOINFORMATICS, 2019, 35 (14) : 2523 - 2524
  • [25] ChemDataExtractor: A Toolkit for Automated Extraction of Chemical Information from the Scientific Literature
    Swain, Matthew C.
    Cole, Jacqueline M.
    JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2016, 56 (10) : 1894 - 1904
  • [26] Biological network extraction from scientific literature: state of the art and challenges
    Li, Chen
    Liakata, Maria
    Rebholz-Schuhmann, Dietrich
    BRIEFINGS IN BIOINFORMATICS, 2014, 15 (05) : 856 - 877
  • [27] High-Precision Extraction of Emerging Concepts from Scientific Literature
    King, Daniel
    Downey, Doug
    Weld, Daniel S.
    PROCEEDINGS OF THE 43RD INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR '20), 2020, : 1549 - 1552
  • [28] Multi-Input Multi-Output Sequence Labeling for Joint Extraction of Fact and Condition Tuples from Scientific Text
    Jiang, Tianwen
    Zhao, Tong
    Qin, Bing
    Liu, Ting
    Chawla, Nitesh, V
    Jiang, Meng
    2019 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING AND THE 9TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (EMNLP-IJCNLP 2019): PROCEEDINGS OF THE CONFERENCE, 2019, : 302 - 312
  • [29] Extraction of optimal synthesis conditions from scientific literature using a knowledge graph
    Kobayashi, Shigeru
    Kuwashiro, Norikazu
    Itoh, Fumiaki
    Sakurai, Dai
    Hitosugi, Taro
    SCIENCE AND TECHNOLOGY OF ADVANCED MATERIALS-METHODS, 2024, 4 (01):
  • [30] A Hybrid Human-Computer Approach to the Extraction of Scientific Facts from the Literature
    Tchoua, Roselyne B.
    Chard, Kyle
    Audus, Debra
    Qin, Jian
    de Pablo, Juan
    Foster, Ian
    INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE 2016 (ICCS 2016), 2016, 80 : 386 - 397