Extraction and representation of contextual information for knowledge discovery in texts

被引:25
|
作者
Perrin, P [1 ]
Petry, FE
机构
[1] Merck Res Labs, Med Chem Mol Syst, Rahway, NJ 07065 USA
[2] Tulane Univ, Dept Elect Engn & Comp Sci, New Orleans, LA 70118 USA
关键词
text mining; text feature construction; extraction and selection; collocational expressions; text representation; first-order logic;
D O I
10.1016/S0020-0255(02)00400-0
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper studies the role of lexical contextual relations for the problem of unsupervised knowledge discovery in full texts. Narrative texts have inherent structure dictated by language usage in generating them. We suggest that the relative distance of terms within a text gives sufficient information about its structure and its relevant content. Furthermore, this structure can be used to discover implicit knowledge embedded in the text, therefore serving as a good candidate to represent effectively the text content for knowledge elicitation tasks. We qualitatively demonstrate that a useful text structure and content can be systematically extracted by collocational lexical analysis without the need to encode any supplemental sources of knowledge. We present an algorithm that systematically extracts the most relevant facts in the texts and labels them by their overall theme, dictated by local contextual information. It exploits domain independent lexical frequencies and mutual information measures to find the relevant Contextual units in the texts. We report results from experiments in a real-world textual database of psychiatric evaluation reports. (C) 2002 Elsevier Science Inc. All rights reserved.
引用
收藏
页码:125 / 152
页数:28
相关论文
共 50 条
  • [21] Uncertainty Reduction for Knowledge Discovery and Information Extraction on the World Wide Web
    Ji, Heng
    Deng, Hongbo
    Han, Jiawei
    PROCEEDINGS OF THE IEEE, 2012, 100 (09) : 2658 - 2674
  • [22] Using background contextual knowledge for documents representation
    Kosmynin, A
    Davidson, I
    PRINCIPLES OF DOCUMENT PROCESSING, 1997, 1293 : 123 - 133
  • [23] EXTRACTION OF CONTEXTUAL INFORMATION FOR AUTOMOTIVE APPLICATIONS
    Beoldo, Andrea
    Dore, Alessio
    Regazzoni, Carlo S.
    2009 16TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOLS 1-6, 2009, : 1153 - 1156
  • [24] Information extraction and knowledge acquisition from texts using bilingual question-answering
    Kontos, J
    Malagardi, I
    JOURNAL OF INTELLIGENT & ROBOTIC SYSTEMS, 1999, 26 (02) : 103 - 122
  • [25] Information extraction and knowledge acquisition from texts using bilingual question-answering
    Kontos, John
    Malagardi, Ioanna
    Journal of Intelligent and Robotic Systems: Theory and Applications, 1999, 26 (02): : 103 - 122
  • [26] ARISTA causal knowledge discovery from texts
    Kontos, J
    Elmaoglou, A
    Malagardi, I
    DISCOVERY SCIENCE, PROCEEDINGS, 2002, 2534 : 348 - 355
  • [27] Integration of Contextual Information in Online Handwriting Representation
    Izadi, Sara
    Suen, Ching Y.
    IMAGE ANALYSIS AND PROCESSING - ICIAP 2009, PROCEEDINGS, 2009, 5716 : 132 - 142
  • [28] Information extraction from Greek texts
    Karra, M
    Bekakos, MP
    NEURAL, PARALLEL, AND SCIENTIFIC COMPUTATIONS, VOL 2, PROCEEDINGS, 2002, : 17 - 20
  • [29] Information Extraction of Texts in the Biomedical Domain
    Cotik, Viviana
    PROCEEDINGS OF THE TWENTY-FOURTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE (IJCAI), 2015, : 4357 - 4358
  • [30] Information Extraction for Cultural Heritage Knowledge Acquisition Using Word Vector Representation
    Buranasing, Watchira
    Phoomvuthisarn, Suronapee
    COMPLEX, INTELLIGENT, AND SOFTWARE INTENSIVE SYSTEMS, 2019, 772 : 419 - 430