Extraction and representation of contextual information for knowledge discovery in texts

被引:25
|
作者
Perrin, P [1 ]
Petry, FE
机构
[1] Merck Res Labs, Med Chem Mol Syst, Rahway, NJ 07065 USA
[2] Tulane Univ, Dept Elect Engn & Comp Sci, New Orleans, LA 70118 USA
关键词
text mining; text feature construction; extraction and selection; collocational expressions; text representation; first-order logic;
D O I
10.1016/S0020-0255(02)00400-0
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper studies the role of lexical contextual relations for the problem of unsupervised knowledge discovery in full texts. Narrative texts have inherent structure dictated by language usage in generating them. We suggest that the relative distance of terms within a text gives sufficient information about its structure and its relevant content. Furthermore, this structure can be used to discover implicit knowledge embedded in the text, therefore serving as a good candidate to represent effectively the text content for knowledge elicitation tasks. We qualitatively demonstrate that a useful text structure and content can be systematically extracted by collocational lexical analysis without the need to encode any supplemental sources of knowledge. We present an algorithm that systematically extracts the most relevant facts in the texts and labels them by their overall theme, dictated by local contextual information. It exploits domain independent lexical frequencies and mutual information measures to find the relevant Contextual units in the texts. We report results from experiments in a real-world textual database of psychiatric evaluation reports. (C) 2002 Elsevier Science Inc. All rights reserved.
引用
收藏
页码:125 / 152
页数:28
相关论文
共 50 条
  • [1] Contextual text representation for unsupervised knowledge discovery in texts
    Perrin, P
    Petry, F
    RESEARCH AND DEVELOPMENT IN KNOWLEDGE DISCOVERY AND DATA MINING, 1998, 1394 : 246 - 257
  • [2] Knowledge representation issues in information extraction
    Wee, LKA
    Tong, LC
    Tan, CL
    PRICAI'98: TOPICS IN ARTIFICIAL INTELLIGENCE, 1998, 1531 : 448 - 458
  • [3] Conditional Knowledge Extraction Using Contextual Information Enhancement
    Xu, Zhangbiao
    Zhang, Botao
    Gu, Jinguang
    Gao, Feng
    APPLIED SCIENCES-BASEL, 2023, 13 (08):
  • [4] Information extraction and knowledge linkage of geological profiles and related contextual texts from mineral exploration reports for geological knowledge graphs construction
    Qiu, Qinjun
    Duan, Yuxi
    Ma, Kai
    Tao, Liufeng
    Xie, Zhong
    ORE GEOLOGY REVIEWS, 2023, 163
  • [5] Representation, Analysis, and Extraction of Knowledge from Unstructured Natural Language Texts
    Hoherchak, H.
    Darchuk, N.
    Kryvyi, S.
    CYBERNETICS AND SYSTEMS ANALYSIS, 2021, 57 (03) : 481 - 500
  • [6] Representation, Analysis, and Extraction of Knowledge from Unstructured Natural Language Texts
    H. Hoherchak
    N. Darchuk
    S. Kryvyi
    Cybernetics and Systems Analysis, 2021, 57 : 481 - 500
  • [7] Semantic Network Formalism for Knowledge Representation: Towards Consideration of Contextual Information
    Mallat, Souheyl
    Hkiri, Emna
    Maraoui, Mohsen
    Zrigui, Mounir
    INTERNATIONAL JOURNAL ON SEMANTIC WEB AND INFORMATION SYSTEMS, 2015, 11 (04) : 64 - 85
  • [8] Knowledge representation and information extraction for analysing architectural patterns
    Velasco-Elizondo, Perla
    Marin-Pina, Rosario
    Vazquez-Reyes, Sodel
    Mora-Soto, Arturo
    Mejia, Jezreel
    SCIENCE OF COMPUTER PROGRAMMING, 2016, 121 : 176 - 189
  • [9] FrameNet CNL: A Knowledge Representation and Information Extraction Language
    Barzdins, Guntis
    CONTROLLED NATURAL LANGUAGE, CNL 2014, 2014, 8625 : 90 - 101
  • [10] Searching for image information content, its discovery, extraction, and representation
    Diamant, E
    JOURNAL OF ELECTRONIC IMAGING, 2005, 14 (01) : 1 - 11