Extraction and representation of contextual information for knowledge discovery in texts

被引:25
|
作者
Perrin, P [1 ]
Petry, FE
机构
[1] Merck Res Labs, Med Chem Mol Syst, Rahway, NJ 07065 USA
[2] Tulane Univ, Dept Elect Engn & Comp Sci, New Orleans, LA 70118 USA
关键词
text mining; text feature construction; extraction and selection; collocational expressions; text representation; first-order logic;
D O I
10.1016/S0020-0255(02)00400-0
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper studies the role of lexical contextual relations for the problem of unsupervised knowledge discovery in full texts. Narrative texts have inherent structure dictated by language usage in generating them. We suggest that the relative distance of terms within a text gives sufficient information about its structure and its relevant content. Furthermore, this structure can be used to discover implicit knowledge embedded in the text, therefore serving as a good candidate to represent effectively the text content for knowledge elicitation tasks. We qualitatively demonstrate that a useful text structure and content can be systematically extracted by collocational lexical analysis without the need to encode any supplemental sources of knowledge. We present an algorithm that systematically extracts the most relevant facts in the texts and labels them by their overall theme, dictated by local contextual information. It exploits domain independent lexical frequencies and mutual information measures to find the relevant Contextual units in the texts. We report results from experiments in a real-world textual database of psychiatric evaluation reports. (C) 2002 Elsevier Science Inc. All rights reserved.
引用
收藏
页码:125 / 152
页数:28
相关论文
共 50 条
  • [31] Natural language processing for knowledge discovery and information extraction from energetics corpora
    VanGessel, Francis G.
    Perry, Efrem
    Mohan, Salil
    Barham, Oliver M.
    Cavolowsky, Mark
    PROPELLANTS EXPLOSIVES PYROTECHNICS, 2023, 48 (11)
  • [32] Ontological Approach: Knowledge Representation and Knowledge Extraction
    Ataeva, O. M.
    Serebryakov, V. A.
    Tuchkova, N. P.
    LOBACHEVSKII JOURNAL OF MATHEMATICS, 2020, 41 (10) : 1938 - 1948
  • [33] Ontological Approach: Knowledge Representation and Knowledge Extraction
    O. M. Ataeva
    V. A. Serebryakov
    N. P. Tuchkova
    Lobachevskii Journal of Mathematics, 2020, 41 : 1938 - 1948
  • [34] The role of classification in knowledge representation and discovery
    Kwasnik, BH
    LIBRARY TRENDS, 1999, 48 (01) : 22 - 47
  • [35] ContextMiner: Mining Contextual Features for Conceptualizing Knowledge in Security Texts
    Gutierrez, Luis Felipe
    Namin, Akbar
    IEEE ACCESS, 2022, 10 : 85891 - 85904
  • [36] SEARCHING FOR INFORMATION IN KNOWLEDGE MAPS AND TEXTS
    ODONNELL, A
    CONTEMPORARY EDUCATIONAL PSYCHOLOGY, 1993, 18 (02) : 222 - 239
  • [37] SEARCHING FOR INFORMATION IN KNOWLEDGE MAPS OR TEXTS
    ODONNELL, A
    INTERNATIONAL JOURNAL OF PSYCHOLOGY, 1992, 27 (3-4) : 130 - 130
  • [38] KNOWLEDGE STRUCTURE AND SELECTION OF INFORMATION TEXTS
    FLAMMER, A
    BUCHEL, F
    GUTMANN, W
    ZEITSCHRIFT FUR EXPERIMENTELLE UND ANGEWANDTE PSYCHOLOGIE, 1976, 23 (01): : 30 - 44
  • [39] INFORMATION EXTRACTION AND REPRESENTATION IN MRI
    HYLTON, NM
    ORTENDAHL, DA
    KAUFMAN, L
    CROOKS, LE
    IEEE TRANSACTIONS ON BIOMEDICAL ENGINEERING, 1985, 32 (10) : 885 - 885
  • [40] On information and knowledge representation in the brain
    Wang, YX
    Liu, D
    SECOND IEEE INTERNATIONAL CONFERENCE ON COGNITIVE INFORMATICS, PROCEEDINGS, 2003, : 26 - 31