Graphical Models for Text: A New Paradigm for Text Representation and Processing

被引:0
|
作者
Aggarwal, Charu C. [1 ]
Zhao, Peixiang [1 ]
机构
[1] IBM TJ Watson Res Ctr, Hawthorne, NY USA
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Almost all text applications use the well known vector-space model for text representation and analysis. While the vector-space model has proven itself to be an effective and efficient representation for mining purposes, it does not preserve information about the ordering of the words in the representation. In this paper, we will introduce the concept of distance graph representations of text data. Such representations preserve distance and ordering information between the words, and provide a much richer representation of the underlying text. This approach enables knowledge discovery from text which is not possible with the use of a pure vector-space representation, because it loses much less information about the ordering of the underlying words. Furthermore, this representation does not require the development of new mining and management techniques. This is because the technique can also be converted into a structural version of the vector-space representation, which allows the use of all existing tools for text. In addition, existing techniques for graph and XML data can be directly leveraged with this new representation. Thus, a much wider spectrum of algorithms is available for processing this representation.
引用
收藏
页码:899 / 900
页数:2
相关论文
共 50 条
  • [1] Towards graphical models for text processing
    Charu C. Aggarwal
    Peixiang Zhao
    Knowledge and Information Systems, 2013, 36 : 1 - 21
  • [2] Towards graphical models for text processing
    Aggarwal, Charu C.
    Zhao, Peixiang
    KNOWLEDGE AND INFORMATION SYSTEMS, 2013, 36 (01) : 1 - 21
  • [3] Graphical Representation of Text Semantics
    Fossoh, Karl Kevin Tiba
    Lo, Dan
    ACMSE 2020: PROCEEDINGS OF THE 2020 ACM SOUTHEAST CONFERENCE, 2020, : 318 - 319
  • [4] Experiments with learning graphical models on text
    Capdevila J.
    Zhao H.
    Petitjean F.
    Buntine W.
    Behaviormetrika, 2018, 45 (2) : 363 - 387
  • [5] DISCOURSE PROCESSING AND TEXT REPRESENTATION FROM A MENTAL MODELS PERSPECTIVE
    GARNHAM, A
    OAKHILL, J
    LANGUAGE AND COGNITIVE PROCESSES, 1992, 7 (3-4): : 193 - 204
  • [6] Improving cross-content transfer in text processing by means of active graphical representation
    Stern, E
    Aprea, C
    Ebner, HG
    LEARNING AND INSTRUCTION, 2003, 13 (02) : 191 - 203
  • [7] Representation of structured data of the text genre as a technique for automatic text processing
    Fonseca, Claudia Aparecida
    Carvalho Guelpeli, Marcus Vinicius
    de Souza Netto, Rafael Santiago
    TEXTO LIVRE-LINGUAGEM E TECNOLOGIA, 2022, 15
  • [8] COGNITIVE-PSYCHOLOGY AND TEXT-PROCESSING - FROM TEXT REPRESENTATION TO TEXT-WORLD
    DENHIERE, G
    BAUDET, S
    SEMIOTICA, 1989, 77 (1-3) : 271 - 293
  • [9] Survey of Neural Text Representation Models
    Babic, Karlo
    Martincic-Ipsic, Sanda
    Mestrovic, Ana
    INFORMATION, 2020, 11 (11) : 1 - 32
  • [10] Pyramid Text Recognition Based on A New Text Representation Model
    Su, Shaoxun
    Zhu, Nafei
    He, Jingsha
    PROCEEDINGS OF THE 2019 IEEE 16TH INTERNATIONAL CONFERENCE ON NETWORKING, SENSING AND CONTROL (ICNSC 2019), 2019, : 195 - 199