Identifying Patterns in Texts

被引:2
|
作者
Huang, Minhua [1 ]
Haralick, Robert M. [1 ]
机构
[1] CUNY, Grad Ctr, New York, NY 10016 USA
关键词
D O I
10.1109/ICSC.2009.22
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We discuss a probabilistic graphical model for recognizing patterns in texts. It is derived from the probability function for a sequence of categories given a sequence of symbols under two reasonable conditional independence assumptions and represented by a product of combinations of conditional and marginal probability functions The novelty of our model is that it has a mathematical representation which is completely different from existing graphical models such as CRFs, HMMs, and MEMMs. Moreover, it can be used for identifying various patterns in texts. Up to now, we have used this model for recognizing NP chunks and senses of a polysemous word in sentences. This model has achieved very promising results on standard data sets. In the future, we will use this model for extracting semantic roles in a sentence
引用
收藏
页码:59 / 64
页数:6
相关论文
共 50 条
  • [21] Identifying Patterns in Sequences of Variables
    Zanarini, Alessandro
    Van Hentenryck, Pascal
    INTEGRATION OF AI AND OR TECHNIQUES IN CONSTRAINT PROGRAMMING FOR COMBINATORIAL OPTIMIZATION PROBLEMS, 2011, 6697 : 246 - 251
  • [22] Communicative patterns in Romanian workplace written texts
    Saftoiu, Razvan
    Gheorghe, Mihaela
    Mada, Stanca
    REVISTA SIGNOS, 2010, 43 (74): : 489 - 515
  • [23] Patterns of similarity in the language of Middle English texts
    Jones, A
    PARERGON, 1997, 14 (02) : 51 - 65
  • [24] Identifying and Removing the Silences of Roma Culture in Polish School Texts
    Swietek, Agnes
    Brunn, Stanley
    Wogtowicz, Bazena
    JOURNAL OF GEOGRAPHY, 2019, 118 (04) : 169 - 184
  • [25] Identifying the Presence of Graphical Texts in Scene Images using CNN
    Ghosh, Mridul
    Mukherjee, Himadri
    Obaidullah, Sk Md
    Santosh, K. C.
    Das, Nibaran
    Roy, Kaushik
    2019 INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION WORKSHOPS (ICDARW) AND 13TH IAPR INTERNATIONAL WORKSHOP ON GRAPHICS RECOGNITION (GREC 2019), VOL 1, 2019, : 86 - 91
  • [26] Identifying and Improving Dataset References in Social Sciences Full Texts
    Ghavimi, Behnam
    Mayr, Philipp
    Vahdati, Sahar
    Lange, Christoph
    POSITIONING AND POWER IN ACADEMIC PUBLISHING: PLAYERS, AGENTS AND AGENDAS, 2016, : 105 - 114
  • [27] IDENTIFYING KEYWORDS ON THE BASIS OF CONTENT MONITORING METHOD IN UKRAINIAN TEXTS
    Bisikalo, O., V
    Vysotska, V. A.
    RADIO ELECTRONICS COMPUTER SCIENCE CONTROL, 2016, (01) : 74 - 83
  • [28] A Novel Method for Identifying Bipolar Disorder Based on Diagnostic Texts
    Gao, Hua
    Chen, Li
    Zhou, Yi
    Chi, Kaikai
    Chan, Sixian
    PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2023, PT IV, 2024, 14428 : 52 - 63
  • [29] Concept Class Analysis: A Method for Identifying Cultural Schemas in Texts
    Taylor, Marshall A.
    Stoltz, Dustin S.
    SOCIOLOGICAL SCIENCE, 2020, 7 : 544 - 569
  • [30] Identifying Neuroimaging Patterns in Mitochondrial Leukoencephalopathies
    Sharma, S.
    Peterson, J.
    Alves, C.
    Goldstein, A.
    ANNALS OF NEUROLOGY, 2023, 94 : S100 - S100