Combination of similarity measures for effective spoken document retrieval

被引:8
|
作者
Crestani, F [1 ]
机构
[1] Univ Strathclyde, Dept Comp & Informat Sci, Glasgow G1 1XH, Lanark, Scotland
关键词
D O I
10.1177/016555103763031572
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Often users of information retrieval systems and document authors use different terms to refer to the same concept. For this simple reason, information retrieval is affected by the 'term mismatch' problem. The term mismatch problem does not only have the effect of hindering the retrieval of relevant documents, it also produces bad rankings of relevant documents. A similar problem can be found in spoken document retrieval, where terms misrecognized by the speech recognition process can hinder the retrieval of potentially relevant spoken documents. We will call this problem 'term misrecognition', by analogy to the term mismatch problem. This paper presents two classes of retrieval models that attempt to tackle both the term mismatch and the term misrecognition problems at retrieval time using term similarity information. The models use either complete or partial knowledge of semantic and phonetic term similarity, evaluated using statistical methods from the corpus.
引用
收藏
页码:87 / 96
页数:10
相关论文
共 50 条
  • [1] Divergence-based similarity measure for spoken document retrieval
    Liu, Peng
    Soong, Frank K.
    Zhou, Jian-Lai
    2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL IV, PTS 1-3, 2007, : 89 - +
  • [2] Using semantic and phonetic term similarity for spoken document retrieval and spoken query processing
    Crestani, F
    TECHNOLOGIES FOR CONSTRUCTING INTELLIGENT SYSTEMS 1: TASKS, 2002, 89 : 363 - 375
  • [3] Effective Measures for Inter-Document Similarity
    Whissell, John S.
    Clarke, Charles L. A.
    PROCEEDINGS OF THE 22ND ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT (CIKM'13), 2013, : 1361 - 1370
  • [4] Ranking invariance based on similarity measures in document retrieval
    Omhover, JF
    Rifqi, M
    Detyniecki, M
    ADAPTIVE MULTIMEDIA RETRIEVAL: USER, CONTEXT, AND FEEDBACK, 2006, 3877 : 55 - 64
  • [5] EFFECTIVE PSEUDO-RELEVANCE FEEDBACK FOR SPOKEN DOCUMENT RETRIEVAL
    Chen, Yi-Wen
    Chen, Kuan-Yu
    Wang, Hsin-Min
    Chen, Berlin
    2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 8535 - 8539
  • [6] Experiments in spoken document retrieval
    Jones, K.Sparck
    Jones, G.J.F.
    Foote, J.T.
    Young, S.J.
    Information Processing and Management, 1996, 32 (04): : 399 - 417
  • [7] An architecture for spoken document retrieval
    Terol, RM
    Martínez-Barco, P
    Palomar, M
    TEXT, SPEECH AND DIALOGUE, PROCEEDINGS, 2004, 3206 : 505 - 511
  • [8] Experiments in spoken document retrieval
    Sparck-Jones, K
    Jones, GJF
    Foote, JT
    Young, SJ
    INFORMATION PROCESSING & MANAGEMENT, 1996, 32 (04) : 399 - 417
  • [9] Novel similarity measures for the effective and efficient retrieval of pharmacological datasets
    Rivera Borroto, Oscar Miguel
    Hernandez Diaz, Yoandy
    Manuel Garcia de la Vega, Jose
    Grau Abalo, Ricardo del Corazon
    Marrero Ponce, Yovani
    AFINIDAD, 2011, 68 (551) : 50 - 56
  • [10] Spoken document representations for probabilistic retrieval
    Jourlin, P
    Johnson, SE
    Sparck-Jones, K
    Woodland, PC
    SPEECH COMMUNICATION, 2000, 32 (1-2) : 21 - 36