The Cambridge University spoken document retrieval system

被引:22
|
作者
Johnson, SE [1 ]
Jourlin, P [1 ]
Moore, GL [1 ]
Jones, KS [1 ]
Woodland, PC [1 ]
机构
[1] Univ Cambridge, Dept Engn, Cambridge CB2 1PZ, England
关键词
D O I
10.1109/ICASSP.1999.758059
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This paper describes the spoken document retrieval system that we have been developing and assesses its performance using automatic transcriptions of about 50 hours of broadcast news data. The recognition engine is based on the HTK broadcast news transcription system and the retrieval engine is based on the techniques developed at City University. The retrieval performance over a wide range of speech transcription error rates is presented and a number of recognition error metrics that more accurately reflect the impact of transcription errors on retrieval accuracy are defined and computed. The results demonstrate the importance of high accuracy automatic transcription. The final system is currently being evaluated on the 1998 TREC-7 spoken document retrieval task.
引用
收藏
页码:49 / 52
页数:4
相关论文
共 50 条
  • [1] Cambridge University spoken document retrieval system
    Johnson, S.E.
    Jourlin, P.
    Moore, G.L.
    Sparck Jones, K.
    Woodland, P.C.
    ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, 1999, 1 : 49 - 52
  • [2] The Cambridge University multimedia document retrieval demo system
    Tuerk A.
    Johnson S.E.
    Jourlin P.
    Jones K.S.
    Woodland P.C.
    International Journal of Speech Technology, 2001, 4 (3-4) : 241 - 250
  • [3] The RWTH speech recognition system and spoken document retrieval
    Ney, H
    Welling, L
    Ortmanns, S
    Beulen, K
    Wessel, E
    IECON '98 - PROCEEDINGS OF THE 24TH ANNUAL CONFERENCE OF THE IEEE INDUSTRIAL ELECTRONICS SOCIETY, VOLS 1-4, 1998, : 2022 - 2027
  • [4] RWTH speech recognition system and spoken document retrieval
    RWTH Aachen - Univ of Technology, Aachen, Germany
    IECON Proc, 1600, (2022-2027):
  • [5] Spoken Document Retrieval System based on Phonemic Transcribing
    Tatarinova, Alexandra
    Prozorov, Dmitriy
    2017 IEEE EAST-WEST DESIGN & TEST SYMPOSIUM (EWDTS), 2017,
  • [6] Experiments in spoken document retrieval
    Jones, K.Sparck
    Jones, G.J.F.
    Foote, J.T.
    Young, S.J.
    Information Processing and Management, 1996, 32 (04): : 399 - 417
  • [7] An architecture for spoken document retrieval
    Terol, RM
    Martínez-Barco, P
    Palomar, M
    TEXT, SPEECH AND DIALOGUE, PROCEEDINGS, 2004, 3206 : 505 - 511
  • [8] Experiments in spoken document retrieval
    Sparck-Jones, K
    Jones, GJF
    Foote, JT
    Young, SJ
    INFORMATION PROCESSING & MANAGEMENT, 1996, 32 (04) : 399 - 417
  • [9] Spoken document retrieval experiments with IR-n system
    Llopis, F
    Martínez-Barco, P
    COMPARATIVE EVALUATION OF MULTILINGUAL INFORMATION ACCESS SYSTEMS, 2003, 3237 : 664 - 671
  • [10] Exploring an Unsupervised, Language Independent, Spoken Document Retrieval System
    Caranica, Alexandru
    Cucu, Horia
    Buzo, Andi
    2016 14TH INTERNATIONAL WORKSHOP ON CONTENT-BASED MULTIMEDIA INDEXING (CBMI), 2016,