Predicting search term reliability for spoken term detection systems

Cited by: 2
Authors
Torbati, Amir [1]
Picone, Joseph [1]
Affiliations
[1] Temple Univ, Dept Elect & Comp Engn, 1947 North 12th St, Philadelphia, PA 19027 USA
Funding
National Science Foundation (USA)
Keywords
Spoken term detection; Voice keyword search; Information retrieval
DOI
10.1007/s10772-013-9197-1
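Chinese Library Classification (CLC) number
TM [Electrical engineering]; TN [Electronic technology, communication technology]
Discipline classification codes
0808; 0809
Abstract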
Spoken term detection is an extension of text-based searching that allows users to type keywords and search audio files containing recordings of spoken language. Performance depends on many external factors, such as the acoustic channel, language, pronunciation variations, and acoustic confusability of the search term. Unlike text-based searches, the likelihoods of false alarms and misses for specific search terms, which we refer to as reliability, play a significant role in the overall perception of the usability of the system. In this paper, we present a system that predicts the reliability of a search term based on its inherent confusability. Our approach integrates predictors of reliability that are based on both acoustic and phonetic features. These predictors are trained using an analysis of recognition errors produced by a state-of-the-art spoken term detection system operating on the Fisher Corpus. This work represents the first large-scale attempt to predict the success of a keyword search term from only its spelling. We explore the complex relationship between phonetic and acoustic properties of search terms. We show that a 76% correlation between the predicted error rate and the actual measured error rate can be achieved, and that the remaining confusability is due to other acoustic modeling issues that cannot be derived from a search term's spelling.
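The abstract reports its result as a correlation between predicted and measured error rates. As a minimal sketch of how such a figure could be computed, the Python below implements the Pearson correlation coefficient. The choice of Pearson correlation, the function name `pearson_correlation`, and the sample values are assumptions made for illustration; the record does not state which correlation measure the authors used, and the numbers are placeholders, not data from the paper.

```python
import math


def pearson_correlation(predicted, measured):
    """Return the Pearson correlation between two equal-length sequences.

    Assumption: `predicted` holds per-term predicted error rates and
    `measured` holds the error rates observed for the same search terms.
    """
    if len(predicted) != len(measured):
        raise ValueError("sequences must have the same length")
    n = len(predicted)
    if n < 2:
        raise ValueError("at least two data points are required")

    mean_p = sum(predicted) / n
    mean_m = sum(measured) / n

    # Sums of centered cross-products and centered squares.
    cov = sum((p - mean_p) * (m - mean_m) for p, m in zip(predicted, measured))
    var_p = sum((p - mean_p) ** 2 for p in predicted)
    var_m = sum((m - mean_m) ** 2 for m in measured)

    if var_p == 0 or var_m == 0:
        raise ValueError("correlation is undefined for a constant sequence")
    return cov / math.sqrt(var_p * var_m)


if __name__ == "__main__":
    # Placeholder values for illustration only (not results from the paper).
    predicted_error = [0.10, 0.25, 0.40, 0.55, 0.70]
    measured_error = [0.15, 0.20, 0.50, 0.45, 0.80]
    r = pearson_correlation(predicted_error, measured_error)
    print(f"correlation = {r:.2f}")
```

Under this reading, a value near 1 would mean that terms predicted to be unreliable are also the ones the detection system gets wrong most often.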
Pages: 1-9
Number of pages: 9