Predicting search term reliability for spoken term detection systems

被引:2
|
作者
Torbati, Amir [1 ]
Picone, Joseph [1 ]
机构
[1] Temple Univ, Dept Elect & Comp Engn, 1947 North 12th St, Philadelphia, PA 19027 USA
基金
美国国家科学基金会;
关键词
Spoken term detection; Voice keyword search; Information retrieval;
D O I
10.1007/s10772-013-9197-1
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Spoken term detection is an extension of text-based searching that allows users to type keywords and search audio files containing recordings of spoken language. Performance is dependent on many external factors such as the acoustic channel, language, pronunciation variations and acoustic confusability of the search term. Unlike text-based searches, the likelihoods of false alarms and misses for specific search terms, which we refer to as reliability, play a significant role in the overall perception of the usability of the system. In this paper, we present a system that predicts the reliability of a search term based on its inherent confusability. Our approach integrates predictors of the reliability that are based on both acoustic and phonetic features. These predictors are trained using an analysis of recognition errors produced from a state of the art spoken term detection system operating on the Fisher Corpus. This work represents the first large-scale attempt to predict the success of a keyword search term from only its spelling. We explore the complex relationship between phonetic and acoustic properties of search terms. We show that a 76 % correlation between the predicted error rate and the actual measured error rate can be achieved, and that the remaining confusability is due to other acoustic modeling issues that cannot be derived from a search term's spelling.
引用
收藏
页码:1 / 9
页数:9
相关论文
共 50 条
  • [41] TOPIC DEPENDENT LANGUAGE MODELLING FOR SPOKEN TERM DETECTION
    Kalantari, Shahram
    Dean, David
    Sridharan, Sridha
    Wallace, Roy
    2014 PROCEEDINGS OF THE 22ND EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2014, : 949 - 953
  • [42] Constructing Japanese Test Collections for Spoken Term Detection
    Itoh, Yoshiaki
    Nishizaki, Hiromitsu
    Hu, Xinhui
    Nanjo, Hiroaki
    Akiba, Tomoyosi
    Kawahara, Tatsuya
    Nakagawa, Seiichi
    Matsui, Tomoko
    Yamashita, Yoichi
    Aikawa, Kiyoaki
    11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 1-2, 2010, : 677 - +
  • [43] USING RHYTHMIC FEATURES FOR JAPANESE SPOKEN TERM DETECTION
    Kanda, Naoyuki
    Takeda, Ryu
    Obuchi, Yasunari
    2012 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY (SLT 2012), 2012, : 170 - 175
  • [44] Efficient System Combination for Chinese Spoken Term Detection
    Gao Jie
    Shao Jian
    Zhao Qingwei
    Yan Yonghong
    CHINESE JOURNAL OF ELECTRONICS, 2010, 19 (03): : 457 - 462
  • [45] SYSTEM COMBINATION AND SCORE NORMALIZATION FOR SPOKEN TERM DETECTION
    Mamou, Jonathan
    Cui, Jia
    Cui, Xiaodong
    Gales, Mark J. F.
    Kingsbury, Brian
    Knill, Kate
    Mangu, Lidia
    Nolden, David
    Picheny, Michael
    Ramabhadran, Bhuvana
    Schlueter, Ralf
    Sethy, Abhinav
    Woodland, Philip C.
    2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 8272 - 8276
  • [46] Evolutionary discriminative confidence estimation for spoken term detection
    Javier Tejedor
    Alejandro Echeverría
    Dong Wang
    Ravichander Vipperla
    Multimedia Tools and Applications, 2013, 62 : 5 - 34
  • [47] EFFECT OF PRONUNCIATIONS ON OOV QUERIES IN SPOKEN TERM DETECTION
    Can, Dogan
    Cooper, Erica
    Sethy, Abhinav
    White, Chris
    Ramabhadran, Bhuvana
    Saraclar, Murat
    2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, : 3957 - +
  • [48] Metric Subspace Indexing for Fast Spoken Term Detection
    Kaneko, Taisuke
    Akiba, Tomoyosi
    11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 1-2, 2010, : 689 - 692
  • [49] Spoken Term Detection Automatically Adjusted for a Given Threshold
    Fuchs, Tzeviya
    Keshet, Joseph
    IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2017, 11 (08) : 1310 - 1317
  • [50] CNN based Query by Example Spoken Term Detection
    Ram, Dhananjay
    Miculicich, Lesly
    Bourlard, Herve
    19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 92 - 96