ZERO RESOURCE GRAPH-BASED CONFIDENCE ESTIMATION FOR OPEN VOCABULARY SPOKEN TERM DETECTION

Authors
Norouzian, Atta [1]
Rose, Richard [1]
Ghalehjegh, Sina Hamidi [1]
Jansen, Aren [2, 3]
Affiliations
[1] McGill Univ, Dept Elect & Comp Engn, Montreal, PQ, Canada
[2] Human Knowledge Technol Ctr Excellence, Montreal, PQ, Canada
[3] Johns Hopkins Univ, Dept Elect & Comp Engn, Baltimore, MD 21218 USA
Keywords
Open vocabulary spoken term detection; Dotplot; Random walk on directional graphs
DOI
Not available
Chinese Library Classification number
O42 [Acoustics]
Discipline classification codes
070206; 082403
Abstract
This paper investigates the use of acoustic similarity between speech intervals to generate improved confidence scores for spoken term detection (STD). A procedure based on acoustic dotplots, which requires no training data, is used to discover similar speech intervals. A graph-based random walk algorithm then incorporates the acoustic similarity of hypothesized term occurrences to improve the corresponding confidence scores. The proposed approach is evaluated on an open vocabulary STD task defined on a lecture-domain corpus. Updating the confidence scores in this fashion yields a significant increase in detection performance for out-of-vocabulary search terms: a 12.9% relative improvement in figure of merit over a baseline lattice-based STD system.
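The abstract does not give the update rule, so the following is only a minimal sketch of one common way a random walk over a similarity graph can re-score hypotheses: a random walk with restart that interpolates each hypothesis's initial score with the scores of its acoustically similar neighbours. It is not the authors' algorithm. The function name `random_walk_rescore`, the interpolation weight `alpha`, and the assumption that a pairwise similarity matrix is already available (for example, from some dotplot-style comparison) are all hypothetical choices made for illustration.

```python
import numpy as np


def random_walk_rescore(initial_scores, similarity, alpha=0.5, n_iter=100, tol=1e-9):
    """Illustrative random walk with restart over a similarity graph.

    initial_scores: 1-D sequence of confidence scores, one per hypothesis.
    similarity: square matrix; similarity[i][j] >= 0 is the weight of the
        directed edge from hypothesis i to hypothesis j (assumed given).
    alpha: hypothetical interpolation weight in [0, 1); 0 keeps the
        initial scores unchanged.
    """
    s0 = np.asarray(initial_scores, dtype=float)
    w = np.array(similarity, dtype=float)  # copy so the input is not modified
    np.fill_diagonal(w, 0.0)               # ignore self-similarity

    # Row-normalise edge weights into transition probabilities.
    row_sums = w.sum(axis=1, keepdims=True)
    p = np.divide(w, row_sums, out=np.zeros_like(w), where=row_sums > 0)

    # Hypotheses with no neighbours get a self-loop, so their score stays put.
    isolated = np.flatnonzero(row_sums.ravel() == 0)
    p[isolated, isolated] = 1.0

    # Iterate s = (1 - alpha) * s0 + alpha * P s until the scores stop changing.
    s = s0.copy()
    for _ in range(n_iter):
        s_next = (1.0 - alpha) * s0 + alpha * (p @ s)
        converged = np.max(np.abs(s_next - s)) < tol
        s = s_next
        if converged:
            break
    return s


# Toy example: hypotheses 0 and 1 are acoustically similar, 2 is isolated.
scores = [0.9, 0.2, 0.5]
sim = [[0.0, 1.0, 0.0],
       [1.0, 0.0, 0.0],
       [0.0, 0.0, 0.0]]
updated = random_walk_rescore(scores, sim, alpha=0.5)
```

In this toy setting the low-scoring hypothesis 1 is pulled up towards its similar, high-scoring neighbour, while the isolated hypothesis 2 keeps its initial score. Because the transition matrix is row-normalised, the similarity matrix does not have to be symmetric, which matches the "directional graphs" keyword only loosely; how the paper builds and weights its graph is not stated in this record.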
Pages: 8292 - 8296
Number of pages: 5
Related papers
50 in total
  • [1] ZERO-RESOURCE SPOKEN TERM DETECTION USING HIERARCHICAL GRAPH-BASED SIMILARITY SEARCH
    Aoyama, Kazuo
    Ogawa, Atsunori
    Hattori, Takashi
    Hori, Takaaki
    Nakamura, Atsushi
    2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014
  • [2] Contextual Verification for Open Vocabulary Spoken Term Detection
    Schneider, Daniel
    Mertens, Timo
    Larson, Martha
    Koehler, Joachim
    11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 1-2, 2010, : 697 - 700
  • [3] An approach for efficient open vocabulary spoken term detection
    Norouzian, Atta
    Rose, Richard
    SPEECH COMMUNICATION, 2014, 57 : 50 - 62
  • [4] Direct Posterior Confidence for Out-of-Vocabulary Spoken Term Detection
    Wang, Dong
    King, Simon
    Frankel, Joe
    Vipperla, Ravichander
    Evans, Nicholas
    Troncy, Raphael
    ACM TRANSACTIONS ON INFORMATION SYSTEMS, 2012, 30 (03)
  • [5] Evolutionary discriminative confidence estimation for spoken term detection
    Javier Tejedor
    Alejandro Echeverría
    Dong Wang
    Ravichander Vipperla
    Multimedia Tools and Applications, 2013, 62 : 5 - 34
  • [6] Evolutionary discriminative confidence estimation for spoken term detection
    Tejedor, Javier
    Echeverria, Alejandro
    Wang, Dong
    Vipperla, Ravichander
    MULTIMEDIA TOOLS AND APPLICATIONS, 2013, 62 (01) : 5 - 34
  • [7] Term-Dependent Confidence Normalisation for Out-of-Vocabulary Spoken Term Detection
    Javier Tejedor
    Simon King
    Joe Frankel
    Journal of Computer Science & Technology, 2012, 27 (02) : 358 - 375
  • [8] Term-Dependent Confidence Normalisation for Out-of-Vocabulary Spoken Term Detection
    Wang, Dong
    Tejedor, Javier
    King, Simon
    Frankel, Joe
    JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2012, 27 (02) : 358 - 375
  • [9] Term-Dependent Confidence Normalisation for Out-of-Vocabulary Spoken Term Detection
    Dong Wang
    Javier Tejedor
    Simon King
    Joe Frankel
    Journal of Computer Science and Technology, 2012, 27 : 358 - 375
  • [10] IMPROVED SPOKEN TERM DETECTION WITH GRAPH-BASED RE-RANKING IN FEATURE SPACE
    Chen, Yun-Nung
    Chen, Chia-Ping
    Lee, Hung-Yi
    Chan, Chun-An
    Lee, Lin-Shan
    2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 5644 - 5647