USING RHYTHMIC FEATURES FOR JAPANESE SPOKEN TERM DETECTION

被引:0
|
作者
Kanda, Naoyuki [1 ]
Takeda, Ryu [1 ]
Obuchi, Yasunari [1 ]
机构
[1] Hitachi Ltd, Cent Res Lab, Kokubunji, Tokyo 1858601, Japan
关键词
spoken term detection; spoken document retrieval; utterance verification; speech recognition;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
A new rescoring method for spoken term detection (STD) is proposed. Phoneme-based close-matching techniques have been used because of their ability to detect out-of-vocabulary (OOV) queries. To improve the accuracy of phoneme-based techniques, rescoring techniques have been used to accurately re-rank the results from phoneme-based close-matching; however, conventional rescoring techniques based on an utterance verification model still produce many false detection results. To further improve the accuracy, in this study, several features representing the "naturalness" (or "abnormality") of duration of phonemes/syllables in detected candidates of a keyword are proposed. These features are incorporated into a conventional rescoring technique using logistic regression. Experimental results with a 604-hour Japanese speech corpus indicated that combining the rhythmic features achieved a further relative error reduction of 8.9% compared to a conventional rescoring technique.
引用
收藏
页码:170 / 175
页数:6
相关论文
共 50 条
  • [41] Score Normalization using Phoneme-based Entropy for Spoken Term Detection
    Nishizaki, Hiromitsu
    Sawada, Naoki
    2015 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA), 2015, : 263 - 269
  • [42] Can You Repeat That? Using Word Repetition to Improve Spoken Term Detection
    Wintrode, Jonathan
    Khudanpur, Sanjeev
    PROCEEDINGS OF THE 52ND ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 1, 2014, : 1316 - 1325
  • [43] A novel approach for spoken term detection in Vietnamese
    Nguyen Hong Quang
    Trinh Van Loan
    Le Xuan Thanh
    2015 INTERNATIONAL CONFERENCE ON COMMUNICATIONS, MANAGEMENT AND TELECOMMUNICATIONS (COMMANTEL), 2015, : 68 - 72
  • [44] Spoken term detection for Turkish Broadcast News
    Parlak, Siddika
    Saraclar, Murat
    2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12, 2008, : 5244 - 5247
  • [45] English Spoken Term Detection in Multilingual Recordings
    Motlicek, Petr
    Valente, Fabio
    Garner, Philip N.
    11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 1-2, 2010, : 206 - 209
  • [46] Recent developments in spoken term detection: a survey
    Mandal, Anupam
    Kumar, K.
    Mitra, Pabitra
    INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2014, 17 (02) : 183 - 198
  • [47] ORDER-FREE SPOKEN TERM DETECTION
    Mangu, Lidia
    Saon, George
    Picheny, Michael
    Kingsbury, Brian
    2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 5331 - 5335
  • [48] Incorporating visual information for spoken term detection
    Kalantari, Shahram
    Dean, David
    Sridharan, Sridha
    16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 558 - 562
  • [49] Stochastic Pronunciation Modelling for Spoken Term Detection
    Wang, Dong
    King, Simon
    Frankel, Joe
    INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 2091 - 2094
  • [50] Predicting search term reliability for spoken term detection systems
    Torbati, Amir
    Picone, Joseph
    INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2014, 17 (01) : 1 - 9