USING RHYTHMIC FEATURES FOR JAPANESE SPOKEN TERM DETECTION

被引:0
|
作者
Kanda, Naoyuki [1 ]
Takeda, Ryu [1 ]
Obuchi, Yasunari [1 ]
机构
[1] Hitachi Ltd, Cent Res Lab, Kokubunji, Tokyo 1858601, Japan
关键词
spoken term detection; spoken document retrieval; utterance verification; speech recognition;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
A new rescoring method for spoken term detection (STD) is proposed. Phoneme-based close-matching techniques have been used because of their ability to detect out-of-vocabulary (OOV) queries. To improve the accuracy of phoneme-based techniques, rescoring techniques have been used to accurately re-rank the results from phoneme-based close-matching; however, conventional rescoring techniques based on an utterance verification model still produce many false detection results. To further improve the accuracy, in this study, several features representing the "naturalness" (or "abnormality") of duration of phonemes/syllables in detected candidates of a keyword are proposed. These features are incorporated into a conventional rescoring technique using logistic regression. Experimental results with a 604-hour Japanese speech corpus indicated that combining the rhythmic features achieved a further relative error reduction of 8.9% compared to a conventional rescoring technique.
引用
收藏
页码:170 / 175
页数:6
相关论文
共 50 条
  • [21] Spoken term detection based on DTW
    Hou J.
    Xie L.
    Yang P.
    Xiao X.
    Leung C.-C.
    Xu H.
    Wang L.
    Lü H.
    Ma B.
    Chng E.
    Li H.
    Xie, Lei (lxie@nwpu.edu.cn), 1600, Tsinghua University (57): : 18 - 23
  • [22] EXPLOITING DIVERSITY FOR SPOKEN TERM DETECTION
    Mangu, Lidia
    Soltau, Hagen
    Kuo, Hong-Kwang
    Kingsbury, Brian
    Saon, George
    2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 8282 - 8286
  • [23] Optimization of Spoken Term Detection System
    Wang, Chuanxu
    Zhang, Pengyuan
    JOURNAL OF APPLIED MATHEMATICS, 2012,
  • [24] Lattice Indexing for Spoken Term Detection
    Can, Dogan
    Saraclar, Murat
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2011, 19 (08): : 2338 - 2347
  • [25] Semantically Expanded Spoken Term Detection
    Kozhirbayev, Zhanibek
    Yessenbayev, Zhandos
    IEEE ACCESS, 2024, 12 : 177844 - 177855
  • [26] Multilingual spoken term detection: a review
    Deekshitha, G.
    Mary, Leena
    INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2020, 23 (03) : 653 - 667
  • [27] SPOKEN TERM DETECTION USING DYNAMIC MATCH SUBWORD CONFUSION NETWORK
    Gao, Jie
    Shao, Jian
    Zhang, Qingqing
    Zhao, Qingwei
    Yan, Yonghong
    ICNC 2008: FOURTH INTERNATIONAL CONFERENCE ON NATURAL COMPUTATION, VOL 4, PROCEEDINGS, 2008, : 250 - 254
  • [28] ASSESSING SEARCH TERM STRENGTH IN SPOKEN TERM DETECTION
    Torbati, Amir Hossein Harati Nejad
    Picone, Joe
    2013 IEEE INTERNATIONAL MULTI-DISCIPLINARY CONFERENCE ON COGNITIVE METHODS IN SITUATION AWARENESS AND DECISION SUPPORT (COGSIMA), 2013, : 114 - 117
  • [29] AN INITIAL ATTEMPT TO IMPROVE SPOKEN TERM DETECTION BY LEARNING OPTIMAL WEIGHTS FOR DIFFERENT INDEXING FEATURES
    Chen, Yu-Hui
    Chou, Chia-Chen
    Lee, Hung-Yi
    Lee, Lin-Shan
    2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 5278 - 5281
  • [30] Unsupervised Bottleneck Features for Low-Resource Query-by-Example Spoken Term Detection
    Chen, Hongjie
    Leung, Chewing-Chi
    Xie, Lei
    Ma, Bin
    Lie, Haizhou
    17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 923 - 927