Missing feature theory applied to robust speech recognition over IP network

被引:0
|
作者
Endo, T [1 ]
Kuroiwa, S
Nakamura, S
机构
[1] ATR, Spoken Language Translat Res Labs, Kyoto 6190288, Japan
[2] Univ Tokushima, Tokushima 7708506, Japan
来源
IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS | 2004年 / E87D卷 / 05期
关键词
DSR; data loss; data imputation; marginalization;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper addresses problems involved in performing speech recognition over mobile and IP networks. The main problem is speech data loss caused by packet loss in the network. We present two missing-feature-based approaches that recover lost regions of speech data. These approaches are based on the reconstruction of missing frames or on marginal distributions. For comparison, we also use a packing method, which skips lost data. We evaluate these approaches with packet loss models. i.e., random loss and Gilbert loss models. The results show that the marginal-distributed-based technique is most effective for a packet loss environment; the degradation of word accuracy is only 5% when the packet loss rate is 30% and only 3% when mean burst loss length is 24 frames in the case of DSR front-end. The simple data imputation method is also effective in the case of clean speech.
引用
收藏
页码:1119 / 1126
页数:8
相关论文
共 50 条
  • [21] Geometrical feature extraction for robust speech recognition
    Li, Xiaokun
    Kwan, Chiman
    2005 39TH ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS AND COMPUTERS, VOLS 1 AND 2, 2005, : 558 - 562
  • [22] Feature Adaptation for Robust Mobile Speech Recognition
    Lee, Hyeopwoo
    Yook, Dongsuk
    IEEE TRANSACTIONS ON CONSUMER ELECTRONICS, 2012, 58 (04) : 1393 - 1398
  • [23] Acoustic feature combination for robust speech recognition
    Zolnay, A
    Schlüter, R
    Ney, H
    2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING, 2005, : 457 - 460
  • [24] Temporal structure normalization of speech feature for robust speech recognition
    Xiao, Xiong
    Chng, Eng Siong
    Li, Haizhou
    IEEE SIGNAL PROCESSING LETTERS, 2007, 14 (07) : 500 - 503
  • [25] Lost Speech Reconstruction Method using Speech Recognition based on Missing Feature Theory and HMM-based Speech Synthesis
    Kuroiwa, Shingo
    Tsuge, Satoru
    Ren, Fuji
    INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 1105 - 1108
  • [26] Robust speech recognition in telephone network
    Han, MS
    Park, GB
    Park, JG
    Han, JQ
    PROGRESS IN CONNECTIONIST-BASED INFORMATION SYSTEMS, VOLS 1 AND 2, 1998, : 1103 - 1106
  • [27] Multi-flow Block Interleaving applied to Distributed Speech Recognition over IP networks
    Gomez, Angel M.
    Ramos-Munoz, Juan J.
    Peinado, Antonio M.
    Sanchez, Victoria
    INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 2342 - 2345
  • [28] Enhanced robot speech recognition based on microphone array source separation and missing feature theory
    Yamamoto, S
    Valin, JM
    Nakadai, K
    Rouat, J
    Michaud, F
    Ogata, T
    Okuno, HG
    2005 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), VOLS 1-4, 2005, : 1477 - 1482
  • [29] Mask Estimation in Non-stationary Noise Environments for Missing Feature Based Robust Speech Recognition
    Badiezadegan, Shirin
    Rose, Richard C.
    11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 3 AND 4, 2010, : 2062 - 2065
  • [30] Robust speech recognition over mobile and IP networks in burst-like packet loss
    Milner, B
    James, A
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2006, 14 (01): : 223 - 231