Missing feature theory applied to robust speech recognition over IP network

被引:0
|
作者
Endo, T [1 ]
Kuroiwa, S
Nakamura, S
机构
[1] ATR, Spoken Language Translat Res Labs, Kyoto 6190288, Japan
[2] Univ Tokushima, Tokushima 7708506, Japan
来源
关键词
DSR; data loss; data imputation; marginalization;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper addresses problems involved in performing speech recognition over mobile and IP networks. The main problem is speech data loss caused by packet loss in the network. We present two missing-feature-based approaches that recover lost regions of speech data. These approaches are based on the reconstruction of missing frames or on marginal distributions. For comparison, we also use a packing method, which skips lost data. We evaluate these approaches with packet loss models. i.e., random loss and Gilbert loss models. The results show that the marginal-distributed-based technique is most effective for a packet loss environment; the degradation of word accuracy is only 5% when the packet loss rate is 30% and only 3% when mean burst loss length is 24 frames in the case of DSR front-end. The simple data imputation method is also effective in the case of clean speech.
引用
收藏
页码:1119 / 1126
页数:8
相关论文
共 50 条
  • [1] Robust speech recognition over IP networks
    Milner, B
    Semnani, S
    2000 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS, VOLS I-VI, 2000, : 1791 - 1794
  • [2] Correction of likelihoods for degrees of freedom in robust speech recognition using missing feature theory
    Van hamme, H
    SEVENTH INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND ITS APPLICATIONS, VOL 1, PROCEEDINGS, 2003, : 401 - 404
  • [3] Robust speech recognition using missing feature theory and target speech enhancement based on degenerate unmixing and estimation technique
    Kim, Minook
    Kim, Ji-Seon
    Park, Hyung-Min
    INDEPENDENT COMPONENT ANALYSES, WAVELETS, NEURAL NETWORKS, BIOSYSTEMS, AND NANOENGINEERING IX, 2011, 8058
  • [4] Missing-Feature-Theory-based Robust Simultaneous Speech Recognition System with Non-clean Speech Acoustic Model
    Takahashi, Toni
    Nakadai, Kazuhiro
    Komatani, Kazunori
    Ogata, Tetsuya
    Okuno, Hiroshi G.
    2009 IEEE-RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS, 2009, : 2730 - 2735
  • [5] Speech recognition for a humanoid with motor noise utilizing missing feature theory
    Nishimura, Yoshitaka
    Ishizuka, Mitsuru
    Nakadai, Kazuhiro
    Nakano, Mikio
    Tsujino, Hiroshi
    2006 6TH IEEE-RAS INTERNATIONAL CONFERENCE ON HUMANOID ROBOTS, VOLS 1 AND 2, 2006, : 26 - +
  • [6] Hard-Mask Missing Feature Theory for Robust Speaker Recognition
    Lim, Shin-Cheol
    Jang, Sei-Jin
    Lee, Soek-Pil
    Kim, Moo Young
    IEEE TRANSACTIONS ON CONSUMER ELECTRONICS, 2011, 57 (03) : 1245 - 1250
  • [7] Independent component analysis applied to feature extraction for robust automatic speech recognition
    Potamitis, L
    Fakotakis, N
    Kokkinakis, G
    ELECTRONICS LETTERS, 2000, 36 (23) : 1977 - 1978
  • [8] Using phoneme segmentation in, conjunction with missing feature approaches for noise robust speech recognition
    Mohammadi, Arash
    Almasganj, Farshad
    Taherkhani, Aboozar
    Naderkhani, Farnoosh
    2007 IEEE INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND INFORMATION TECHNOLOGY, VOLS 1-3, 2007, : 447 - 451
  • [9] Advances in Missing Feature Techniques for Robust Large-Vocabulary Continuous Speech Recognition
    Van Segbroeck, Maarten
    Van Hamme, Hugo
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2011, 19 (01): : 123 - 137
  • [10] Missing-feature approaches in speech recognition
    Raj, B
    Stern, RM
    IEEE SIGNAL PROCESSING MAGAZINE, 2005, 22 (05) : 101 - 116