Joint decoding of multiple speech patterns for robust speech recognition

被引:0
|
作者
Nair, Nishanth Ulhas [1 ]
Sreenivas, T. V. [1 ]
机构
[1] Indian Inst Sci, Dept Elect Commun Engn, Bangalore 560012, Karnataka, India
关键词
Robust speech recognition; Viterbi Algorithm; Dynamic Time Warping; burst noise;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We are addressing a new problem of improving automatic speech recognition performance, given multiple utterances of patterns from the same class. We have formulated the problem of jointly decoding K multiple patterns given a single Hidden Markov Model. It is shown that such a solution is possible by aligning the K patterns using the proposed Multi Pattern Dynamic Time Warping algorithm followed by the Constrained Multi Pattern Viterbi Algorithm The new formulation is tested in the context of speaker independent isolated word recognition for both clean and noisy patterns. When 10 percent of speech is affected by a burst noise at -5 dB Signal to Noise Ratio (local), it is shown that joint decoding using only two noisy patterns reduces the noisy speech recognition error rate to about 51 percent, when compared to the single pattern decoding using the Viterbi Algorithm. In contrast a simple maximization of individual pattern likelihoods, provides only about 7 percent reduction in error rate.
引用
收藏
页码:93 / 98
页数:6
相关论文
共 50 条
  • [1] Joint evaluation of multiple speech patterns for speech recognition and training
    Nair, Nishanth Ulhas
    Sreenivas, T. V.
    COMPUTER SPEECH AND LANGUAGE, 2010, 24 (02): : 307 - 340
  • [2] Joint Uncertainty Decoding With Predictive Methods for Noise Robust Speech Recognition
    Xu, Haitian
    Gales, Mark J. F.
    Chin, K. K.
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2011, 19 (06): : 1665 - 1676
  • [3] JOINT UNCERTAINTY DECODING WITH THE SECOND ORDER APPROXIMATION FOR NOISE ROBUST SPEECH RECOGNITION
    Xu, Haitian
    Chin, K. K.
    2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, : 3841 - 3844
  • [4] Comparison of Estimation Techniques in Joint Uncertainty Decoding for Noise Robust Speech Recognition
    Xu, Haitian
    Chin, K. K.
    INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 2363 - 2366
  • [5] Joint Decoding for Speech Recognition and Semantic Tagging
    Deoras, Anoop
    Sarikaya, Ruhi
    Tur, Gokhan
    Hakkani-Tuer, Dilek
    13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 1066 - 1069
  • [6] Multi-Pattern Viterbi Algorithm for joint decoding of multiple speech patterns
    Nair, Nishanth Ulhas
    Sreenivas, T. V.
    SIGNAL PROCESSING, 2010, 90 (12) : 3278 - 3283
  • [7] Improving Joint Uncertainty Decoding Performance by Predictive Methods for Noise Robust Speech Recognition
    Xu, Haitian
    Gales, Mark J. F.
    Chin, K. K.
    2009 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION & UNDERSTANDING (ASRU 2009), 2009, : 222 - 227
  • [8] Issues with Uncertainty Decoding for Noise Robust Speech Recognition
    Liao, H.
    Gales, M. J. F.
    INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 1121 - 1124
  • [9] Uncertainty decoding with splice for noise robust speech recognition
    Droppo, J
    Acero, A
    Deng, L
    2002 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-IV, PROCEEDINGS, 2002, : 57 - 60
  • [10] Joint Decoding of CTC Based Systems for Speech Recognition
    Guo, Jiaqi
    You, Yongbin
    Qian, Yanmin
    Yu, Kai
    INTERSPEECH 2019, 2019, : 2205 - 2209