Linearized distortion model for robust speech recognition in noisy environments

被引:0
|
作者
He, Yong-Jun [1 ,2 ]
Han, Ji-Qing [1 ]
机构
[1] School of Computer Science and Technology, Harbin Institute of Technology, Harbin 150001, China
[2] School of Computer Science and Technology, Harbin University of Science and Technology, Harbin 150080, China
来源
关键词
Linearization - Piecewise linear techniques;
D O I
暂无
中图分类号
学科分类号
摘要
The robustness of speech recognition system in noisy environments was investigated. The distortion model in Mel-frequency cepstral coefficient (MFCC) domain is highly non-linear and difficult to deal with. A new linear distortion model was proposed by replacing the logarithm operation with its piecewise linear interpolation function. Then the estimation of noise parameters and compensation of acoustic models were provided. The proposed method can avoid model error introduced by utilizing linearization methods based on vector Taylor series (VTS) expansion, and significantly improve the robustness of recognizer in noisy environments.
引用
收藏
页码:8 / 14
相关论文
共 50 条
  • [31] Recognition of Noisy Speech: A Comparative Survey of Robust Model Architecture and Feature Enhancement
    Schuller, Bjoern
    Woellmer, Martin
    Moosmayr, Tobias
    Rigoll, Gerhard
    EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2009,
  • [32] Recognition of Noisy Speech: A Comparative Survey of Robust Model Architecture and Feature Enhancement
    Björn Schuller
    Martin Wöllmer
    Tobias Moosmayr
    Gerhard Rigoll
    EURASIP Journal on Audio, Speech, and Music Processing, 2009
  • [33] Speech recognition in noisy environments with Convolutional Neural Networks
    Santos, Rafael M.
    Matos, Leonardo N.
    Macedo, Hendrik T.
    Montalvao, Jugurta
    2015 BRAZILIAN CONFERENCE ON INTELLIGENT SYSTEMS (BRACIS 2015), 2015, : 175 - 179
  • [34] Perceptual features for automatic speech recognition in noisy environments
    Haque, Serajul
    Togneri, Roberto
    Zaknich, Anthony
    SPEECH COMMUNICATION, 2009, 51 (01) : 58 - 75
  • [35] SPEECH RECOGNITION IN NOISY ENVIRONMENTS WITH THE AID OF MICROPHONE ARRAYS
    VANCOMPERNOLLE, D
    MA, W
    XIE, F
    VANDIEST, M
    SPEECH COMMUNICATION, 1990, 9 (5-6) : 433 - 442
  • [36] Speech Emotion Recognition Based on EMD in Noisy Environments
    Chu, Yunyun
    Xiong, Weihua
    Chen, Wei
    ADVANCES IN CIVIL ENGINEERING AND BUILDING MATERIALS III, 2014, 831 : 460 - 464
  • [37] Multi-band speech recognition in noisy environments
    Okawa, S
    Bocchieri, E
    Potamianos, A
    PROCEEDINGS OF THE 1998 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-6, 1998, : 641 - 644
  • [38] Speech enhancement strategy for speech recognition microcontroller under noisy environments
    Chan, Kit Yan
    Nordholm, Sven
    Yiu, Ka Fai Cedric
    Togneri, Roberto
    NEUROCOMPUTING, 2013, 118 : 279 - 288
  • [39] A discriminative and robust training algorithm for noisy speech recognition
    Hong, WT
    2003 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING I, 2003, : 8 - 11
  • [40] ROBUST SPEECH RECOGNITION IN UNKNOWN REVERBERANT AND NOISY CONDITIONS
    Hsiao, Roger
    Ma, Jeff
    Hartmann, William
    Karafiat, Martin
    Grezl, Frantisek
    Burget, Lukas
    Szoke, Igor
    Cernocky, Jan Honza
    Watanabe, Shinji
    Chen, Zhuo
    Mallidi, Sri Harish
    Hermansky, Hynek
    Tsakalidis, Stavros
    Schwartz, Richard
    2015 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING (ASRU), 2015, : 533 - 538