Linearized distortion model for robust speech recognition in noisy environments

被引:0
|
作者
He, Yong-Jun [1 ,2 ]
Han, Ji-Qing [1 ]
机构
[1] School of Computer Science and Technology, Harbin Institute of Technology, Harbin 150001, China
[2] School of Computer Science and Technology, Harbin University of Science and Technology, Harbin 150080, China
来源
关键词
Linearization - Piecewise linear techniques;
D O I
暂无
中图分类号
学科分类号
摘要
The robustness of speech recognition system in noisy environments was investigated. The distortion model in Mel-frequency cepstral coefficient (MFCC) domain is highly non-linear and difficult to deal with. A new linear distortion model was proposed by replacing the logarithm operation with its piecewise linear interpolation function. Then the estimation of noise parameters and compensation of acoustic models were provided. The proposed method can avoid model error introduced by utilizing linearization methods based on vector Taylor series (VTS) expansion, and significantly improve the robustness of recognizer in noisy environments.
引用
收藏
页码:8 / 14
相关论文
共 50 条
  • [41] A digital chip for robust speech recognition in noisy environment
    Kim, CM
    Lee, SY
    2001 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-VI, PROCEEDINGS: VOL I: SPEECH PROCESSING 1; VOL II: SPEECH PROCESSING 2 IND TECHNOL TRACK DESIGN & IMPLEMENTATION OF SIGNAL PROCESSING SYSTEMS NEURALNETWORKS FOR SIGNAL PROCESSING; VOL III: IMAGE & MULTIDIMENSIONAL SIGNAL PROCESSING MULTIMEDIA SIGNAL PROCESSING - VOL IV: SIGNAL PROCESSING FOR COMMUNICATIONS; VOL V: SIGNAL PROCESSING EDUCATION SENSOR ARRAY & MULTICHANNEL SIGNAL PROCESSING AUDIO & ELECTROACOUSTICS; VOL VI: SIGNAL PROCESSING THEORY & METHODS STUDENT FORUM, 2001, : 1089 - 1092
  • [42] Multiband, Multisensor Robust Features for Noisy Speech Recognition
    Dimitriadis, Dimitrios
    Maragos, Petros
    Lefkimmiatis, Stamatios
    INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 889 - 892
  • [43] Techniques for robust speech recognition in noisy and reverberant conditions
    Brown, GJ
    Palomäki, KJ
    SPEECH SEPARATION BY HUMANS AND MACHINES, 2005, : 213 - 220
  • [44] SPEECH REINFORCEMENT IN NOISY REVERBERANT ENVIRONMENTS USING A PERCEPTUAL DISTORTION MEASURE
    Crespo, Joao B.
    Hendriks, Richard C.
    2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
  • [45] Speech Recognition in Noisy Environments using a Switching Linear Dynamic Model for Feature Enhancement
    Schuller, Bjoern
    Woellmer, Martin
    Moosmayr, Tobias
    Rigoll, Gerhard
    INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 1789 - +
  • [46] Robust speech/non-speech detection based on LDA-derived parameter and voicing parameter for speech recognition in noisy environments
    Martin, A
    Mauuary, L
    SPEECH COMMUNICATION, 2006, 48 (02) : 191 - 206
  • [47] A novel algorithm to robust speech endpoint detection in noisy environments
    Yi, Li
    Yingle, Fan
    ICIEA 2007: 2ND IEEE CONFERENCE ON INDUSTRIAL ELECTRONICS AND APPLICATIONS, VOLS 1-4, PROCEEDINGS, 2007, : 1555 - 1558
  • [48] Voice Command II: A DSP implementation of robust speech recognition in real-world noisy environments
    Lee, SY
    Kim, DS
    Ahn, KH
    Jeong, JH
    Kim, H
    Park, SY
    Kim, LY
    Lee, JS
    Lee, HY
    PROGRESS IN CONNECTIONIST-BASED INFORMATION SYSTEMS, VOLS 1 AND 2, 1998, : 1051 - 1054
  • [49] A performance comparison of robust speech analysis methods in noisy environments
    Shimamura, T
    PROCEEDINGS OF 2001 INTERNATIONAL SYMPOSIUM ON INTELLIGENT MULTIMEDIA, VIDEO AND SPEECH PROCESSING, 2001, : 103 - 106
  • [50] Feature extraction based on zero-crossings with peak amplitudes for robust speech recognition in noisy environments
    Kim, DS
    Jeong, JH
    Kim, JW
    Lee, SY
    1996 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, CONFERENCE PROCEEDINGS, VOLS 1-6, 1996, : 61 - 64