MAP-based perceptual modeling for noisy speech recognition

Authors
Affiliations
Institute of Biomedical Engineering, National Cheng Kung University, Tainan 701, Taiwan [1]
Unknown [2]
Unknown [3]
Unknown [4]
Unknown [5]
Source
J. Inf. Sci. Eng. | 2006 / Vol. 5 / pp. 999-1013
Keywords
Computer simulation - Mathematical models - Noise abatement - Spurious signal noise
DOI
Not available
Abstract
This study presents a maximum a posteriori (MAP) based perceptual modeling approach to counter the degradation of speech recognition in noisy environments. First, MAP-based noise detection identifies the noise segments in an utterance. A subtractive-type enhancement algorithm that exploits the masking properties of the human auditory system then reduces the effect of the noise. Finally, MAP-based incremental noise model adaptation is used to overcome the model mismatch between training and testing environments. To evaluate the proposed approach, a Mandarin keyword recognition system was constructed. The experimental results show that the proposed approach achieves a higher recognition rate than the audible noise suppression (ANS) and parallel model combination (PMC) methods.
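The last two stages named in the abstract can be illustrated with a minimal sketch. This is not the paper's algorithm: it assumes plain power spectral subtraction with a spectral floor (no auditory masking threshold is modeled) and a textbook MAP update of a Gaussian mean under a conjugate prior. The function names, the parameters `alpha`, `floor`, and `tau`, and the numbers are all hypothetical.

```python
def spectral_subtract(noisy_power, noise_power, alpha=1.0, floor=0.01):
    """Plain power spectral subtraction for one frame.

    Assumption: a fixed spectral floor stands in for any perceptual
    (masking-based) rule; the paper's masking model is not reproduced.
    """
    return [max(y - alpha * n, floor * y)
            for y, n in zip(noisy_power, noise_power)]


def map_update_mean(prior_mean, tau, frames):
    """MAP estimate of a Gaussian mean with a conjugate prior.

    prior_mean: current noise-model mean (one value per frequency bin)
    tau:        prior weight, i.e. how many frames the prior is "worth"
    frames:     frames that a noise detector labeled as noise
    Returns the updated mean and the updated prior weight, so the
    function can be called again on the next batch (incremental use).
    """
    n = len(frames)
    if n == 0:
        return list(prior_mean), tau
    new_mean = [
        (tau * m + sum(f[d] for f in frames)) / (tau + n)
        for d, m in enumerate(prior_mean)
    ]
    return new_mean, tau + n


# Hypothetical numbers: adapt the noise model, then enhance one frame.
noise_mean, tau = map_update_mean([1.0, 1.0], 2.0, [[2.0, 4.0], [2.0, 4.0]])
enhanced = spectral_subtract([4.0, 3.0], noise_mean)
```

Because the updated prior weight is returned together with the mean, feeding the noise frames one at a time gives the same estimate as a single batch update, which is the sense in which this sketch is incremental.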
Related papers
50 in total
  • [21] A Study on Noisy Speech Recognition
    Saeed, Khalid
    Szczepanski, Adam
    ICBAKE: 2009 INTERNATIONAL CONFERENCE ON BIOMETRICS AND KANSEI ENGINEERING, 2009, : 142 - 147
  • [22] 2-D Psychoacoustic Modeling for Automatic Speech Recognition in Noisy Environment
    Desai, Sampreeta
    Khandekar, Prasad D.
    Raut, Ketan J.
    2016 CONFERENCE ON ADVANCES IN SIGNAL PROCESSING (CASP), 2016, : 129 - 132
  • [23] A Semantic Analysis Method for Concept Map-based Knowledge Modeling
    Hao, Jin-Xing
    Yu, Angela Yan
    Kwok, Ron Chi-Wai
    ELECTRONIC-BUSINESS INTELLIGENCE: FOR CORPORATE COMPETITIVE ADVANTAGES IN THE AGE OF EMERGING TECHNOLOGIES & GLOBALIZATION, 2010, 14 : 281 - +
  • [24] An evaluation of adaptive beamformer based on average speech spectrum for noisy speech recognition
    Nishiura, T
    Nakayama, M
    Nakamura, S
    2003 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING I, 2003, : 668 - 671
  • [25] An Improved MAP-Based Speech Enhancer for High Sound Quality in Automobile Environment
    Satomi, Yuki
    Kawamura, Arata
    2014 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA), 2014
  • [26] An evaluation of adaptive beamformer based on average speech spectrum for noisy speech recognition
    Nishiura, T
    Nakayama, M
    Nakamura, S
    2003 INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOL III, PROCEEDINGS, 2003, : 209 - 212
  • [27] Collaborative integration of speech and 3D gesture for map-based applications
    Corradini, A
    COMPUTATIONAL SCIENCE - ICCS 2004, PT 3, PROCEEDINGS, 2004, 3038 : 913 - 920
  • [28] PERCEPTUAL UNITS IN SPEECH RECOGNITION
    MASSARO, DW
    JOURNAL OF EXPERIMENTAL PSYCHOLOGY, 1974, 102 (02): : 199 - 208
  • [29] Latent Perceptual Mapping: A New Acoustic Modeling Framework for Speech Recognition
    Sundaram, Shiva
    Bellegarda, Jerome R.
    11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 1-2, 2010, : 881 - 884
  • [30] MAP-BASED ESTIMATION OF THE PARAMETERS OF NON-STATIONARY GAUSSIAN PROCESSES FROM NOISY OBSERVATIONS
    Krueger, Alexander
    Haeb-Umbach, Reinhold
    2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 3596 - 3599