MAP-based perceptual modeling for noisy speech recognition

被引：0

作者：

Institute of Biomedical Engineering, National Cheng Kung University, Tainan, 701, Taiwan ^{[1
]}

不详 ^{[2
]}

不详 ^{[3
]}

不详 ^{[4
]}

不详 ^{[5
]}

机构：

来源：

J. Inf. Sci. Eng. | 2006年 / 5卷 / 999-1013期

关键词：

Computer simulation - Mathematical models - Noise abatement - Spurious signal noise;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

This study presents a maximum a posteriori (MAP) based perceptual modeling approach to deal with the issue of recognition degradation in noisy environment. In this approach, MAP-based noise detection is first applied to identify the noise segment in an utterance. Subtractive-type enhancement algorithm with masking properties of the human auditory system is then used to reduce the noise effect. Finally, MAP-based incremental noise model adaptation is developed to overcome the model inconsistencies between training and testing environments. For performance evaluation of the proposed approach, a Mandarin keyword recognition system was constructed. The experimental results show that the proposed approach achieves a better recognition rate compared to the audible noise suppression (ANS) and parallel model combination (PMC) methods.

引用

共 50 条

[21] A Study on Noisy Speech Recognition
Saeed, Khalid
Szczepanski, Adam
ICBAKE: 2009 INTERNATIONAL CONFERENCE ON BIOMETRICS AND KANSEI ENGINEERING, 2009, : 142 - 147
[22] 2-DPsychoacoustic Modeling for Automatic Speech Recognition in Noisy Environment
Desai, Sampreeta
Khandekar, Prasad D.
Raut, Ketan J.
2016 CONFERENCE ON ADVANCES IN SIGNAL PROCESSING (CASP), 2016, : 129 - 132
[23] A Semantic Analysis Method for Concept Map-based Knowledge Modeling
Hao, Jin-Xing
Yu, Angela Yan
Kwok, Ron Chi-Wai
ELECTRONIC-BUSINESS INTELLIGENCE: FOR CORPORATE COMPETITIVE ADVANTAGES IN THE AGE OF EMERGING TECHNOLOGIES & GLOBALIZATION, 2010, 14 : 281 - +
[24] An evaluation of adaptive beamformer based on average speech spectrum for noisy speech recognition
Nishiura, T
Nakayama, M
Nakamura, S
2003 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING I, 2003, : 668 - 671
[25] An Improved MAP-Based Speech Enhancer for High Sound Quality in Automobile Environment
Satomi, Yuki
Kawamura, Arata
2014 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA), 2014,
[26] An evaluation of adaptive beamformer based on average speech spectrum for noisy speech recognition
Nishiura, T
Nakayama, M
Nakamura, S
2003 INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOL III, PROCEEDINGS, 2003, : 209 - 212
[27] Collaborative integration of speech and 3D gesture for map-based applications
Corradini, A
COMPUTATIONAL SCIENCE - ICCS 2004, PT 3, PROCEEDINGS, 2004, 3038 : 913 - 920
[28] PERCEPTUAL UNITS IN SPEECH RECOGNITION
MASSARO, DW
JOURNAL OF EXPERIMENTAL PSYCHOLOGY, 1974, 102 (02): : 199 - 208
[29] Latent Perceptual Mapping: A New Acoustic Modeling Framework for Speech Recognition
Sundaram, Shiva
Bellegarda, Jerome R.
11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 1-2, 2010, : 881 - 884
[30] MAP-BASED ESTIMATION OF THE PARAMETERS OF NON-STATIONARY GAUSSIAN PROCESSES FROM NOISY OBSERVATIONS
Krueger, Alexander
Haeb-Umbach, Reinhold
2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 3596 - 3599

← 1 2 3 4 5 →