Robust technologies towards automatic speech recognition in car noise environments

Cited by: 0

Authors:

Ding, Pei [1]

He, Lei [1]

Yan, Xiang [1]

Zhao, Rui [1]

Hao, Jie [1]

Affiliation:

[1] Toshiba Res & Dev Ctr, Beijing, Peoples R China

Source:

2006 8TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, VOLS 1-4 | 2006

Keywords:

robust speech recognition; in-car noise; speech enhancement; spectrum smoothing; immunity learning

DOI:

Not available

Chinese Library Classification:

TM [Electrical technology]; TN [Electronic technology, communication technology]

Discipline classification codes:

0808; 0809

Abstract:

This paper presents research on robust automatic speech recognition (ASR) in car noise environments. In the front-end design, speech enhancement technologies are used to suppress the background noise in the frequency domain, and spectrum smoothing is then applied along both the time and frequency indices to compensate for spectral components distorted by noise over-reduction. In acoustic model training, we propose an immunity learning scheme, in which pre-recorded car noises are artificially added to clean training utterances at different signal-to-noise ratios (SNR) to imitate in-car environments. After analyzing the SNR and noise spectrum of real in-car utterances, we further refine the immunity training set by adjusting the distribution of SNR and increasing the proportion of training noises that have similar characteristics. Evaluation results for isolated phrase recognition show that the ASR system with the proposed technologies achieves an average error rate reduction (ERR) of 90.68% for artificial car noisy speech and 79.08% for real in-car speech, compared with a baseline system that uses no robust technology.
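The abstract does not spell out how noise is added to the clean utterances. One common way to do it, sketched below, is to scale the noise so that the ratio of average speech power to average noise power matches a target SNR. The function names (`mean_power`, `mix_at_snr`, `build_noisy_set`), the use of plain Python lists for samples, and the SNR values and weights in the usage note are all assumptions for illustration, not details taken from the paper.

```python
import math
import random


def mean_power(samples):
    """Average power (mean squared amplitude) of a sequence of samples."""
    return sum(s * s for s in samples) / len(samples)


def mix_at_snr(clean, noise, snr_db):
    """Return clean + scaled noise so that the mixture has the given SNR in dB."""
    if not clean or not noise:
        raise ValueError("clean and noise must be non-empty")
    # Repeat the noise recording as needed, then trim to the utterance length.
    reps = math.ceil(len(clean) / len(noise))
    noise = (list(noise) * reps)[:len(clean)]
    p_clean = mean_power(clean)
    p_noise = mean_power(noise)
    if p_noise == 0.0:
        raise ValueError("noise is silent")
    # Solve 10*log10(p_clean / (gain**2 * p_noise)) = snr_db for gain.
    gain = math.sqrt(p_clean / (p_noise * 10.0 ** (snr_db / 10.0)))
    return [c + gain * n for c, n in zip(clean, noise)]


def build_noisy_set(utterances, noises, snr_choices_db, snr_weights, rng):
    """Mix each utterance with a random noise clip at an SNR drawn from a
    weighted distribution (hypothetical stand-in for the refined SNR mix)."""
    noisy = []
    for utt in utterances:
        snr_db = rng.choices(snr_choices_db, weights=snr_weights, k=1)[0]
        noisy.append(mix_at_snr(utt, rng.choice(noises), snr_db))
    return noisy
```

For example, `build_noisy_set(utterances, noises, [0, 5, 10, 20], [1, 2, 2, 1], random.Random(0))` would draw 5 dB and 10 dB twice as often as the other levels. Changing the weights is one simple way to adjust the SNR distribution of a training set, although the paper's actual refinement procedure is not described in the abstract.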


Pages: 776 / +

Page count: 2

