Solutions for robust speech/non-speech detection in wireless environment

被引:3
|
作者
Karray, L [1 ]
Mokbel, C [1 ]
Monné, J [1 ]
机构
[1] FT CNET, DIH, DIPS, F-22307 Lannion, France
来源
1998 IEEE 4TH WORKSHOP INTERACTIVE VOICE TECHNOLOGY FOR TELECOMMUNICATIONS APPLICATIONS - IVTTA '98 | 1998年
关键词
D O I
10.1109/IVTTA.1998.727714
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
The use of speech recognition systems in noisy environments requires robustness to adverse conditions. An efficient detection of speech/non-speech segments is therefore necessary. Several approaches have been proposed in order to improve the robustness of speech/non-speech detection used for speech recognition in noisy conditions. In this paper, we describe a robust speech/non-speech detection algorithm based on the estimation of noise statistics: mean and variance. Results of several experiments carried out on a database collected over the GSM network show that this new approach improves the recognizer's global performances, especially in very noisy environments. Then, spectral subtraction is used as a preprocessing technique aiming to increase the robustness to noisy conditions. We show that the improvements concern mainly noisy conditions such as calls from outside or from running cars.
引用
收藏
页码:166 / 170
页数:5
相关论文
共 50 条
  • [31] Effects of audio-visual integration on the detection of masked speech and non-speech sounds
    Eramudugolla, Ranmalee
    Henderson, Rachel
    Mattingley, Jason B.
    BRAIN AND COGNITION, 2011, 75 (01) : 60 - 66
  • [32] Audiovisual synchrony perception of simplified speech sounds heard as speech and non-speech
    Asakawa, Kaori
    Tanaka, Akihiro
    Sakamoto, Shuichi
    Iwaya, Yukio
    Suzuki, Yoiti
    ACOUSTICAL SCIENCE AND TECHNOLOGY, 2011, 32 (03) : 125 - 128
  • [33] Robust robot localization using non-speech sound in industrial environments
    Bolea, Yolanda
    Manzanares, Manuel
    Grau, Antoni
    2008 IEEE INTERNATIONAL SYMPOSIUM ON INDUSTRIAL ELECTRONICS, VOLS 1-5, 2008, : 1831 - 1836
  • [34] Task related differences in speech/non-speech evoked potentials
    Meyer, G
    Perez, E
    JOURNAL OF COGNITIVE NEUROSCIENCE, 2005, : 213 - 213
  • [35] The discrimination of and orienting to speech and non-speech sounds in children with autism
    Lepistö, T
    Kujala, T
    Vanhala, R
    Alku, P
    Huotilainen, M
    Näätänen, R
    BRAIN RESEARCH, 2005, 1066 (1-2) : 147 - 157
  • [36] Speech/Non-Speech Segmentation Based on Phoneme Recognition Features
    Janez Žibert
    Nikola Pavešić
    France Mihelič
    EURASIP Journal on Advances in Signal Processing, 2006
  • [37] Speech/non-speech segmentation based on phoneme recognition features
    Zibert, Janez
    Pavesic, Nikola
    Mihelic, France
    EURASIP JOURNAL ON APPLIED SIGNAL PROCESSING, 2006, 2006 (1)
  • [38] A clinician survey of speech and non-speech characteristics of neurogenic stuttering
    Theys, Catherine
    van Wieringen, Astrid
    De Nil, Luc F.
    JOURNAL OF FLUENCY DISORDERS, 2008, 33 (01) : 1 - 23
  • [39] Speaker Non-speech Event Recognition with Standard Speech Datasets
    Rajnoha, J.
    ACTA POLYTECHNICA, 2007, 47 (4-5) : 107 - 111
  • [40] LOST IN SEGMENTATION: THREE APPROACHES FOR SPEECH/NON-SPEECH DETECTION IN CONSUMER-PRODUCED VIDEOS
    Elizalde, Benjamin
    Friedland, Gerald
    2013 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME 2013), 2013,