Solutions for robust speech/non-speech detection in wireless environment

Cited by: 3
Authors
Karray, L [1]
Mokbel, C [1]
Monné, J [1]
Affiliation
[1] FT CNET, DIH, DIPS, F-22307 Lannion, France
Source
1998 IEEE 4TH WORKSHOP INTERACTIVE VOICE TECHNOLOGY FOR TELECOMMUNICATIONS APPLICATIONS - IVTTA '98 | 1998
DOI
10.1109/IVTTA.1998.727714
Chinese Library Classification
TM [Electrical engineering]; TN [Electronics and communication technology];
Discipline classification codes
0808; 0809
Abstract
The use of speech recognition systems in noisy environments requires robustness to adverse conditions, so efficient detection of speech/non-speech segments is necessary. Several approaches have been proposed to improve the robustness of speech/non-speech detection for speech recognition in noisy conditions. In this paper, we describe a robust speech/non-speech detection algorithm based on the estimation of noise statistics: mean and variance. Results of several experiments carried out on a database collected over the GSM network show that this new approach improves the recognizer's overall performance, especially in very noisy environments. Spectral subtraction is then used as a preprocessing technique to increase robustness to noisy conditions. We show that the improvements mainly concern noisy conditions such as calls from outside or from moving cars.
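The abstract names the idea of detecting speech from estimated noise statistics (mean and variance) without giving details. The following is only a minimal illustrative sketch of that general idea, not the authors' algorithm: it assumes the first few frames contain noise only, estimates the mean and standard deviation of their log-energy, and flags any frame whose log-energy exceeds the mean by a multiple of the standard deviation. The function names, the frame length, the number of leading noise frames, the factor k, and the 1 dB floor on the standard deviation are all assumptions chosen for the example.

```python
import math
import random
import statistics


def frame_log_energies(samples, frame_len=160):
    """Split the signal into non-overlapping frames; return log-energy (dB) per frame."""
    energies = []
    for start in range(0, len(samples) - frame_len + 1, frame_len):
        frame = samples[start:start + frame_len]
        power = sum(x * x for x in frame) / frame_len
        energies.append(10.0 * math.log10(power + 1e-12))  # small offset avoids log(0)
    return energies


def detect_speech(energies, noise_frames=10, k=3.0, min_std_db=1.0):
    """Flag frames above mean + k * std of the leading frames (assumed noise-only)."""
    noise = energies[:noise_frames]
    mu = statistics.mean(noise)
    sigma = max(statistics.pstdev(noise), min_std_db)  # floor keeps the margin non-trivial
    threshold = mu + k * sigma
    return [e > threshold for e in energies]


# Synthetic signal: 20 noise frames, 10 frames of a 200 Hz tone in noise, 20 noise frames.
rng = random.Random(0)
noise_a = [rng.gauss(0.0, 0.01) for _ in range(160 * 20)]
burst = [0.5 * math.sin(2 * math.pi * 200 * n / 8000) + rng.gauss(0.0, 0.01)
         for n in range(160 * 10)]
noise_b = [rng.gauss(0.0, 0.01) for _ in range(160 * 20)]

flags = detect_speech(frame_log_energies(noise_a + burst + noise_b))
print("".join("S" if f else "." for f in flags))
```

A practical detector would typically update the noise estimate over time and smooth the frame decisions; the fixed estimate from the leading frames is used here only to keep the sketch short.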
Pages: 166-170
Page count: 5