Two-sensor noise robust ASR with missing frames for AURORA2 task

Cited by: 0
Authors
Demiroglu, C [1]
Anderson, DV [1]
Affiliations
[1] Georgia Inst Technol, Sch ECE, Atlanta, GA 30332 USA
Keywords
DOI
Not available
Chinese Library Classification
TP18 [Artificial intelligence theory];
Subject classification codes
081104; 0812; 0835; 1405;
Abstract
In a recently proposed system, we used the missing frames idea for noise-robust automatic speech recognition (ASR). The key point of the missing frames idea is that frames with energies below a certain threshold are considered unreliable. We set these frames to a silence floor and treat them as silence frames even if they contain speech. Although this discards valuable information such as transitional cues for consonants, we showed that for a small-vocabulary task the system substantially decreases the word error rate (WER) at low SNRs. We also observed that the algorithm decreases the overall computational complexity, in contrast to other noise-robust systems, which typically require considerable computational power. The main drawback of the missing frames system is the difficulty of accurately detecting high-energy portions in high-noise environments. In this work we propose using a glottal sensor to detect the high-energy portions of the acoustic signal. We show that the glottal sensor can detect the high-energy speech portions very accurately without adding significant computational complexity. The second contribution of this paper is that we add a speech enhancement algorithm to the front end of the speech recognizer. We show that the enhancement algorithm improves the performance of the baseline system only slightly, while it decreases the WER substantially for our proposed system.
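The missing frames idea described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the frame length and hop, the log-energy threshold, and the silence floor vector are all hypothetical values chosen for illustration, since the abstract does not specify them. The sketch assumes the frame reliability decision is made on a separate detector channel (for example, a glottal sensor signal) and then applied to the acoustic feature vectors.

```python
import math


def split_frames(signal, frame_len, hop):
    """Slice a sample sequence into frames; the ragged tail is dropped."""
    return [signal[i:i + frame_len]
            for i in range(0, len(signal) - frame_len + 1, hop)]


def frame_log_energy(frame):
    """Natural-log energy of one frame, floored to avoid log(0)."""
    energy = sum(s * s for s in frame)
    return math.log(max(energy, 1e-10))


def reliable_mask(detector_signal, frame_len, hop, threshold):
    """True for frames whose detector-channel log energy reaches the threshold.

    The detector channel is an assumption: it could be the noisy acoustic
    signal itself or a second sensor such as a glottal sensor.
    """
    return [frame_log_energy(f) >= threshold
            for f in split_frames(detector_signal, frame_len, hop)]


def apply_missing_frames(features, mask, silence_floor):
    """Replace each unreliable frame's feature vector with the silence floor."""
    return [list(vec) if keep else list(silence_floor)
            for vec, keep in zip(features, mask)]


# Toy detector signal: silence, a high-energy burst, then silence again.
glottal = [0.0] * 4 + [1.0] * 4 + [0.0] * 4
mask = reliable_mask(glottal, frame_len=4, hop=4, threshold=0.0)

# Toy acoustic features, one (hypothetical) 2-dimensional vector per frame.
features = [[1.0, 2.0], [3.0, 4.0], [5.0, 6.0]]
floored = apply_missing_frames(features, mask, silence_floor=[0.0, 0.0])

print(mask)     # [False, True, False]
print(floored)  # [[0.0, 0.0], [3.0, 4.0], [0.0, 0.0]]
```

Keeping the reliability decision separate from the feature replacement means the same masking step can be driven either by the acoustic energy or by a second sensor, which is the substitution the abstract proposes.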
Pages: 113-116
Page count: 4