Adaptive regression based framework for in-car speech recognition

被引:0
|
作者
Li, Weifeng [1 ]
Itou, Katunobu [1 ]
Takeda, Kazuya [1 ]
Itakura, Fumitada [1 ]
机构
[1] Nagoya Univ, Grad Sch Engn, Nagoya, Aichi 4648603, Japan
关键词
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
We address issues for improving hands-free speech recognition performance in different car environments using a single distant microphone. In our previous work, we proposed a regression based enhancement method for in-car speech recognition. In this paper, we describe recent improvements and propose a data-driven adaptive regression based speech recognition system, in which both feature enhancement and model compensation are performed. Based on isolated word recognition experiments conducted in 15 real car environments, the proposed adaptive regression approach shows an advantage in average relative word error rate (WER) reductions of 52.5% and 14.8%, compared to original noisy speech and ETSI advanced front-end, respectively.
引用
收藏
页码:501 / 504
页数:4
相关论文
共 50 条
  • [1] Adaptive nonlinear regression using multiple distributed microphones for in-car speech recognition
    Li, WF
    Miyajima, C
    Nishino, T
    Itou, K
    Takeda, K
    Itakura, F
    IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 2005, E88A (07) : 1716 - 1723
  • [2] Adaptive log-spectral regression for in-car speech recognition using multiple distributed microphones
    Li, WF
    Takeda, K
    Itakura, F
    IEEE SIGNAL PROCESSING LETTERS, 2005, 12 (04) : 340 - 343
  • [3] Robust In-Car Speech Recognition Based on Nonlinear Multiple Regressions
    Weifeng Li
    Kazuya Takeda
    Fumitada Itakura
    EURASIP Journal on Advances in Signal Processing, 2007
  • [4] Robust in-car speech recognition based on nonlinear multiple regressions
    Li, Weifeng
    Takeda, Kazuya
    Itakura, Fumitada
    EURASIP JOURNAL ON ADVANCES IN SIGNAL PROCESSING, 2007, 2007 (1)
  • [5] An In-Car Speech Recognition System for Disabled Drivers
    Ivanecky, Jozef
    Mehlhase, Stephan
    TEXT, SPEECH AND DIALOGUE, TSD 2012, 2012, 7499 : 505 - 512
  • [6] Multiple regression of log spectra for in-car speech recognition using multiple distributed microphones
    Li, WF
    Shinde, T
    Fujimura, H
    Miyajima, C
    Nishino, T
    Itou, K
    Takeda, K
    Itakura, F
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2005, E88D (03) : 384 - 390
  • [7] Improved noise spectra estimation and log-spectral regression for in-car speech recognition
    Li, W. (lee@sp.m.is.nagoya-u.ac.jp), Information Processing Society of Japan, IPSJ; The Database Society of Japan, DBSJ; The IEEE Computer Society; The Inst. of Elec., Info. and Com. Engineers, IEICE (IEEE Computer Society):
  • [8] In-car speech recognition using distributed multiple microphones
    Li, WF
    Nishino, T
    Miyajima, C
    Itou, K
    Takeda, K
    Itakura, F
    ADVANCES IN MULTIMEDIA INFORMATION PROCESSING - PCM 2004, PT 1, PROCEEDINGS, 2004, 3331 : 505 - 513
  • [9] Hybrid in-car speech recognition for mobile multimedia applications
    DaimlerChrysler Aerospace, Ulm, Germany
    IEEE Veh Technol Conf, (2009-2013):
  • [10] Hybrid in-car speech recognition for mobile multimedia applications
    Kuhn, T
    Jameel, A
    Stümpfle, M
    Haddadi, A
    1999 IEEE 49TH VEHICULAR TECHNOLOGY CONFERENCE, VOLS 1-3: MOVING INTO A NEW MILLENIUM, 1999, : 2009 - 2013