Adaptive regression based framework for in-car speech recognition

被引:0
|
作者
Li, Weifeng [1 ]
Itou, Katunobu [1 ]
Takeda, Kazuya [1 ]
Itakura, Fumitada [1 ]
机构
[1] Nagoya Univ, Grad Sch Engn, Nagoya, Aichi 4648603, Japan
关键词
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
We address issues for improving hands-free speech recognition performance in different car environments using a single distant microphone. In our previous work, we proposed a regression based enhancement method for in-car speech recognition. In this paper, we describe recent improvements and propose a data-driven adaptive regression based speech recognition system, in which both feature enhancement and model compensation are performed. Based on isolated word recognition experiments conducted in 15 real car environments, the proposed adaptive regression approach shows an advantage in average relative word error rate (WER) reductions of 52.5% and 14.8%, compared to original noisy speech and ETSI advanced front-end, respectively.
引用
收藏
页码:501 / 504
页数:4
相关论文
共 50 条
  • [31] In-car speech recognition using distributed microphones - Adapting to automatically detected driving conditions
    Banno, H
    Shinde, T
    Takeda, K
    Itakura, F
    2003 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING I, 2003, : 324 - 327
  • [32] MODEL-BASED NOISE REDUCTION LEVERAGING FREQUENCY-WISE CONFIDENCE METRIC FOR IN-CAR SPEECH RECOGNITION
    Ichikawa, Osamu
    Rennie, Steven J.
    Fukuda, Takashi
    Nishimura, Masafumi
    2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 4921 - 4924
  • [33] In-car speech recognition using distributed microphones - Adapting to automatically detected driving conditions
    Banno, H
    Shinde, T
    Takeda, K
    Itakura, F
    2003 INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOL III, PROCEEDINGS, 2003, : 609 - 612
  • [34] A SUBBAND HYBRID BEAMFORMING FOR IN-CAR SPEECH ENHANCEMENT
    Fox, Charles
    Vitte, Guillaume
    Charbit, Maurice
    Prado, Jacques
    Badeau, Roland
    David, Bertrand
    2012 PROCEEDINGS OF THE 20TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2012, : 11 - 15
  • [35] Laying the Foundation for In-car Alcohol Detection by Speech
    Schiel, Florian
    Heinrich, Christian
    INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 991 - 994
  • [36] A software architecture supporting in-car speech interaction
    Kun, AL
    Miller, WT
    Pelhe, A
    Lynch, RL
    2004 IEEE INTELLIGENT VEHICLES SYMPOSIUM, 2004, : 471 - 476
  • [37] Construction and evaluation of a large in-car speech corpus
    Takeda, K
    Fujimura, H
    Itou, K
    Kawaguchi, N
    Matsubara, S
    Itakura, F
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2005, E88D (03): : 553 - 561
  • [38] A noise robust front-end with low computational cost for embedded in-car speech recognition
    Ding, Pei
    He, Lei
    Yan, Xiang
    Zhao, Rui
    Hao, Jie
    2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL IV, PTS 1-3, 2007, : 1045 - +
  • [39] Sub-band based Log-energy and Its Dynamic Range Stretching for Robust In-car Speech Recognition
    Li, Weifeng
    Bourlard, Herve
    13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 314 - 317
  • [40] HMM-based Driving Behavior Recognition for In-car Control Service
    Chuang, Chun-Fu
    Yang, Chung-Hsien
    Lin, Yu-Hui
    2015 IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS - TAIWAN (ICCE-TW), 2015, : 258 - 259