Adaptive regression based framework for in-car speech recognition

Cited by: 0
Authors
Li, Weifeng [1 ]
Itou, Katunobu [1 ]
Takeda, Kazuya [1 ]
Itakura, Fumitada [1 ]
Affiliations
[1] Nagoya Univ, Grad Sch Engn, Nagoya, Aichi 4648603, Japan
Keywords
DOI
Not available
Chinese Library Classification (CLC) number
O42 [Acoustics]
Discipline classification codes
070206; 082403
Abstract
We address issues for improving hands-free speech recognition performance in different car environments using a single distant microphone. In our previous work, we proposed a regression based enhancement method for in-car speech recognition. In this paper, we describe recent improvements and propose a data-driven adaptive regression based speech recognition system, in which both feature enhancement and model compensation are performed. Based on isolated word recognition experiments conducted in 15 real car environments, the proposed adaptive regression approach shows an advantage in average relative word error rate (WER) reductions of 52.5% and 14.8%, compared to original noisy speech and ETSI advanced front-end, respectively.
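The abstract reports results as average relative WER reductions. As a minimal sketch of how such a figure is conventionally computed, assuming the usual definition (baseline WER minus system WER, divided by baseline WER), the helper below is hypothetical and the input values are illustrative only; they are not taken from the paper.

```python
def relative_wer_reduction(baseline_wer: float, system_wer: float) -> float:
    """Return the relative WER reduction in percent.

    Assumes the conventional definition:
        100 * (baseline_wer - system_wer) / baseline_wer
    Both inputs are WERs on the same scale (e.g. percent).
    """
    if baseline_wer <= 0:
        raise ValueError("baseline_wer must be positive")
    return 100.0 * (baseline_wer - system_wer) / baseline_wer


# Illustrative values only (not from the paper): a baseline WER of 20%
# lowered to 10% is a 50% relative reduction.
print(relative_wer_reduction(20.0, 10.0))
```

An average over several environments, as in the abstract, would apply this per environment and then average, or average the WERs first; the record does not say which convention the authors used.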
Pages: 501-504
Page count: 4
Related papers
50 in total
  • [21] Experiments of in-car audio compensation for hands-free speech recognition
    Matassoni, M
    Omologo, M
    Zieger, C
    ASRU'03: 2003 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING ASRU '03, 2003, : 369 - 374
  • [22] THE ROYALFLUSH AUTOMATIC SPEECH DIARIZATION AND RECOGNITION SYSTEM FOR IN-CAR MULTI-CHANNEL AUTOMATIC SPEECH RECOGNITION CHALLENGE
    Tian, Jingguang
    Ye, Shuaishuai
    Chen, Shunfei
    Xiang, Yang
    Yin, Zhaohui
    Hu, Xinhui
    Xu, Xinkang
    2024 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING WORKSHOPS, ICASSPW 2024, 2024, : 1 - 2
  • [23] Multimedia corpus of in-car speech communication
    Kawaguchi, N
    Takeda, K
    Itakura, F
    JOURNAL OF VLSI SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2004, 36 (2-3): : 153 - 159
  • [24] Denoising Algorithms using Stacked RNN models for In-Car Speech Recognition System
    Panda, Anirban
    2018 4TH INTERNATIONAL CONFERENCE FOR CONVERGENCE IN TECHNOLOGY (I2CT), 2018,
  • [25] Multimedia Corpus of In-Car Speech Communication
    Nobuo Kawaguchi
    Kazuya Takeda
    Fumitada Itakura
    Journal of VLSI signal processing systems for signal, image and video technology, 2004, 36 : 153 - 159
  • [26] THE AUSTRALIAN ENGLISH SPEECH CORPUS FOR IN-CAR SPEECH PROCESSING
    Kleinschmidt, Tristan
    Mason, Michael
    Wong, Eddie
    Sridharan, Sridha
    2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, : 4177 - 4180
  • [27] In-car speech enhancement based on ensemble empirical mode decomposition
    Chen, Xiangxian
    Huang, Hai
    Zhang, Jiafang
    COMPUTER SYSTEMS SCIENCE AND ENGINEERING, 2011, 26 (01): : 39 - 46
  • [28] A speech driven in-car assistance system
    Coletti, P
    Cristoforetti, L
    Matassoni, M
    Omologo, M
    Svaizer, P
    Geutner, P
    Steffens, F
    IEEE IV2003: INTELLIGENT VEHICLES SYMPOSIUM, PROCEEDINGS, 2003, : 622 - 626
  • [29] SPEECH INTELLIGIBILITY ENHANCEMENT BY EQUALIZATION FOR IN-CAR APPLICATIONS
    Gentet, Enguerrand
    David, Bertrand
    Denjean, Sebastien
    Richard, Gael
    Roussarie, Vincent
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 6934 - 6938
  • [30] MODEL-BASED NOISE REDUCTION LEVERAGING FREQUENCY-WISE CONFIDENCE METRIC FOR IN-CAR SPEECH RECOGNITION
    Ichikawa, Osamu
    Rennie, Steven J.
    Fukuda, Takashi
    Nishimura, Masafumi
    2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 4921 - 4924