Adaptive regression based framework for in-car speech recognition

被引:0
|
作者
Li, Weifeng [1 ]
Itou, Katunobu [1 ]
Takeda, Kazuya [1 ]
Itakura, Fumitada [1 ]
机构
[1] Nagoya Univ, Grad Sch Engn, Nagoya, Aichi 4648603, Japan
关键词
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
We address issues for improving hands-free speech recognition performance in different car environments using a single distant microphone. In our previous work, we proposed a regression based enhancement method for in-car speech recognition. In this paper, we describe recent improvements and propose a data-driven adaptive regression based speech recognition system, in which both feature enhancement and model compensation are performed. Based on isolated word recognition experiments conducted in 15 real car environments, the proposed adaptive regression approach shows an advantage in average relative word error rate (WER) reductions of 52.5% and 14.8%, compared to original noisy speech and ETSI advanced front-end, respectively.
引用
收藏
页码:501 / 504
页数:4
相关论文
共 50 条
  • [41] Robust Log-Energy Estimation and its Dynamic Change Enhancement for In-car Speech Recognition
    Li, Weifeng
    Wang, Longbiao
    Zhou, Yicong
    Bourlard, Herv
    Liao, Qingmin
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2013, 21 (08): : 1689 - 1698
  • [42] On the joint use of noise reduction and MLLR adaptation for in-car hands-free speech recognition
    Matassoni, M
    Omologo, M
    Santarelli, A
    Svaizer, P
    2002 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-IV, PROCEEDINGS, 2002, : 289 - 292
  • [43] Automatic music genre recognition for in-car infotainment
    Jakubec, Maros
    Chmulik, Michal
    13TH INTERNATIONAL SCIENTIFIC CONFERENCE ON SUSTAINABLE, MODERN AND SAFE TRANSPORT (TRANSCOM 2019), 2019, 40 : 1364 - 1371
  • [44] Improved traffic sign recognition for in-car cameras
    Lin, Huei-Yung
    Chang, Chin-Chen
    Van Luan Tran
    Shi, Jian-He
    JOURNAL OF THE CHINESE INSTITUTE OF ENGINEERS, 2020, 43 (03) : 300 - 307
  • [45] Decision fusion techniques for in-car driver recognition
    Eskil, Taner
    Erdogan, Hakan
    Ercil, Aytul
    Ozyagci, Ali Nazmi
    Rodoper, Mete
    2006 IEEE 14TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS, VOLS 1 AND 2, 2006, : 738 - +
  • [46] Characterizing in-car conversational speech of different dialogue modes
    Fujimura, Hiroshi
    Miyajima, Chiyomi
    Kawaguchi, Nobuo
    Itou, Katsunobu
    Takeda, Kazuya
    Itakura, Fumitada
    ICICIC 2006: FIRST INTERNATIONAL CONFERENCE ON INNOVATIVE COMPUTING, INFORMATION AND CONTROL, VOL 2, PROCEEDINGS, 2006, : 552 - +
  • [47] CI-AVSR: A Cantonese Audio-Visual Speech Dataset for In-car Command Recognition
    Dai, Wenliang
    Cahyawijaya, Samuel
    Yu, Tiezheng
    Barezi, Elham J.
    Xu, Peng
    Yiu, Cheuk Tung Shadow
    Frieske, Rita
    Lovenia, Holy
    Winata, Genta Indra
    Chen, Qifeng
    Ma, Xiaojuan
    Shi, Bertram E.
    Fung, Pascale
    LREC 2022: THIRTEEN INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2022, : 6786 - 6793
  • [48] AN FLMS BASED TWO-MICROPHONE SPEECH ENHANCEMENT SYSTEM FOR IN-CAR APPLICATIONS
    Freudenberger, Juergen
    Stenzel, Sebastian
    Venditti, Benjamin
    2009 IEEE/SP 15TH WORKSHOP ON STATISTICAL SIGNAL PROCESSING, VOLS 1 AND 2, 2009, : 704 - 707
  • [49] Use of parallel recognizers for robust in-car speech interaction
    Cristoforetti, L
    Matassoni, M
    Omologo, M
    Svaizer, P
    2003 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING I, 2003, : 320 - 323
  • [50] CIAIR in-car speech corpus - Influence of driving status
    Kawaguchi, N
    Matsubara, S
    Takeda, K
    Itakura, F
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2005, E88D (03): : 578 - 582