An In-Car Speech Recognition System for Disabled Drivers

被引:0
|
作者
Ivanecky, Jozef [1 ]
Mehlhase, Stephan [1 ]
机构
[1] European Media Lab, D-69118 Heidelberg, Germany
来源
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Automatic Speech Recognition (ASR) is becoming a standard in nowadays cars. However, ASR in cars is usually restricted to activities not directly influencing the driving process. Thus, the voice-controlled functions can rather be classified as comfort functions, e. g. controlling the air condition, the navigation and entertainment system or even the mobile phone of the driver. Obviously this usage of an ASR system could be extended in two directions: On the one side, the speech recognition system could be used to control secondary functions in the car like lights, windscreen wipers or windows. On the other side, the comfort functions could be enriched by utilizing services like weather inquiries, SMS dictation or online traffic information. Compared to todays usage these extensions require a different approach than the one employed today. Controlling secondary functions in the car by voice demands the usage of a very reliable, real-time, local ASR. At the same time a large vocabulary ASR system is required for comfort functions like dictation of messages. In this paper, we describe our efforts towards a hybrid speech recognition system to control secondary functions in the car. We also provide an extended comfort functionality to the driver. The hybrid speech recognition system contains a fast, grammar-based, embedded recognizer and a remote, server-based, LM-based, large vocabulary ASR system. We will analyze different aspects of such a design and the integration of it into a car. The main focus of the paper will be on maximizing the reliability of the embedded recognizer and designing an algorithm for switching dynamically between the embedded recognizer and the server-based ASR system.
引用
收藏
页码:505 / 512
页数:8
相关论文
共 50 条
  • [31] Dual-Microphone Speech Reinforcement System With Howling-Control for In-Car Speech Communication
    Alkaher, Yehav
    Cohen, Israel
    FRONTIERS IN SIGNAL PROCESSING, 2022, 2
  • [32] Improved noise spectra estimation and log-spectral regression for in-car speech recognition
    Li, W. (lee@sp.m.is.nagoya-u.ac.jp), Information Processing Society of Japan, IPSJ; The Database Society of Japan, DBSJ; The IEEE Computer Society; The Inst. of Elec., Info. and Com. Engineers, IEICE (IEEE Computer Society):
  • [33] In-car speech recognition using distributed microphones - Adapting to automatically detected driving conditions
    Banno, H
    Shinde, T
    Takeda, K
    Itakura, F
    2003 INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOL III, PROCEEDINGS, 2003, : 609 - 612
  • [34] A SUBBAND HYBRID BEAMFORMING FOR IN-CAR SPEECH ENHANCEMENT
    Fox, Charles
    Vitte, Guillaume
    Charbit, Maurice
    Prado, Jacques
    Badeau, Roland
    David, Bertrand
    2012 PROCEEDINGS OF THE 20TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2012, : 11 - 15
  • [35] Laying the Foundation for In-car Alcohol Detection by Speech
    Schiel, Florian
    Heinrich, Christian
    INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 991 - 994
  • [36] A software architecture supporting in-car speech interaction
    Kun, AL
    Miller, WT
    Pelhe, A
    Lynch, RL
    2004 IEEE INTELLIGENT VEHICLES SYMPOSIUM, 2004, : 471 - 476
  • [37] Construction and evaluation of a large in-car speech corpus
    Takeda, K
    Fujimura, H
    Itou, K
    Kawaguchi, N
    Matsubara, S
    Itakura, F
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2005, E88D (03): : 553 - 561
  • [38] SPEECH RECOGNITION FOR THE DISABLED
    GRATTAN, KTV
    PALMER, AW
    SHURROCK, CSA
    IEEE ENGINEERING IN MEDICINE AND BIOLOGY MAGAZINE, 1991, 10 (03): : 51 - 57
  • [39] AN FLMS BASED TWO-MICROPHONE SPEECH ENHANCEMENT SYSTEM FOR IN-CAR APPLICATIONS
    Freudenberger, Juergen
    Stenzel, Sebastian
    Venditti, Benjamin
    2009 IEEE/SP 15TH WORKSHOP ON STATISTICAL SIGNAL PROCESSING, VOLS 1 AND 2, 2009, : 704 - 707
  • [40] A noise robust front-end with low computational cost for embedded in-car speech recognition
    Ding, Pei
    He, Lei
    Yan, Xiang
    Zhao, Rui
    Hao, Jie
    2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL IV, PTS 1-3, 2007, : 1045 - +