An In-Car Speech Recognition System for Disabled Drivers

被引:0
|
作者
Ivanecky, Jozef [1 ]
Mehlhase, Stephan [1 ]
机构
[1] European Media Lab, D-69118 Heidelberg, Germany
来源
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Automatic Speech Recognition (ASR) is becoming a standard in nowadays cars. However, ASR in cars is usually restricted to activities not directly influencing the driving process. Thus, the voice-controlled functions can rather be classified as comfort functions, e. g. controlling the air condition, the navigation and entertainment system or even the mobile phone of the driver. Obviously this usage of an ASR system could be extended in two directions: On the one side, the speech recognition system could be used to control secondary functions in the car like lights, windscreen wipers or windows. On the other side, the comfort functions could be enriched by utilizing services like weather inquiries, SMS dictation or online traffic information. Compared to todays usage these extensions require a different approach than the one employed today. Controlling secondary functions in the car by voice demands the usage of a very reliable, real-time, local ASR. At the same time a large vocabulary ASR system is required for comfort functions like dictation of messages. In this paper, we describe our efforts towards a hybrid speech recognition system to control secondary functions in the car. We also provide an extended comfort functionality to the driver. The hybrid speech recognition system contains a fast, grammar-based, embedded recognizer and a remote, server-based, LM-based, large vocabulary ASR system. We will analyze different aspects of such a design and the integration of it into a car. The main focus of the paper will be on maximizing the reliability of the embedded recognizer and designing an algorithm for switching dynamically between the embedded recognizer and the server-based ASR system.
引用
收藏
页码:505 / 512
页数:8
相关论文
共 50 条
  • [21] THE FOSAFER SYSTEM FOR THE ICASSP2024 IN-CAR MULTI-CHANNEL AUTOMATIC SPEECH RECOGNITION CHALLENGE
    Huang, Shangkun
    Du, Yuxuan
    Wang, Yankai
    Deng, Jing
    Zheng, Rong
    2024 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING WORKSHOPS, ICASSPW 2024, 2024, : 5 - 6
  • [22] Robust speech recognition techniques evaluation for telephony server based in-car applications
    Delphin-Poulat, L
    2004 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING, 2004, : 65 - 68
  • [23] Adaptive nonlinear regression using multiple distributed microphones for in-car speech recognition
    Li, WF
    Miyajima, C
    Nishino, T
    Itou, K
    Takeda, K
    Itakura, F
    IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 2005, E88A (07) : 1716 - 1723
  • [24] Multimedia corpus of in-car speech communication
    Kawaguchi, N
    Takeda, K
    Itakura, F
    JOURNAL OF VLSI SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2004, 36 (2-3): : 153 - 159
  • [25] Multimedia Corpus of In-Car Speech Communication
    Nobuo Kawaguchi
    Kazuya Takeda
    Fumitada Itakura
    Journal of VLSI signal processing systems for signal, image and video technology, 2004, 36 : 153 - 159
  • [26] THE AUSTRALIAN ENGLISH SPEECH CORPUS FOR IN-CAR SPEECH PROCESSING
    Kleinschmidt, Tristan
    Mason, Michael
    Wong, Eddie
    Sridharan, Sridha
    2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, : 4177 - 4180
  • [27] Effects of an In-Car Augmented Reality System on Improving Safety of Younger and Older Drivers
    Fu, Wai-Tat
    Gasper, John
    Kim, Seong-Whan
    2013 IEEE INTERNATIONAL SYMPOSIUM ON MIXED AND AUGMENTED REALITY (ISMAR) - SCIENCE AND TECHNOLOGY, 2013, : 59 - 66
  • [28] SPEECH INTELLIGIBILITY ENHANCEMENT BY EQUALIZATION FOR IN-CAR APPLICATIONS
    Gentet, Enguerrand
    David, Bertrand
    Denjean, Sebastien
    Richard, Gael
    Roussarie, Vincent
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 6934 - 6938
  • [29] Multiple regression of log spectra for in-car speech recognition using multiple distributed microphones
    Li, WF
    Shinde, T
    Fujimura, H
    Miyajima, C
    Nishino, T
    Itou, K
    Takeda, K
    Itakura, F
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2005, E88D (03) : 384 - 390
  • [30] In-car speech recognition using distributed microphones - Adapting to automatically detected driving conditions
    Banno, H
    Shinde, T
    Takeda, K
    Itakura, F
    2003 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING I, 2003, : 324 - 327