An In-Car Speech Recognition System for Disabled Drivers

被引:0
|
作者
Ivanecky, Jozef [1 ]
Mehlhase, Stephan [1 ]
机构
[1] European Media Lab, D-69118 Heidelberg, Germany
来源
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Automatic Speech Recognition (ASR) is becoming a standard in nowadays cars. However, ASR in cars is usually restricted to activities not directly influencing the driving process. Thus, the voice-controlled functions can rather be classified as comfort functions, e. g. controlling the air condition, the navigation and entertainment system or even the mobile phone of the driver. Obviously this usage of an ASR system could be extended in two directions: On the one side, the speech recognition system could be used to control secondary functions in the car like lights, windscreen wipers or windows. On the other side, the comfort functions could be enriched by utilizing services like weather inquiries, SMS dictation or online traffic information. Compared to todays usage these extensions require a different approach than the one employed today. Controlling secondary functions in the car by voice demands the usage of a very reliable, real-time, local ASR. At the same time a large vocabulary ASR system is required for comfort functions like dictation of messages. In this paper, we describe our efforts towards a hybrid speech recognition system to control secondary functions in the car. We also provide an extended comfort functionality to the driver. The hybrid speech recognition system contains a fast, grammar-based, embedded recognizer and a remote, server-based, LM-based, large vocabulary ASR system. We will analyze different aspects of such a design and the integration of it into a car. The main focus of the paper will be on maximizing the reliability of the embedded recognizer and designing an algorithm for switching dynamically between the embedded recognizer and the server-based ASR system.
引用
收藏
页码:505 / 512
页数:8
相关论文
共 50 条
  • [1] A speech driven in-car assistance system
    Coletti, P
    Cristoforetti, L
    Matassoni, M
    Omologo, M
    Svaizer, P
    Geutner, P
    Steffens, F
    IEEE IV2003: INTELLIGENT VEHICLES SYMPOSIUM, PROCEEDINGS, 2003, : 622 - 626
  • [2] In-car speech recognition using distributed multiple microphones
    Li, WF
    Nishino, T
    Miyajima, C
    Itou, K
    Takeda, K
    Itakura, F
    ADVANCES IN MULTIMEDIA INFORMATION PROCESSING - PCM 2004, PT 1, PROCEEDINGS, 2004, 3331 : 505 - 513
  • [3] THE ROYALFLUSH AUTOMATIC SPEECH DIARIZATION AND RECOGNITION SYSTEM FOR IN-CAR MULTI-CHANNEL AUTOMATIC SPEECH RECOGNITION CHALLENGE
    Tian, Jingguang
    Ye, Shuaishuai
    Chen, Shunfei
    Xiang, Yang
    Yin, Zhaohui
    Hu, Xinhui
    Xu, Xinkang
    2024 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING WORKSHOPS, ICASSPW 2024, 2024, : 1 - 2
  • [4] Adaptive regression based framework for in-car speech recognition
    Li, Weifeng
    Itou, Katunobu
    Takeda, Kazuya
    Itakura, Fumitada
    2006 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-13, 2006, : 501 - 504
  • [5] Denoising Algorithms using Stacked RNN models for In-Car Speech Recognition System
    Panda, Anirban
    2018 4TH INTERNATIONAL CONFERENCE FOR CONVERGENCE IN TECHNOLOGY (I2CT), 2018,
  • [6] Intelligent In-Car Emotion Regulation Interaction System Based on Speech Emotion Recognition
    Yang, Yuhan
    Zhang, Yan
    Zhong, Zhinan
    Dai, Wan
    Chen, Yunfei
    Chen, Mo
    2024 4TH INTERNATIONAL CONFERENCE ON COMPUTER, CONTROL AND ROBOTICS, ICCCR 2024, 2024, : 142 - 150
  • [7] Hybrid in-car speech recognition for mobile multimedia applications
    DaimlerChrysler Aerospace, Ulm, Germany
    IEEE Veh Technol Conf, (2009-2013):
  • [8] Hybrid in-car speech recognition for mobile multimedia applications
    Kuhn, T
    Jameel, A
    Stümpfle, M
    Haddadi, A
    1999 IEEE 49TH VEHICULAR TECHNOLOGY CONFERENCE, VOLS 1-3: MOVING INTO A NEW MILLENIUM, 1999, : 2009 - 2013
  • [9] FPGA Implementation of Spectral Subtraction for In-Car Speech Enhancement and Recognition
    Whittington, Jim
    Deo, Kapeel
    Kleinschmidt, Tristan
    Mason, Michael
    ICSPCS: 2ND INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND COMMUNICATION SYSTEMS, PROCEEDINGS, 2008, : 393 - +
  • [10] Local peak enhancement for in-car speech recognition in noisy environment
    Ichikawa, Osamu
    Fukuda, Takashi
    Nishimura, Masafumi
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2008, E91D (03) : 635 - 639