An In-Car Speech Recognition System for Disabled Drivers

被引：0

作者：

Ivanecky, Jozef ^{[1
]}

Mehlhase, Stephan ^{[1
]}

机构：

[1] European Media Lab, D-69118 Heidelberg, Germany

来源：

TEXT, SPEECH AND DIALOGUE, TSD 2012 | 2012年 / 7499卷

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Automatic Speech Recognition (ASR) is becoming a standard in nowadays cars. However, ASR in cars is usually restricted to activities not directly influencing the driving process. Thus, the voice-controlled functions can rather be classified as comfort functions, e. g. controlling the air condition, the navigation and entertainment system or even the mobile phone of the driver. Obviously this usage of an ASR system could be extended in two directions: On the one side, the speech recognition system could be used to control secondary functions in the car like lights, windscreen wipers or windows. On the other side, the comfort functions could be enriched by utilizing services like weather inquiries, SMS dictation or online traffic information. Compared to todays usage these extensions require a different approach than the one employed today. Controlling secondary functions in the car by voice demands the usage of a very reliable, real-time, local ASR. At the same time a large vocabulary ASR system is required for comfort functions like dictation of messages. In this paper, we describe our efforts towards a hybrid speech recognition system to control secondary functions in the car. We also provide an extended comfort functionality to the driver. The hybrid speech recognition system contains a fast, grammar-based, embedded recognizer and a remote, server-based, LM-based, large vocabulary ASR system. We will analyze different aspects of such a design and the integration of it into a car. The main focus of the paper will be on maximizing the reliability of the embedded recognizer and designing an algorithm for switching dynamically between the embedded recognizer and the server-based ASR system.

引用

页码：505 / 512

页数：8

共 50 条

[1] A speech driven in-car assistance system
Coletti, P
Cristoforetti, L
Matassoni, M
Omologo, M
Svaizer, P
Geutner, P
Steffens, F
IEEE IV2003: INTELLIGENT VEHICLES SYMPOSIUM, PROCEEDINGS, 2003, : 622 - 626
[2] In-car speech recognition using distributed multiple microphones
Li, WF
Nishino, T
Miyajima, C
Itou, K
Takeda, K
Itakura, F
ADVANCES IN MULTIMEDIA INFORMATION PROCESSING - PCM 2004, PT 1, PROCEEDINGS, 2004, 3331 : 505 - 513
[3] THE ROYALFLUSH AUTOMATIC SPEECH DIARIZATION AND RECOGNITION SYSTEM FOR IN-CAR MULTI-CHANNEL AUTOMATIC SPEECH RECOGNITION CHALLENGE
Tian, Jingguang
Ye, Shuaishuai
Chen, Shunfei
Xiang, Yang
Yin, Zhaohui
Hu, Xinhui
Xu, Xinkang
2024 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING WORKSHOPS, ICASSPW 2024, 2024, : 1 - 2
[4] Adaptive regression based framework for in-car speech recognition
Li, Weifeng
Itou, Katunobu
Takeda, Kazuya
Itakura, Fumitada
2006 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-13, 2006, : 501 - 504
[5] Denoising Algorithms using Stacked RNN models for In-Car Speech Recognition System
Panda, Anirban
2018 4TH INTERNATIONAL CONFERENCE FOR CONVERGENCE IN TECHNOLOGY (I2CT), 2018,
[6] Intelligent In-Car Emotion Regulation Interaction System Based on Speech Emotion Recognition
Yang, Yuhan
Zhang, Yan
Zhong, Zhinan
Dai, Wan
Chen, Yunfei
Chen, Mo
2024 4TH INTERNATIONAL CONFERENCE ON COMPUTER, CONTROL AND ROBOTICS, ICCCR 2024, 2024, : 142 - 150
[7] Hybrid in-car speech recognition for mobile multimedia applications
DaimlerChrysler Aerospace, Ulm, Germany
IEEE Veh Technol Conf, (2009-2013):
[8] Hybrid in-car speech recognition for mobile multimedia applications
Kuhn, T
Jameel, A
Stümpfle, M
Haddadi, A
1999 IEEE 49TH VEHICULAR TECHNOLOGY CONFERENCE, VOLS 1-3: MOVING INTO A NEW MILLENIUM, 1999, : 2009 - 2013
[9] FPGA Implementation of Spectral Subtraction for In-Car Speech Enhancement and Recognition
Whittington, Jim
Deo, Kapeel
Kleinschmidt, Tristan
Mason, Michael
ICSPCS: 2ND INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND COMMUNICATION SYSTEMS, PROCEEDINGS, 2008, : 393 - +
[10] Local peak enhancement for in-car speech recognition in noisy environment
Ichikawa, Osamu
Fukuda, Takashi
Nishimura, Masafumi
IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2008, E91D (03) : 635 - 639

← 1 2 3 4 5 →