Fast Speech Recognition for Voice Destination Entry in a Car Navigation System

被引：0

作者：

Chung, Hoon ^{[1
]}

Park, JeonGue ^{[1
]}

Jeon, HyeonBae ^{[1
]}

Lee, YunKeun ^{[1
]}

机构：

[1] Elect & Telecommun Res Inst, Taejon 305606, South Korea

来源：

INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5 | 2009年

关键词：

speech recognition; multi-stage decoding;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In this paper, we introduce a multi-stage decoding algorithm optimized to recognize very large number of entry names on a resource-limited embedded device. The multi-stage decoding algorithm is composed of a two-stage HMM-based coarse search and a detailed search. The two-stage HMM-based coarse search generates a small set of candidates that are assumed to contain a correct hypothesis with high probability, and the detailed search re-ranks the candidates by rescoring them with sophisticate acoustic models. In this paper, we take experiments with I-millions of point-of-interest (POI) names on an in-car navigation device with a fixed-point processor running at 620MHz. The experimental result shows that the multi-stage decoding algorithm runs about 2.23 times real-time on the device without serious degradation of recognition performance.

引用

页码：979 / 982

页数：4

共 50 条

[41] VOICE RECOGNITION SYSTEM
HANSEN, GC
FALKENBACH, KH
YAGHMAI, I
RADIOLOGY, 1988, 169 (02) : 580 - 580
[42] Demo: Proactive Car Navigation: How Can Destination Prediction Give Us New Navigation Experience?
Imai, Ryo
Watanabe, Kosuke
Tsubouchi, Kota
Shimosaka, Masamichi
UBICOMP/ISWC'19 ADJUNCT: PROCEEDINGS OF THE 2019 ACM INTERNATIONAL JOINT CONFERENCE ON PERVASIVE AND UBIQUITOUS COMPUTING AND PROCEEDINGS OF THE 2019 ACM INTERNATIONAL SYMPOSIUM ON WEARABLE COMPUTERS, 2019, : 292 - 295
[43] Evaluation of Interface and In-Car Speech - Many Undesirable Utterances and Sever Noisy Speech on Car Navigation Application -
Hataoka, Nobuo
Araki, Manabu
Matsuda, Takashi
Takahashi, Masayuki
Ohtaki, Ryoichi
Obuchi, Yasunari
2008 IEEE 10TH WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING, VOLS 1 AND 2, 2008, : 960 - +
[44] Speech recognition for command entry in multimodal interaction
Tyfa, DA
Howes, M
INTERNATIONAL JOURNAL OF HUMAN-COMPUTER STUDIES, 2000, 52 (04) : 637 - 667
[45] Speech Recognition and Voice Separation for the Internet of Things
Mofrad, Mohammad Hasanzadeh
Mosse, Daniel
PROCEEDINGS OF THE 8TH INTERNATIONAL CONFERENCE ON THE INTERNET OF THINGS (IOT'18), 2018,
[46] VOICE RECOGNITION JOINS SPEECH ON PROGRAMMABLE BOARD
DUSEK, L
SCHALK, TB
ELECTRONICS, 1983, 56 (08): : 128 - &
[47] Robust speech recognition for car environment noise
Kokubo, H
Amano, A
Hataoka, N
ELECTRONICS AND COMMUNICATIONS IN JAPAN PART III-FUNDAMENTAL ELECTRONIC SCIENCE, 2002, 85 (11): : 65 - 73
[48] Automatic Speech Recognition Technique For Voice Command
Gupta, Anshul
Patel, Nileshkumar
Khan, Shabana
2014 INTERNATIONAL CONFERENCE ON SCIENCE ENGINEERING AND MANAGEMENT RESEARCH (ICSEMR), 2014,
[49] A MONOLITHIC PROGRAMMABLE SPEECH SYNTHESIZER WITH VOICE RECOGNITION
YOSHINO, T
TAKAMIZAWA, T
HENDERSON, A
ABIKO, S
HASHIZUME, M
SATOH, T
KATOH, K
ISSCC DIGEST OF TECHNICAL PAPERS, 1984, 27 : 116 - 117
[50] Turkish Speech Recognition for Voice Search Applications
Tekgoz, Hilal
Ozbek, Muhammed Murat
Buyuktanir, Tolga
Uz, Harun
32ND IEEE SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE, SIU 2024, 2024,

← 1 2 3 4 5 →