Fast Speech Recognition for Voice Destination Entry in a Car Navigation System

被引:0
|
作者
Chung, Hoon [1 ]
Park, JeonGue [1 ]
Jeon, HyeonBae [1 ]
Lee, YunKeun [1 ]
机构
[1] Elect & Telecommun Res Inst, Taejon 305606, South Korea
关键词
speech recognition; multi-stage decoding;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we introduce a multi-stage decoding algorithm optimized to recognize very large number of entry names on a resource-limited embedded device. The multi-stage decoding algorithm is composed of a two-stage HMM-based coarse search and a detailed search. The two-stage HMM-based coarse search generates a small set of candidates that are assumed to contain a correct hypothesis with high probability, and the detailed search re-ranks the candidates by rescoring them with sophisticate acoustic models. In this paper, we take experiments with I-millions of point-of-interest (POI) names on an in-car navigation device with a fixed-point processor running at 620MHz. The experimental result shows that the multi-stage decoding algorithm runs about 2.23 times real-time on the device without serious degradation of recognition performance.
引用
收藏
页码:979 / 982
页数:4
相关论文
共 50 条
  • [21] Efficient Embedded Speech Recognition for Very Large Vocabulary Mandarin Car-Navigation Systems
    Qian, Yanmin
    Liu, Jia
    Johnson, Michael T.
    IEEE TRANSACTIONS ON CONSUMER ELECTRONICS, 2009, 55 (03) : 1496 - 1500
  • [22] Visual Distraction Effects of In-Car Text Entry Methods - Comparing Keyboard, Handwriting and Voice Recognition
    Kujala, Tuomo
    Grahn, Hilkka
    AUTOMOTIVEUI 2017: PROCEEDINGS OF THE 9TH INTERNATIONAL CONFERENCE ON AUTOMOTIVE USER INTERFACES AND INTERACTIVE VEHICULAR APPLICATIONS, 2017, : 1 - 10
  • [23] Improvements on Speech Recognition for Fast Speech
    Lee, Ki-Seung
    JOURNAL OF THE ACOUSTICAL SOCIETY OF KOREA, 2006, 25 (02): : 88 - 95
  • [24] AN IMPLEMENTATION OF VOICE CONTROL SYSTEM BY USING CLOUD SPEECH RECOGNITION SERVICES
    Lee, Chiung-Hon Leon
    Lee, Chengzhe
    Cheng, I-Jing
    4TH INTERNATIONAL CONFERENCE ON SOFTWARE TECHNOLOGY AND ENGINEERING (ICSTE 2012), 2012, : 577 - 581
  • [25] Robust speech recognition in car environments
    Shozakai, M
    Nakamura, S
    Shikano, K
    PROCEEDINGS OF THE 1998 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-6, 1998, : 269 - 272
  • [26] Smart Car: Digital Controlling System Using Android Smartwatch Voice Recognition
    Tombeng, Marchel T.
    Najoan, Regi
    Karel, Noviko
    2018 6TH INTERNATIONAL CONFERENCE ON CYBER AND IT SERVICE MANAGEMENT (CITSM), 2018, : 610 - 614
  • [27] SPEECH RECOGNITION IN THE NOISY CAR ENVIRONMENT
    RUEHL, HW
    DOBLER, S
    WEITH, J
    MEYER, P
    NOLL, A
    HAMER, HH
    PIOTROWSKI, H
    SPEECH COMMUNICATION, 1991, 10 (01) : 11 - 22
  • [28] Bangla Speech Recognition for Voice Search
    Saurav, Jillur Rahman
    Amin, Shakhawat
    Kibria, Shafkat
    Rahman, M. Shahidur
    2018 INTERNATIONAL CONFERENCE ON BANGLA SPEECH AND LANGUAGE PROCESSING (ICBSLP), 2018,
  • [29] THE ROYALFLUSH AUTOMATIC SPEECH DIARIZATION AND RECOGNITION SYSTEM FOR IN-CAR MULTI-CHANNEL AUTOMATIC SPEECH RECOGNITION CHALLENGE
    Tian, Jingguang
    Ye, Shuaishuai
    Chen, Shunfei
    Xiang, Yang
    Yin, Zhaohui
    Hu, Xinhui
    Xu, Xinkang
    2024 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING WORKSHOPS, ICASSPW 2024, 2024, : 1 - 2
  • [30] SOC for Car Navigation System with a 55.3GOPS Image Recognition Engine
    Hamasaki, Hiroyuki
    Hoshi, Yasuhiko
    Nakamura, Atsushi
    Yamamoto, Akihiro
    Kido, Hideaki
    Muramatsu, Shoji
    2010 15TH ASIA AND SOUTH PACIFIC DESIGN AUTOMATION CONFERENCE (ASP-DAC 2010), 2010, : 458 - +