Fast Speech Recognition for Voice Destination Entry in a Car Navigation System

Cited by: 0
Authors
Chung, Hoon [1]
Park, JeonGue [1]
Jeon, HyeonBae [1]
Lee, YunKeun [1]
Affiliations
[1] Elect & Telecommun Res Inst, Taejon 305606, South Korea
Keywords
speech recognition; multi-stage decoding;
DOI
Not available
Chinese Library Classification
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
In this paper, we introduce a multi-stage decoding algorithm optimized to recognize a very large number of entry names on a resource-limited embedded device. The algorithm is composed of a two-stage HMM-based coarse search and a detailed search. The coarse search generates a small set of candidates that is assumed to contain the correct hypothesis with high probability, and the detailed search re-ranks the candidates by rescoring them with sophisticated acoustic models. We conduct experiments with 1 million point-of-interest (POI) names on an in-car navigation device with a fixed-point processor running at 620 MHz. The experimental results show that the multi-stage decoding algorithm runs at about 2.23 times real-time on the device without serious degradation of recognition performance.
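The coarse-then-detailed structure described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation. The abstract gives no details of the HMM models, so the two scoring functions below are hypothetical text-similarity stand-ins for acoustic scores, the two-stage coarse search is collapsed into a single cheap pass, and the names coarse_score, detailed_score, multi_stage_decode, and n_candidates are assumptions made for illustration only.

```python
import heapq
from difflib import SequenceMatcher


def coarse_score(query, name):
    """Cheap stand-in for the coarse search: Jaccard overlap of character bigrams.
    (Assumption: the paper uses HMM-based acoustic scores, not text similarity.)"""
    def bigrams(s):
        return {s[i:i + 2] for i in range(len(s) - 1)}
    q, n = bigrams(query), bigrams(name)
    if not q or not n:
        return 0.0
    return len(q & n) / len(q | n)


def detailed_score(query, name):
    """Costlier stand-in for the detailed search: sequence similarity ratio.
    (Assumption: the paper rescoring uses more sophisticated acoustic models.)"""
    return SequenceMatcher(None, query, name).ratio()


def multi_stage_decode(query, names, n_candidates=3):
    # Coarse stage: scan the whole vocabulary with the cheap score and
    # keep only a small candidate set.
    candidates = heapq.nlargest(
        n_candidates, names, key=lambda name: coarse_score(query, name)
    )
    # Detailed stage: re-rank only the surviving candidates with the
    # expensive score, best first.
    return sorted(
        candidates, key=lambda name: detailed_score(query, name), reverse=True
    )


# Hypothetical POI vocabulary and a noisy query.
poi_names = [
    "seoul station",
    "seoul tower",
    "busan station",
    "daejeon city hall",
    "incheon airport",
]
ranked = multi_stage_decode("seol station", poi_names)
print(ranked[0])
```

The point of the design is that the expensive score is evaluated only n_candidates times rather than once per vocabulary entry, which is what makes a very large vocabulary tractable on a constrained device, at the risk of losing the correct entry if the coarse stage prunes it.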
Pages: 979-982
Page count: 4