Hands-free speech recognition and communication on PDAS using microphone array technology

被引:0
|
作者
Herbordt, W [1 ]
Horiuchi, T [1 ]
Fujimoto, M [1 ]
Jitsuhiro, T [1 ]
Nakamura, S [1 ]
机构
[1] ATR, Spoken Language Commun Res Labs, Kyoto 6190288, Japan
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, a personal digital assistant (PDA) for handsfree speech recognition and communication with a microphone array mounted on the PDA is presented. An outlier-robust generalized sidelobe canceller (RGSC) and a minimum mean-squared error (MMSE) estimator for log Mel-spectral energy coefficients using a Gaussian mixture model (GMM) for clean speech are implemented in real-time and evaluated for speech recognition based on a small experimental multichannel database. It is shown that the joint system of beamformer and single-channel noise suppression highly improves the noise-robustness of a large-vocabulary speech recognizer so that down to SNR= 5 dB more than 91 % word accuracy is obtained.
引用
收藏
页码:302 / 307
页数:6
相关论文
共 50 条
  • [21] Defeating reverberation: Advanced dereverberation and recognition techniques for hands-free speech recognition
    Delcroix, Marc
    Yoshioka, Takuya
    Ogawa, Atsunori
    Kubo, Yotaro
    Fujimoto, Masakiyo
    Ito, Nobutaka
    Kinoshita, Keisuke
    Espi, Miquel
    Araki, Shoko
    Hori, Takaaki
    Nakatani, Tomohiro
    2014 IEEE GLOBAL CONFERENCE ON SIGNAL AND INFORMATION PROCESSING (GLOBALSIP), 2014, : 522 - 526
  • [22] Optimized second-order gradient microphone for hands-free speech recordings in cars
    Aubauer, R
    Leckschat, D
    SPEECH COMMUNICATION, 2001, 34 (1-2) : 13 - 23
  • [23] Stereophonic hands-free communication system based on microphone array fixed beamforming: Real-time implementation and evaluation
    Pirro, Matteo
    Squartini, Stefano
    Romoli, Laura
    Piazza, Francesco
    Eurasip Journal on Audio, Speech, and Music Processing, 2012, 2012 (01)
  • [24] Hands-free speech recognition using a filtered clean corpus and incremental HMM adaptation
    Matassoni, M
    Omologo, M
    Giuliani, D
    2000 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS, VOLS I-VI, 2000, : 1407 - 1410
  • [25] Stereophonic hands-free communication system based on microphone array fixed beamforming: real-time implementation and evaluation
    Pirro, Matteo
    Squartini, Stefano
    Romoli, Laura
    Piazza, Francesco
    EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2012,
  • [26] Stereophonic hands-free communication system based on microphone array fixed beamforming: real-time implementation and evaluation
    Matteo Pirro
    Stefano Squartini
    Laura Romoli
    Francesco Piazza
    EURASIP Journal on Audio, Speech, and Music Processing, 2012
  • [27] Energy-based speech enhancement technique for hands-free communication
    Rahmani, M.
    Yousefian, N.
    Akbari, A.
    ELECTRONICS LETTERS, 2009, 45 (01) : 85 - 86
  • [28] Experiments of in-car audio compensation for hands-free speech recognition
    Matassoni, M
    Omologo, M
    Zieger, C
    ASRU'03: 2003 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING ASRU '03, 2003, : 369 - 374
  • [29] IMPROVED HANDS-FREE AUTOMATIC SPEECH RECOGNITION IN REVERBERANT ENVIRONMENT CONDITION
    Gomez, Randy
    Nakamura, Keisuke
    Mizumoto, Takeshi
    Nakadai, Kazuhiro
    2014 4TH JOINT WORKSHOP ON HANDS-FREE SPEECH COMMUNICATION AND MICROPHONE ARRAYS (HSCMA), 2014, : 67 - 71
  • [30] Likelihood-maximizing beamforming for robust hands-free speech recognition
    Seltzer, ML
    Raj, B
    Stern, RM
    IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2004, 12 (05): : 489 - 498