Hands-free speech recognition and communication on PDAS using microphone array technology

被引:0
|
作者
Herbordt, W [1 ]
Horiuchi, T [1 ]
Fujimoto, M [1 ]
Jitsuhiro, T [1 ]
Nakamura, S [1 ]
机构
[1] ATR, Spoken Language Commun Res Labs, Kyoto 6190288, Japan
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, a personal digital assistant (PDA) for handsfree speech recognition and communication with a microphone array mounted on the PDA is presented. An outlier-robust generalized sidelobe canceller (RGSC) and a minimum mean-squared error (MMSE) estimator for log Mel-spectral energy coefficients using a Gaussian mixture model (GMM) for clean speech are implemented in real-time and evaluated for speech recognition based on a small experimental multichannel database. It is shown that the joint system of beamformer and single-channel noise suppression highly improves the noise-robustness of a large-vocabulary speech recognizer so that down to SNR= 5 dB more than 91 % word accuracy is obtained.
引用
收藏
页码:302 / 307
页数:6
相关论文
共 50 条
  • [41] Transforming HMMs for speaker-independent hands-free speech recognition in the car
    Gong, Y
    Godfrey, JJ
    ICASSP '99: 1999 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS VOLS I-VI, 1999, : 297 - 300
  • [42] Microphone array system for speech recognition
    Kiyohara, K
    Kaneda, Y
    Takahashi, S
    Nomura, H
    Kojima, J
    1997 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I - V: VOL I: PLENARY, EXPERT SUMMARIES, SPECIAL, AUDIO, UNDERWATER ACOUSTICS, VLSI; VOL II: SPEECH PROCESSING; VOL III: SPEECH PROCESSING, DIGITAL SIGNAL PROCESSING; VOL IV: MULTIDIMENSIONAL SIGNAL PROCESSING, NEURAL NETWORKS, 1997, : 215 - 218
  • [43] Using hands-free technology in programs for profoundly disabled children
    Eachus, HT
    Junker, AM
    INTERNATIONAL CONFERENCE ON COMPUTING AND INFORMATION TECHNOLOGIES : EXPLORING EMERGING TECHNOLOGIES, 2001, : 111 - 117
  • [44] A Hands-free Communication Solution for Wearable Devices
    Sun, Yixin
    Tao, Yudong
    Hu, Zhi
    Fan, Hao
    Wang, Yuwei
    2014 IEEE Healthcare Innovation Conference (HIC), 2014, : 75 - 78
  • [45] Speech recognition in cars by speaker localization using microphone array
    Kondo, Keisuke
    Nagai, Takayuki
    Kaneko, Masahide
    Kurematsu, Akira
    Systems and Computers in Japan, 2003, 34 (08) : 1 - 12
  • [46] Frame-synchronous noise compensation for hands-free speech recognition in car environments
    Chien, JT
    Lin, MS
    IEE PROCEEDINGS-VISION IMAGE AND SIGNAL PROCESSING, 2000, 147 (06): : 508 - 515
  • [47] Speech and Hands-free Interaction: Myths, Challenges, and Opportunities
    Munteanu, Cosmin
    Penn, Gerald
    CHI 2018: EXTENDED ABSTRACTS OF THE 2018 CHI CONFERENCE ON HUMAN FACTORS IN COMPUTING SYSTEMS, 2018,
  • [48] Speech and Hands-free Interaction: Myths, Challenges, and Opportunities
    Munteanu, Cosmin
    Penn, Gerald
    PROCEEDINGS OF THE 19TH INTERNATIONAL CONFERENCE ON HUMAN-COMPUTER INTERACTION WITH MOBILE DEVICES AND SERVICES (MOBILEHCI '17), 2017,
  • [49] HANDS-FREE SPEECH-SOUND INTERACTIONS AT HOME
    Milhorat, P.
    Istrate, D.
    Boudy, J.
    Chollet, G.
    2012 PROCEEDINGS OF THE 20TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2012, : 1678 - 1682
  • [50] A robust speech detection algorithm for speech activated hands-free applications
    Wu, D
    Tanaka, M
    Chen, R
    Olorenshaw, L
    Amador, M
    Menendez-Pidal, X
    ICASSP '99: 1999 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS VOLS I-VI, 1999, : 2407 - 2410