Hands-free speech recognition and communication on PDAS using microphone array technology

被引:0
|
作者
Herbordt, W [1 ]
Horiuchi, T [1 ]
Fujimoto, M [1 ]
Jitsuhiro, T [1 ]
Nakamura, S [1 ]
机构
[1] ATR, Spoken Language Commun Res Labs, Kyoto 6190288, Japan
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, a personal digital assistant (PDA) for handsfree speech recognition and communication with a microphone array mounted on the PDA is presented. An outlier-robust generalized sidelobe canceller (RGSC) and a minimum mean-squared error (MMSE) estimator for log Mel-spectral energy coefficients using a Gaussian mixture model (GMM) for clean speech are implemented in real-time and evaluated for speech recognition based on a small experimental multichannel database. It is shown that the joint system of beamformer and single-channel noise suppression highly improves the noise-robustness of a large-vocabulary speech recognizer so that down to SNR= 5 dB more than 91 % word accuracy is obtained.
引用
收藏
页码:302 / 307
页数:6
相关论文
共 50 条
  • [31] Speech enhancement for hands-free terminals
    Grbic, N
    Nordholm, S
    Johansson, A
    ISPA 2001: PROCEEDINGS OF THE 2ND INTERNATIONAL SYMPOSIUM ON IMAGE AND SIGNAL PROCESSING AND ANALYSIS, 2001, : 435 - 440
  • [32] Study of microphone system for hands-free teleconferencing units
    Nakagawa, Akira
    Shimauchi, Suehiro
    Makino, Shoji
    Journal of the Acoustical Society of Japan (E) (English translation of Nippon Onkyo Gakkaishi), 2000, 21 (01): : 33 - 35
  • [33] HANDS-FREE SPEECH RECOGNITION CHALLENGE FOR REAL-WORLD SPEECH DIALOGUE SYSTEMS
    Saruwatari, Hiroshi
    Kawanami, Hiromichi
    Takeuchi, Shota
    Takahashi, Yu
    Cincarek, Tobias
    Shikano, Kiyohiro
    2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, : 3729 - 3732
  • [34] Noise-robust hands-free speech recognition based on spatial subtraction array and known noise superimposition
    Ohashi, Y
    Nishikawa, T
    Saruwatari, H
    Lee, A
    Shikano, K
    2005 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS, VOLS 1-4, 2005, : 533 - 537
  • [35] INTELLIGIBILITY OF ELECTROLARYNX SPEECH USING A NOVEL HANDS-FREE ACTUATOR
    Madden, Brian
    Nolan, Mark
    Burke, Edward
    Condron, James
    Coyle, Eugene
    BIOSIGNALS 2011, 2011, : 265 - 269
  • [36] Hands-free Communication Devices for Construction
    Moore, Bill
    Liu, Junshan
    Williams, Steve
    PROCEEDINGS OF CRIOCM 2008 INTERNATIONAL RESEARCH SYMPOSIUM ON ADVANCES OF CONSTRUCTION MANAGEMENT AND REAL ESTATE, 2008, : 251 - 257
  • [37] Mobile, Hands-free, Silent Speech Texting Using SilentSpeller
    Kimura, Naoki
    Gemicioglu, Tan
    Womack, Jon
    Li, Richard
    Zhao, Yuhui
    Bedri, Abdelkareem
    Olwal, Alex
    Rekimoto, Jun
    Starner, Thad
    EXTENDED ABSTRACTS OF THE 2021 CHI CONFERENCE ON HUMAN FACTORS IN COMPUTING SYSTEMS (CHI'21), 2021,
  • [38] Hands-free Voice Communication with TV
    Papp, Istvan I.
    Saric, Zoran M.
    Teslic, Nikola Dj
    IEEE TRANSACTIONS ON CONSUMER ELECTRONICS, 2011, 57 (02) : 606 - 614
  • [39] Distant Speech Recognition Using a Microphone Array Network
    Nakano, Alberto Yoshihiro
    Nakagawa, Seiichi
    Yamamoto, Kazumasa
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2010, E93D (09): : 2451 - 2462
  • [40] Transforming HMMs for speaker-independent hands-free speech recognition in the car
    Gong, Y.
    Godfrey, John J.
    ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, 1999, 1 : 297 - 300