Hands-free speech recognition and communication on PDAS using microphone array technology

被引：0

作者：

Herbordt, W ^{[1
]}

Horiuchi, T ^{[1
]}

Fujimoto, M ^{[1
]}

Jitsuhiro, T ^{[1
]}

Nakamura, S ^{[1
]}

机构：

[1] ATR, Spoken Language Commun Res Labs, Kyoto 6190288, Japan

来源：

2005 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING (ASRU) | 2005年

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In this paper, a personal digital assistant (PDA) for handsfree speech recognition and communication with a microphone array mounted on the PDA is presented. An outlier-robust generalized sidelobe canceller (RGSC) and a minimum mean-squared error (MMSE) estimator for log Mel-spectral energy coefficients using a Gaussian mixture model (GMM) for clean speech are implemented in real-time and evaluated for speech recognition based on a small experimental multichannel database. It is shown that the joint system of beamformer and single-channel noise suppression highly improves the noise-robustness of a large-vocabulary speech recognizer so that down to SNR= 5 dB more than 91 % word accuracy is obtained.

引用

页码：302 / 307

页数：6

共 50 条

[21] Defeating reverberation: Advanced dereverberation and recognition techniques for hands-free speech recognition
Delcroix, Marc
Yoshioka, Takuya
Ogawa, Atsunori
Kubo, Yotaro
Fujimoto, Masakiyo
Ito, Nobutaka
Kinoshita, Keisuke
Espi, Miquel
Araki, Shoko
Hori, Takaaki
Nakatani, Tomohiro
2014 IEEE GLOBAL CONFERENCE ON SIGNAL AND INFORMATION PROCESSING (GLOBALSIP), 2014, : 522 - 526
[22] Optimized second-order gradient microphone for hands-free speech recordings in cars
Aubauer, R
Leckschat, D
SPEECH COMMUNICATION, 2001, 34 (1-2) : 13 - 23
[23] Stereophonic hands-free communication system based on microphone array fixed beamforming: Real-time implementation and evaluation
Pirro, Matteo
Squartini, Stefano
Romoli, Laura
Piazza, Francesco
Eurasip Journal on Audio, Speech, and Music Processing, 2012, 2012 (01)
[24] Hands-free speech recognition using a filtered clean corpus and incremental HMM adaptation
Matassoni, M
Omologo, M
Giuliani, D
2000 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS, VOLS I-VI, 2000, : 1407 - 1410
[25] Stereophonic hands-free communication system based on microphone array fixed beamforming: real-time implementation and evaluation
Pirro, Matteo
Squartini, Stefano
Romoli, Laura
Piazza, Francesco
EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2012,
[26] Stereophonic hands-free communication system based on microphone array fixed beamforming: real-time implementation and evaluation
Matteo Pirro
Stefano Squartini
Laura Romoli
Francesco Piazza
EURASIP Journal on Audio, Speech, and Music Processing, 2012
[27] Energy-based speech enhancement technique for hands-free communication
Rahmani, M.
Yousefian, N.
Akbari, A.
ELECTRONICS LETTERS, 2009, 45 (01) : 85 - 86
[28] Experiments of in-car audio compensation for hands-free speech recognition
Matassoni, M
Omologo, M
Zieger, C
ASRU'03: 2003 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING ASRU '03, 2003, : 369 - 374
[29] IMPROVED HANDS-FREE AUTOMATIC SPEECH RECOGNITION IN REVERBERANT ENVIRONMENT CONDITION
Gomez, Randy
Nakamura, Keisuke
Mizumoto, Takeshi
Nakadai, Kazuhiro
2014 4TH JOINT WORKSHOP ON HANDS-FREE SPEECH COMMUNICATION AND MICROPHONE ARRAYS (HSCMA), 2014, : 67 - 71
[30] Likelihood-maximizing beamforming for robust hands-free speech recognition
Seltzer, ML
Raj, B
Stern, RM
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2004, 12 (05): : 489 - 498

← 1 2 3 4 5 →