Hands-free speech recognition and communication on PDAS using microphone array technology

被引：0

作者：

Herbordt, W ^{[1
]}

Horiuchi, T ^{[1
]}

Fujimoto, M ^{[1
]}

Jitsuhiro, T ^{[1
]}

Nakamura, S ^{[1
]}

机构：

[1] ATR, Spoken Language Commun Res Labs, Kyoto 6190288, Japan

来源：

2005 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING (ASRU) | 2005年

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In this paper, a personal digital assistant (PDA) for handsfree speech recognition and communication with a microphone array mounted on the PDA is presented. An outlier-robust generalized sidelobe canceller (RGSC) and a minimum mean-squared error (MMSE) estimator for log Mel-spectral energy coefficients using a Gaussian mixture model (GMM) for clean speech are implemented in real-time and evaluated for speech recognition based on a small experimental multichannel database. It is shown that the joint system of beamformer and single-channel noise suppression highly improves the noise-robustness of a large-vocabulary speech recognizer so that down to SNR= 5 dB more than 91 % word accuracy is obtained.

引用

页码：302 / 307

页数：6

共 50 条

[41] Transforming HMMs for speaker-independent hands-free speech recognition in the car
Gong, Y
Godfrey, JJ
ICASSP '99: 1999 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS VOLS I-VI, 1999, : 297 - 300
[42] Microphone array system for speech recognition
Kiyohara, K
Kaneda, Y
Takahashi, S
Nomura, H
Kojima, J
1997 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I - V: VOL I: PLENARY, EXPERT SUMMARIES, SPECIAL, AUDIO, UNDERWATER ACOUSTICS, VLSI; VOL II: SPEECH PROCESSING; VOL III: SPEECH PROCESSING, DIGITAL SIGNAL PROCESSING; VOL IV: MULTIDIMENSIONAL SIGNAL PROCESSING, NEURAL NETWORKS, 1997, : 215 - 218
[43] Using hands-free technology in programs for profoundly disabled children
Eachus, HT
Junker, AM
INTERNATIONAL CONFERENCE ON COMPUTING AND INFORMATION TECHNOLOGIES : EXPLORING EMERGING TECHNOLOGIES, 2001, : 111 - 117
[44] A Hands-free Communication Solution for Wearable Devices
Sun, Yixin
Tao, Yudong
Hu, Zhi
Fan, Hao
Wang, Yuwei
2014 IEEE Healthcare Innovation Conference (HIC), 2014, : 75 - 78
[45] Speech recognition in cars by speaker localization using microphone array
Kondo, Keisuke
Nagai, Takayuki
Kaneko, Masahide
Kurematsu, Akira
Systems and Computers in Japan, 2003, 34 (08) : 1 - 12
[46] Frame-synchronous noise compensation for hands-free speech recognition in car environments
Chien, JT
Lin, MS
IEE PROCEEDINGS-VISION IMAGE AND SIGNAL PROCESSING, 2000, 147 (06): : 508 - 515
[47] Speech and Hands-free Interaction: Myths, Challenges, and Opportunities
Munteanu, Cosmin
Penn, Gerald
CHI 2018: EXTENDED ABSTRACTS OF THE 2018 CHI CONFERENCE ON HUMAN FACTORS IN COMPUTING SYSTEMS, 2018,
[48] Speech and Hands-free Interaction: Myths, Challenges, and Opportunities
Munteanu, Cosmin
Penn, Gerald
PROCEEDINGS OF THE 19TH INTERNATIONAL CONFERENCE ON HUMAN-COMPUTER INTERACTION WITH MOBILE DEVICES AND SERVICES (MOBILEHCI '17), 2017,
[49] HANDS-FREE SPEECH-SOUND INTERACTIONS AT HOME
Milhorat, P.
Istrate, D.
Boudy, J.
Chollet, G.
2012 PROCEEDINGS OF THE 20TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2012, : 1678 - 1682
[50] A robust speech detection algorithm for speech activated hands-free applications
Wu, D
Tanaka, M
Chen, R
Olorenshaw, L
Amador, M
Menendez-Pidal, X
ICASSP '99: 1999 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS VOLS I-VI, 1999, : 2407 - 2410

← 1 2 3 4 5 →