Subspace-based speaker-independent vowel recognition

Cited by: 0

Authors:

Muralishankar, R [1]

O'Shaughnessy, D [1]

Affiliations:

[1] Univ Quebec, INRS EMT, Quebec City, PQ, Canada

Source:

2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING | 2005

Keywords:

DOI:

Not available

Chinese Library Classification number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

In this paper, we present a subspace-based approach to speaker-independent vowel recognition. Five vowels (/aa/, /eh/, /iy/, /ow/ and /uw/) from the TIMIT database were considered for the task. The subspaces representing two different vowel classes may share a large common subspace because of speaker variability, noise and coarticulation. We use common principal component (CPC) analysis [1] and its extension, partial common principal component (pCPC) analysis, to obtain a specific subspace for each vowel that is insensitive to these variations. We perform CPC analysis on the covariance matrices of the vowels. pCPC gives q eigenvectors that are common to all vowels and (p - q) vowel-specific eigenvectors, where p is the dimension of the feature vector. For each value of q, vowel-specific subspaces are obtained. An input vector from an unknown vowel is assigned to the vowel whose specific subspace yields the longest projection of that vector. We chose 18-dimensional mel-frequency cepstral coefficients (MFCCs) as the features for the recognition task. The specific subspace is treated as a transformation matrix that enhances the vowel-specific information in the feature vector and, in turn, increases the signal-to-noise ratio. Recognition experiments were performed on vowels extracted from a multiple-speaker set drawn from different dialect regions of the TIMIT database. Results for each vowel-specific subspace are presented for values of q ranging from 1 to 5. The results are encouraging in the context of a speaker-independent framework. Visual analysis of the vowel basis spectra provides useful information by highlighting the importance of different frequency regions.
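The decision rule described in the abstract (assign the input to the vowel whose specific subspace gives the longest projection) can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes that each vowel-specific subspace is already available as a matrix with orthonormal columns, and it does not perform the CPC or pCPC estimation itself. The function names, the toy dimension (4 instead of 18) and the toy bases are hypothetical and chosen only for illustration.

```python
import numpy as np


def projection_length(x, basis):
    """Euclidean length of the orthogonal projection of x onto span(basis).

    basis is a (p, k) array whose columns are assumed to be orthonormal,
    so the projection length equals the norm of the coefficients basis.T @ x.
    """
    return float(np.linalg.norm(basis.T @ x))


def classify(x, specific_bases):
    """Return the label whose specific subspace gives the longest projection."""
    return max(
        specific_bases,
        key=lambda label: projection_length(x, specific_bases[label]),
    )


# Toy data (hypothetical, not taken from the paper): two classes in 4 dimensions,
# each with a 2-dimensional specific subspace spanned by coordinate axes.
eye = np.eye(4)
specific_bases = {
    "aa": eye[:, [0, 1]],
    "iy": eye[:, [2, 3]],
}

x = np.array([0.9, 0.3, 0.1, 0.2])
predicted = classify(x, specific_bases)
print(predicted)
```

In an actual experiment, the bases would presumably hold the (p - q) vowel-specific eigenvectors obtained from pCPC for a given q, and x would be an 18-dimensional MFCC vector.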


Pages: 549-552

Page count: 4

