Subspace-based speaker-independent vowel recognition

被引:0
|
作者
Muralishankar, R [1 ]
O'Shaughnessy, D [1 ]
机构
[1] Univ Quebec, INRS EMT, Quebec City, PQ, Canada
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we present a subspace-based approach for speaker-independent vowel recognition. Five vowels (/aa/,/eh/,/iy/,/ow/ and /uw/) from the TIMIT database were considered for the task. The subspaces representing two different vowel classes may have a large common subspace due to speaker variability, noise and coarticulation. We use common principal component (CPC) [1] and its extension i.e., partial-Common principal component (pCPC) to obtain a specific subspace for each vowel which is insensitive to variations. We perform CPC analysis on the covariance matrices of the vowels. pCPC gives q eigenvectors which are common to all vowels and (p - q) vowel specific eigenvectors. For each value of q, vowel specific subspaces are obtained. An input vector from an unknown vowel is classified based on the maximum length of its projection on the specific subspaces. We have choosen 18-dimensional Mel-Frequency Cepstral coefficients as a feature in our recognition task. The specific subspace is treated as a transformation matrix which enhances the vowel-specific information in the feature vector and, inturn. increases signal-to-noise ratio. Recognition experiments were performed on vowels extracted from a multiple speaker set taken from different dialect regions in the TIMIT database. Results for each vowel-specific subspace are presented for different values of q ranging from 1 to 5. The results are encouraging in the context of a speaker-independent framework. Visual Analysis of the vowel basis spectra provides useful and interesting information by highlighting the importance of different frequency regions.
引用
收藏
页码:549 / 552
页数:4
相关论文
共 50 条
  • [21] Speaker-independent Malay isolated sounds recognition
    Ting, HN
    Yunus, J
    Wong, LC
    ICONIP'02: PROCEEDINGS OF THE 9TH INTERNATIONAL CONFERENCE ON NEURAL INFORMATION PROCESSING: COMPUTATIONAL INTELLIGENCE FOR THE E-AGE, 2002, : 2405 - 2408
  • [22] Speaker-independent Mandarin polysyllabic word recognition
    Chang, HY
    Chen, B
    Chou, CS
    Liu, CM
    ISSPA 96 - FOURTH INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND ITS APPLICATIONS, PROCEEDINGS, VOLS 1 AND 2, 1996, : 329 - 332
  • [23] PREDICTOR CODEBOOK FOR SPEAKER-INDEPENDENT SPEECH RECOGNITION
    KAWABATA, T
    SYSTEMS AND COMPUTERS IN JAPAN, 1994, 25 (01) : 37 - 46
  • [24] Japanese Speaker-Independent Homonyms Speech Recognition
    Murakami, Jin'ichi
    Hotta, Haseo
    COMPUTATIONAL LINGUISTICS AND RELATED FIELDS, 2011, 27 : 306 - 313
  • [25] A Method for Designing Neural Networks Using Nonlinear Multivariate Analysis: Application to Speaker-Independent Vowel Recognition
    Irino, Toshio
    Kawahara, Hideki
    NEURAL COMPUTATION, 1990, 2 (03) : 386 - 397
  • [26] HMM-based integrated method for speaker-independent speech recognition
    Tsinghua Univ, Beijing, China
    Int Conf Signal Process Proc, (613-616):
  • [27] On Speaker-Independent, Speaker-Dependent, and Speaker-Adaptive Speech Recognition
    Huang, Xuedong
    Lee, Kai-Fu
    IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1993, 1 (02): : 150 - 157
  • [28] DSP-based large vocabulary speaker-independent speech recognition
    Hirayama, H
    Yoshida, K
    Koga, S
    Hattori, H
    NEC RESEARCH & DEVELOPMENT, 1996, 37 (04): : 528 - 534
  • [29] SPEAKER-INDEPENDENT SPEECH-RECOGNITION SYSTEM BASED ON LINEAR PREDICTION
    GUPTA, VN
    BRYAN, JK
    GOWDY, JN
    IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1978, 26 (01): : 27 - 33
  • [30] A HMM-based integrated method for speaker-independent speech recognition
    Zhang, YY
    Zhu, XY
    ICSP '98: 1998 FOURTH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, PROCEEDINGS, VOLS I AND II, 1998, : 613 - 616