Speaker based Language Independent Isolated Speech Recognition System

被引:0
|
作者
Therese, Shanthi S. [1 ]
Lingam, Chelpa [2 ]
机构
[1] Univ Mumbai, Thadomal Shahani Engn Coll, Bandra W, India
[2] Univ Mumbai, Coll Engn & Technol, Rasayani, India
关键词
K-Means Algorithm; Mel Frequency Cepstral Coefficients (MFCC); Euclidean Distance; Pitch Contour;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
This paper presents a speaker based Language Independent Isolated Speech Recognition System (LIISRS). The most popular feature extraction technique Mel Frequency Cepstral Coefficients (MFCC) is used for training the system. Representative specific features are identified using K-Means algorithm. Distortion measure is calculated using Euclidian distance function. Pitch contour characteristics are used to identify the language specific features. Decision rules are formed to recognize language and speech of the given input. Thus, the proposed system not only recognizes the speech but also the language in which the speech is uttered. The result shows a satisfactory performance when the training is carried using native language speakers. Digits from one to ten of seven different languages are taken as training samples. Results obtained using 12 MFCC features for overall word level accuracy is 90.02% and language recognition accuracy is 97.14%.
引用
收藏
页数:7
相关论文
共 50 条
  • [31] SPEAKER INDEPENDENT RECOGNITION OF ISOLATED SPANISH DIGITS
    1600, (The International Society for Computers and Their Applications (ISCA)):
  • [32] Low-cost speech recognition system for small vocabulary and speaker independent
    Teh, CC
    Jong, CC
    Siek, L
    DESIGN, MODELING AND SIMULATION IN MICROELECTRONICS, 2000, 4228 : 208 - 211
  • [33] SPEAKER INDEPENDENT ISOLATED WORD RECOGNITION BASED ON STOCHASTIC WORD MODELS
    EULER, S
    AEU-ARCHIV FUR ELEKTRONIK UND UBERTRAGUNGSTECHNIK-INTERNATIONAL JOURNAL OF ELECTRONICS AND COMMUNICATIONS, 1989, 43 (05): : 303 - 307
  • [34] On Speaker-Independent, Speaker-Dependent, and Speaker-Adaptive Speech Recognition
    Huang, Xuedong
    Lee, Kai-Fu
    IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1993, 1 (02): : 150 - 157
  • [35] Speaker Independent Sinhala Speech Recognition for Voice Dialling
    Amarasingh, W. G. T. N.
    Gamini, D. D. A.
    INTERNATIONAL CONFERENCE ON ADVANCES IN ICT FOR EMERGING REGIONS (ICTER2012), 2012, : 3 - 6
  • [36] Speaker independent audio-visual speech recognition
    Zhang, Y
    Levinson, S
    Huang, T
    2000 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, PROCEEDINGS VOLS I-III, 2000, : 1073 - 1076
  • [37] Japanese Speaker-Independent Homonyms Speech Recognition
    Murakami, Jin'ichi
    Hotta, Haseo
    COMPUTATIONAL LINGUISTICS AND RELATED FIELDS, 2011, 27 : 306 - 313
  • [38] PREDICTOR CODEBOOK FOR SPEAKER-INDEPENDENT SPEECH RECOGNITION
    KAWABATA, T
    SYSTEMS AND COMPUTERS IN JAPAN, 1994, 25 (01) : 37 - 46
  • [39] Speaker independent speech emotion recognition by ensemble classification
    Schuller, B
    Reiter, S
    Müller, R
    Al-Hames, M
    Lang, M
    Rigoll, G
    2005 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), VOLS 1 AND 2, 2005, : 865 - 868
  • [40] Predictor codebook for speaker-independent speech recognition
    Kawabata, Takeshi
    Systems and Computers in Japan, 1994, 25 (01): : 37 - 46