Speaker based Language Independent Isolated Speech Recognition System

被引:0
|
作者
Therese, Shanthi S. [1 ]
Lingam, Chelpa [2 ]
机构
[1] Univ Mumbai, Thadomal Shahani Engn Coll, Bandra W, India
[2] Univ Mumbai, Coll Engn & Technol, Rasayani, India
关键词
K-Means Algorithm; Mel Frequency Cepstral Coefficients (MFCC); Euclidean Distance; Pitch Contour;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
This paper presents a speaker based Language Independent Isolated Speech Recognition System (LIISRS). The most popular feature extraction technique Mel Frequency Cepstral Coefficients (MFCC) is used for training the system. Representative specific features are identified using K-Means algorithm. Distortion measure is calculated using Euclidian distance function. Pitch contour characteristics are used to identify the language specific features. Decision rules are formed to recognize language and speech of the given input. Thus, the proposed system not only recognizes the speech but also the language in which the speech is uttered. The result shows a satisfactory performance when the training is carried using native language speakers. Digits from one to ten of seven different languages are taken as training samples. Results obtained using 12 MFCC features for overall word level accuracy is 90.02% and language recognition accuracy is 97.14%.
引用
收藏
页数:7
相关论文
共 50 条
  • [41] Speaker Independent Urdu Speech Recognition Using HMM
    Ashraf, Javed
    Iqbal, Naveed
    Khattak, Naveed Sarfraz
    Zaidi, Ather Mohsin
    NATURAL LANGUAGE PROCESSING AND INFORMATION SYSTEMS, 2010, 6177 : 140 - 148
  • [42] SPEAKER-INDEPENDENT VOWEL RECOGNITION IN PERSIAN SPEECH
    Nazari, Mohammad
    Sayadiyan, Abolghasem
    Valiollahzadeh, Seyyed Majid
    2008 3RD INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATION TECHNOLOGIES: FROM THEORY TO APPLICATIONS, VOLS 1-5, 2008, : 672 - 676
  • [43] Speaker Authentication System Based on Voice Biometrics and Speech Recognition
    Dovydaitis, Laurynas
    Rasymas, Tomas
    Rudzionis, Vytautas
    BUSINESS INFORMATION SYSTEMS WORKSHOPS, BIS 2016, 2017, 263 : 79 - 84
  • [44] An efficient algorithm for recognition of emotions from speaker and language independent speech using deep learning
    Youddha Beer Singh
    Shivani Goel
    Multimedia Tools and Applications, 2021, 80 : 14001 - 14018
  • [45] Language and Speaker-Independent Feature Transformation for End-to-End Multilingual Speech Recognition
    Hayakawa, Tomoaki
    Leow, Chee Siang
    Kobayashi, Akio
    Utsuro, Takehito
    Nishizaki, Hiromitsu
    INTERSPEECH 2021, 2021, : 2431 - 2435
  • [46] Emotional Speech Clustering based Robust Speaker Recognition System
    Li, Dongdong
    Yang, Yingchun
    PROCEEDINGS OF THE 2009 2ND INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING, VOLS 1-9, 2009, : 4576 - +
  • [47] An efficient algorithm for recognition of emotions from speaker and language independent speech using deep learning
    Singh, Youddha Beer
    Goel, Shivani
    MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (09) : 14001 - 14018
  • [48] DSP-based large vocabulary speaker-independent speech recognition
    Hirayama, H
    Yoshida, K
    Koga, S
    Hattori, H
    NEC RESEARCH & DEVELOPMENT, 1996, 37 (04): : 528 - 534
  • [49] A HMM-based integrated method for speaker-independent speech recognition
    Zhang, YY
    Zhu, XY
    ICSP '98: 1998 FOURTH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, PROCEEDINGS, VOLS I AND II, 1998, : 613 - 616
  • [50] HMM-based integrated method for speaker-independent speech recognition
    Tsinghua Univ, Beijing, China
    Int Conf Signal Process Proc, (613-616):