Speaker based Language Independent Isolated Speech Recognition System

被引:0
|
作者
Therese, Shanthi S. [1 ]
Lingam, Chelpa [2 ]
机构
[1] Univ Mumbai, Thadomal Shahani Engn Coll, Bandra W, India
[2] Univ Mumbai, Coll Engn & Technol, Rasayani, India
关键词
K-Means Algorithm; Mel Frequency Cepstral Coefficients (MFCC); Euclidean Distance; Pitch Contour;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
This paper presents a speaker based Language Independent Isolated Speech Recognition System (LIISRS). The most popular feature extraction technique Mel Frequency Cepstral Coefficients (MFCC) is used for training the system. Representative specific features are identified using K-Means algorithm. Distortion measure is calculated using Euclidian distance function. Pitch contour characteristics are used to identify the language specific features. Decision rules are formed to recognize language and speech of the given input. Thus, the proposed system not only recognizes the speech but also the language in which the speech is uttered. The result shows a satisfactory performance when the training is carried using native language speakers. Digits from one to ten of seven different languages are taken as training samples. Results obtained using 12 MFCC features for overall word level accuracy is 90.02% and language recognition accuracy is 97.14%.
引用
收藏
页数:7
相关论文
共 50 条
  • [1] Speaker Independent Isolated Speech Recognition System for Tamil Language using HMM
    Vimala, C.
    Radha, V.
    INTERNATIONAL CONFERENCE ON COMMUNICATION TECHNOLOGY AND SYSTEM DESIGN 2011, 2012, 30 : 1097 - 1102
  • [2] Designing an independent speaker isolated Speech Recognition System on an FPGA.
    Gonzalez-Concejero, C.
    Rodellar, V.
    Alvarez-Marquina, A.
    de Icaya, E. Martinez
    Gomez-Vilda, P.
    PRIME 2006: 2ND CONFERENCE ON PH.D. RESEARCH IN MICROELECTRONIC AND ELECTRONICS, PROCEEDINGS, 2006, : 81 - +
  • [3] Speaker independent speech recognition system based on phoneme identification
    Maheswari, N. Uma
    Kabilan, A. P.
    Venkatesh, R.
    ICCN: 2008 INTERNATIONAL CONFERENCE ON COMPUTING, COMMUNICATION AND NETWORKING, 2008, : 585 - +
  • [4] Speaker Independent Speech Recognition System.
    Trnka, R.
    1984, (16):
  • [5] Speaker Independent Speech Recognition Implementation with Adaptive Language Models
    Anukriti
    Tiwari, Sushant
    Chatterjee, Tanmay
    Bhattacharya, Mahua
    2013 INTERNATIONAL SYMPOSIUM ON COMPUTATIONAL AND BUSINESS INTELLIGENCE (ISCBI), 2013, : 7 - 10
  • [6] SPEAKER-INDEPENDENT SPEECH-RECOGNITION SYSTEM BASED ON LINEAR PREDICTION
    GUPTA, VN
    BRYAN, JK
    GOWDY, JN
    IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1978, 26 (01): : 27 - 33
  • [7] SOC implementation of IINM based speaker independent isolated digit recognition system
    Amudha, V.
    Venkataramani, B.
    Kumar, R. Vinoth
    Ravishankar, S.
    20TH INTERNATIONAL CONFERENCE ON VLSI DESIGN, PROCEEDINGS: TECHNOLOGY CHALLENGES IN THE NANOELECTRONICS ERA, 2007, : 848 - +
  • [8] Uighur speaker-independent speech recognition based on CDCPM
    Wang, K.L.
    2001, Science Press (38):
  • [9] Graph Learning Based Speaker Independent Speech Emotion Recognition
    Xu, Xinzhou
    Huang, Chengwei
    Wu, Chen
    Wang, Qingyun
    Zhao, Li
    ADVANCES IN ELECTRICAL AND COMPUTER ENGINEERING, 2014, 14 (02) : 17 - 22
  • [10] Speaker-and language-independent speech recognition in mobile communication systems
    Viikki, I
    Kiss, I
    Tian, J
    2001 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-VI, PROCEEDINGS: VOL I: SPEECH PROCESSING 1; VOL II: SPEECH PROCESSING 2 IND TECHNOL TRACK DESIGN & IMPLEMENTATION OF SIGNAL PROCESSING SYSTEMS NEURALNETWORKS FOR SIGNAL PROCESSING; VOL III: IMAGE & MULTIDIMENSIONAL SIGNAL PROCESSING MULTIMEDIA SIGNAL PROCESSING - VOL IV: SIGNAL PROCESSING FOR COMMUNICATIONS; VOL V: SIGNAL PROCESSING EDUCATION SENSOR ARRAY & MULTICHANNEL SIGNAL PROCESSING AUDIO & ELECTROACOUSTICS; VOL VI: SIGNAL PROCESSING THEORY & METHODS STUDENT FORUM, 2001, : 5 - 8