Speaker based Language Independent Isolated Speech Recognition System

Cited by: 0

Authors:

Therese, Shanthi S. [1]

Lingam, Chelpa [2]

Affiliations:

[1] Univ Mumbai, Thadomal Shahani Engn Coll, Bandra W, India

[2] Univ Mumbai, Coll Engn & Technol, Rasayani, India

Source:

2015 INTERNATIONAL CONFERENCE ON COMMUNICATION, INFORMATION & COMPUTING TECHNOLOGY (ICCICT) | 2015

Keywords:

K-Means Algorithm; Mel Frequency Cepstral Coefficients (MFCC); Euclidean Distance; Pitch Contour;

DOI:

Not available

Chinese Library Classification:

TM [Electrical Engineering]; TN [Electronics and Communication Technology];

Discipline classification codes:

0808 ; 0809 ;

Abstract:

This paper presents a speaker-based Language Independent Isolated Speech Recognition System (LIISRS). Mel Frequency Cepstral Coefficients (MFCC), the most popular feature extraction technique, are used for training the system. Representative features are identified using the K-Means algorithm. The distortion measure is calculated using the Euclidean distance function. Pitch contour characteristics are used to identify language-specific features. Decision rules are formed to recognize the language and the speech of the given input. Thus, the proposed system recognizes not only the speech but also the language in which it is uttered. The results show satisfactory performance when training is carried out with native speakers of each language. The digits one to ten in seven different languages are used as training samples. Using 12 MFCC features, the overall word-level accuracy is 90.02% and the language recognition accuracy is 97.14%.
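The abstract's combination of K-Means and Euclidean distortion can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not give the codebook size, the decision rules, or how pitch contours are used, so the codebook size of 4, the helper names (`kmeans`, `distortion`, `recognize`, `synthetic_frames`) and the synthetic 12-dimensional vectors standing in for real MFCC frames are all assumptions made for illustration. Pitch-based language identification is omitted.

```python
import math
import random


def euclidean(a, b):
    """Euclidean distance between two equal-length feature vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def kmeans(vectors, k, iters=20, seed=0):
    """Plain K-Means: returns k centroids (a codebook) for the given vectors."""
    rng = random.Random(seed)
    centroids = [list(v) for v in rng.sample(vectors, k)]
    for _ in range(iters):
        clusters = [[] for _ in range(k)]
        for v in vectors:
            nearest = min(range(k), key=lambda i: euclidean(v, centroids[i]))
            clusters[nearest].append(v)
        for i, members in enumerate(clusters):
            if members:  # an empty cluster keeps its previous centroid
                centroids[i] = [sum(col) / len(members) for col in zip(*members)]
    return centroids


def distortion(vectors, codebook):
    """Mean distance from each frame to its nearest codebook entry."""
    return sum(min(euclidean(v, c) for c in codebook) for v in vectors) / len(vectors)


def recognize(vectors, codebooks):
    """Pick the label whose codebook gives the lowest distortion."""
    return min(codebooks, key=lambda label: distortion(vectors, codebooks[label]))


def synthetic_frames(center, n, rng, dim=12):
    """Hypothetical stand-in for 12-coefficient MFCC frames of one word."""
    return [[center + rng.gauss(0, 0.1) for _ in range(dim)] for _ in range(n)]


rng = random.Random(1)
training = {
    "one": synthetic_frames(0.0, 40, rng),
    "two": synthetic_frames(5.0, 40, rng),
}
# One codebook per word; k=4 is an assumed size, not taken from the paper.
codebooks = {word: kmeans(frames, k=4) for word, frames in training.items()}

test_frames = synthetic_frames(5.0, 20, rng)
print(recognize(test_frames, codebooks))
```

In a real system, the synthetic frames would be replaced by MFCC vectors extracted from recorded utterances, with one codebook trained per word (and, in the paper's setting, per language).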


Pages: 7
