Speaker based Language Independent Isolated Speech Recognition System

Cited by: 0

Authors:

Therese, Shanthi S. [1]

Lingam, Chelpa [2]

Affiliations:

[1] Univ Mumbai, Thadomal Shahani Engn Coll, Bandra W, India

[2] Univ Mumbai, Coll Engn & Technol, Rasayani, India

Source:

2015 INTERNATIONAL CONFERENCE ON COMMUNICATION, INFORMATION & COMPUTING TECHNOLOGY (ICCICT) | 2015

Keywords:

K-Means Algorithm; Mel Frequency Cepstral Coefficients (MFCC); Euclidean Distance; Pitch Contour;

DOI:

Not available

Chinese Library Classification (CLC) number:

TM [Electrical Engineering]; TN [Electronics and Communication Technology];

Discipline classification code:

0808 ; 0809 ;

Abstract:

This paper presents a speaker-based Language Independent Isolated Speech Recognition System (LIISRS). Mel Frequency Cepstral Coefficients (MFCC), the most popular feature extraction technique, are used to train the system. Representative specific features are identified using the K-Means algorithm, and the distortion measure is calculated using the Euclidean distance function. Pitch contour characteristics are used to identify language-specific features. Decision rules are formed to recognize the language and the speech of a given input. Thus, the proposed system recognizes not only the speech but also the language in which it is uttered. The results show satisfactory performance when training is carried out with native-language speakers. The digits one to ten in seven different languages are used as training samples. With 12 MFCC features, the overall word-level accuracy is 90.02% and the language recognition accuracy is 97.14%.
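The K-Means and Euclidean-distance steps mentioned in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function names (`train_codebook`, `distortion`, `recognize`), the codebook size `k=4`, and the synthetic 12-dimensional vectors standing in for real MFCC frames are all assumptions made for illustration. The pitch-contour language features and the decision rules are not covered, since the abstract does not specify them.

```python
import numpy as np

def train_codebook(features, k, iters=20, seed=0):
    """Plain Lloyd's K-Means: return k centroids for an (n_frames, n_dims) array."""
    rng = np.random.default_rng(seed)
    # Start from k distinct frames chosen at random (fancy indexing copies them).
    centroids = features[rng.choice(len(features), size=k, replace=False)]
    for _ in range(iters):
        # Euclidean distance from every frame to every centroid.
        dists = np.linalg.norm(features[:, None, :] - centroids[None, :, :], axis=2)
        labels = dists.argmin(axis=1)
        for j in range(k):
            members = features[labels == j]
            if len(members):  # keep the old centroid if a cluster is empty
                centroids[j] = members.mean(axis=0)
    return centroids

def distortion(features, codebook):
    """Mean Euclidean distance from each frame to its nearest codeword."""
    dists = np.linalg.norm(features[:, None, :] - codebook[None, :, :], axis=2)
    return dists.min(axis=1).mean()

def recognize(features, codebooks):
    """Return the label whose codebook gives the lowest distortion."""
    return min(codebooks, key=lambda label: distortion(features, codebooks[label]))

# Synthetic stand-ins for 12-dimensional MFCC frames of two hypothetical words.
rng = np.random.default_rng(42)
train = {
    "one": rng.normal(loc=0.0, scale=1.0, size=(200, 12)),
    "two": rng.normal(loc=5.0, scale=1.0, size=(200, 12)),
}
codebooks = {label: train_codebook(x, k=4) for label, x in train.items()}

test_utterance = rng.normal(loc=5.0, scale=1.0, size=(50, 12))
print(recognize(test_utterance, codebooks))
```

In a real system the synthetic arrays would be replaced by MFCC frames extracted from recorded utterances, with one codebook trained per word (and, presumably, per language).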

Pages: 7
