Dialectal Assamese Vowel Speech Detection using Acoustic Phonetic Features, KNN and RNN

被引：0

作者：

Sharma, Mridusmita ^{[1
]}

Sarma, Kandarpa Kumar ^{[1
]}

机构：

[1] Gauhati Univ, Dept Elect & Commun Engn, Gauhati 14, Assam, India

来源：

2ND INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND INTEGRATED NETWORKS (SPIN) 2015 | 2015年

关键词：

Vowels; Dialect; Acoustic Phonetic Features; Recurrent Neural Network (RNN); K-Nearest Neighbor (KNN; Recognition;

D O I：

暂无

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

The recognition of vowel phonemes plays an important role in the field of speech processing. Assamese is the major language of Assam and also the mother-tongue of the of the largest segment of the population of Assam. The standard Assamese language has four major dialects namely Central dialect, Eastern dialect, Goalpariya dialect and Kamrupi dialect. It has eight vowel phonemes which are /i/, /e/, /epsilon/, /a/, /n/, /c/, /o/ and /u/. In this paper, a comparative analysis between the Recurrent Neural Network (RNN) based algorithm and K-Nearest Neighbor (KNN) based algorithm is carried out for the recognition of the vowel sounds using the Acoustic Phonetic Features as the feature vector. Dialect wise recognition of the vowels is also carried out using the same feature vectors. A recognition rate of 97 % is obtained by using the KNN based algorithm for vowel recognition and an overall rate of 84.3% and 87% is obtained by RNN based algorithm and KNN based algorithm respectively for the dialectal Assamese vowel recognition. K-NN based approach gives better recognition rate than the ANN based approach.

引用

页码：674 / 678

页数：5

共 50 条

[21] Speech synthesis of emotions using vowel features of a speaker
Boku, K.
Asada, T.
Yoshitomi, Y.
Tabuse, M.
PROCEEDINGS OF THE EIGHTEENTH INTERNATIONAL SYMPOSIUM ON ARTIFICIAL LIFE AND ROBOTICS (AROB 18TH '13), 2013, : 176 - 179
[22] An Efficient Mispronunciation Detection System Using Discriminative Acoustic Phonetic Features for Arabic Consonants
Maqsood, Muazzam
Habib, Adnan
Nawaz, Tabassam
INTERNATIONAL ARAB JOURNAL OF INFORMATION TECHNOLOGY, 2019, 16 (02) : 242 - 250
[23] Speech Synthesis of Emotions in a Sentence using Vowel Features
Makino, Rintaro
Yoshitomi, Yasunari
Asada, Taro
Tabuse, Masayoshi
JOURNAL OF ROBOTICS NETWORKING AND ARTIFICIAL LIFE, 2020, 7 (02): : 107 - 110
[24] Speech synthesis of emotions using vowel features of a speaker
Boku, Kanu
Asada, Taro
Yoshitomi, Yasunari
Tabuse, Masayoshi
ARTIFICIAL LIFE AND ROBOTICS, 2014, 19 (01) : 27 - 32
[25] Hidden Markov model-based Assamese vowel phoneme recognition using cepstral features
Department of Instrumentation, USIC, Gauhati University, Guwahati
781 014, India
不详
781 001, India
Int. J. Inf. Commun. Technol., 2-3 (218-234):
[26] ACOUSTIC-PHONETIC FEATURES OF STRESSED SYLLABLES IN SPEECH OF 3 YEAR OLDS
HAWKINS, S
ALLEN, G
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1978, 63 : S56 - S56
[27] Incorporating finer acoustic phonetic features in lexicon for Hindi language speech recognition
Patil, Atul
More, Prashant
Sasikumar, M.
JOURNAL OF INFORMATION & OPTIMIZATION SCIENCES, 2019, 40 (08): : 1731 - 1739
[28] Speech spoofing detection using SVM and ELM technique with acoustic features
Rahmeni, Raoudha
Ben Aicha, Anis
Ben Ayed, Yassine
2020 5TH INTERNATIONAL CONFERENCE ON ADVANCED TECHNOLOGIES FOR SIGNAL AND IMAGE PROCESSING (ATSIP'2020), 2020,
[29] Speech Recognition of Assamese Numerals Using Combinations of LPC - Features and Heterogenous ANNs
Sarma, Manash Pratim
Sarma, Kandarpa Kumar
INFORMATION AND COMMUNICATION TECHNOLOGIES, 2010, 101 : 8 - 12
[30] An analysis of general acoustic-phonetic features for Spanish speech produced with the Lombard effect
Castellanos, A
Benedi, JM
Casacuberta, F
SPEECH COMMUNICATION, 1996, 20 (1-2) : 23 - 35

← 1 2 3 4 5 →