Dialectal Assamese Vowel Speech Detection using Acoustic Phonetic Features, KNN and RNN

被引:0
|
作者
Sharma, Mridusmita [1 ]
Sarma, Kandarpa Kumar [1 ]
机构
[1] Gauhati Univ, Dept Elect & Commun Engn, Gauhati 14, Assam, India
关键词
Vowels; Dialect; Acoustic Phonetic Features; Recurrent Neural Network (RNN); K-Nearest Neighbor (KNN; Recognition;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
The recognition of vowel phonemes plays an important role in the field of speech processing. Assamese is the major language of Assam and also the mother-tongue of the of the largest segment of the population of Assam. The standard Assamese language has four major dialects namely Central dialect, Eastern dialect, Goalpariya dialect and Kamrupi dialect. It has eight vowel phonemes which are /i/, /e/, /epsilon/, /a/, /n/, /c/, /o/ and /u/. In this paper, a comparative analysis between the Recurrent Neural Network (RNN) based algorithm and K-Nearest Neighbor (KNN) based algorithm is carried out for the recognition of the vowel sounds using the Acoustic Phonetic Features as the feature vector. Dialect wise recognition of the vowels is also carried out using the same feature vectors. A recognition rate of 97 % is obtained by using the KNN based algorithm for vowel recognition and an overall rate of 84.3% and 87% is obtained by RNN based algorithm and KNN based algorithm respectively for the dialectal Assamese vowel recognition. K-NN based approach gives better recognition rate than the ANN based approach.
引用
收藏
页码:674 / 678
页数:5
相关论文
共 50 条
  • [21] Speech synthesis of emotions using vowel features of a speaker
    Boku, K.
    Asada, T.
    Yoshitomi, Y.
    Tabuse, M.
    PROCEEDINGS OF THE EIGHTEENTH INTERNATIONAL SYMPOSIUM ON ARTIFICIAL LIFE AND ROBOTICS (AROB 18TH '13), 2013, : 176 - 179
  • [22] An Efficient Mispronunciation Detection System Using Discriminative Acoustic Phonetic Features for Arabic Consonants
    Maqsood, Muazzam
    Habib, Adnan
    Nawaz, Tabassam
    INTERNATIONAL ARAB JOURNAL OF INFORMATION TECHNOLOGY, 2019, 16 (02) : 242 - 250
  • [23] Speech Synthesis of Emotions in a Sentence using Vowel Features
    Makino, Rintaro
    Yoshitomi, Yasunari
    Asada, Taro
    Tabuse, Masayoshi
    JOURNAL OF ROBOTICS NETWORKING AND ARTIFICIAL LIFE, 2020, 7 (02): : 107 - 110
  • [24] Speech synthesis of emotions using vowel features of a speaker
    Boku, Kanu
    Asada, Taro
    Yoshitomi, Yasunari
    Tabuse, Masayoshi
    ARTIFICIAL LIFE AND ROBOTICS, 2014, 19 (01) : 27 - 32
  • [25] Hidden Markov model-based Assamese vowel phoneme recognition using cepstral features
    Department of Instrumentation, USIC, Gauhati University, Guwahati
    781 014, India
    不详
    781 001, India
    Int. J. Inf. Commun. Technol., 2-3 (218-234):
  • [26] ACOUSTIC-PHONETIC FEATURES OF STRESSED SYLLABLES IN SPEECH OF 3 YEAR OLDS
    HAWKINS, S
    ALLEN, G
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1978, 63 : S56 - S56
  • [27] Incorporating finer acoustic phonetic features in lexicon for Hindi language speech recognition
    Patil, Atul
    More, Prashant
    Sasikumar, M.
    JOURNAL OF INFORMATION & OPTIMIZATION SCIENCES, 2019, 40 (08): : 1731 - 1739
  • [28] Speech spoofing detection using SVM and ELM technique with acoustic features
    Rahmeni, Raoudha
    Ben Aicha, Anis
    Ben Ayed, Yassine
    2020 5TH INTERNATIONAL CONFERENCE ON ADVANCED TECHNOLOGIES FOR SIGNAL AND IMAGE PROCESSING (ATSIP'2020), 2020,
  • [29] Speech Recognition of Assamese Numerals Using Combinations of LPC - Features and Heterogenous ANNs
    Sarma, Manash Pratim
    Sarma, Kandarpa Kumar
    INFORMATION AND COMMUNICATION TECHNOLOGIES, 2010, 101 : 8 - 12
  • [30] An analysis of general acoustic-phonetic features for Spanish speech produced with the Lombard effect
    Castellanos, A
    Benedi, JM
    Casacuberta, F
    SPEECH COMMUNICATION, 1996, 20 (1-2) : 23 - 35