Dialectal Assamese Vowel Speech Detection using Acoustic Phonetic Features, KNN and RNN

被引:0
|
作者
Sharma, Mridusmita [1 ]
Sarma, Kandarpa Kumar [1 ]
机构
[1] Gauhati Univ, Dept Elect & Commun Engn, Gauhati 14, Assam, India
关键词
Vowels; Dialect; Acoustic Phonetic Features; Recurrent Neural Network (RNN); K-Nearest Neighbor (KNN; Recognition;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
The recognition of vowel phonemes plays an important role in the field of speech processing. Assamese is the major language of Assam and also the mother-tongue of the of the largest segment of the population of Assam. The standard Assamese language has four major dialects namely Central dialect, Eastern dialect, Goalpariya dialect and Kamrupi dialect. It has eight vowel phonemes which are /i/, /e/, /epsilon/, /a/, /n/, /c/, /o/ and /u/. In this paper, a comparative analysis between the Recurrent Neural Network (RNN) based algorithm and K-Nearest Neighbor (KNN) based algorithm is carried out for the recognition of the vowel sounds using the Acoustic Phonetic Features as the feature vector. Dialect wise recognition of the vowels is also carried out using the same feature vectors. A recognition rate of 97 % is obtained by using the KNN based algorithm for vowel recognition and an overall rate of 84.3% and 87% is obtained by RNN based algorithm and KNN based algorithm respectively for the dialectal Assamese vowel recognition. K-NN based approach gives better recognition rate than the ANN based approach.
引用
收藏
页码:674 / 678
页数:5
相关论文
共 50 条
  • [41] Towards capturing fine phonetic variation in speech using articulatory features
    Scharenborg, Odette
    Wan, Vincent
    Moore, Roger K.
    SPEECH COMMUNICATION, 2007, 49 (10-11) : 811 - 826
  • [42] Speech Emotion Classification using Acoustic Features
    Chen, Shizhe
    Jin, Qin
    Li, Xirong
    Yang, Gang
    Xu, Jieping
    2014 9TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2014, : 579 - 583
  • [43] SOME FEATURES OF ACOUSTIC-PHONETIC SEGMENTS OF VOICELESS PLOSIVES AND THEIR RELATION TO SPEECH CONTEXT.
    Hayamizu, Satoru
    Tanaka, Kazuyo
    Ohta, Kozo
    Denshi Gijutsu Sogo Kenkyusho Iho/Bulletin of the Electrotechnical Laboratory, 1988, 52 (03): : 38 - 42
  • [44] Preliminary Analysis of Lambani Vowels and Vowel Classification Using Acoustic Features
    Dihingia, Leena
    Bannulmath, Prashant
    Chowdhury, Amartya Roy
    Prasanna, S.R.M.
    Deepak, K.T.
    Sheikh, Tehreem
    Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2023, 14339 LNAI : 195 - 207
  • [45] Preliminary Analysis of Lambani Vowels and Vowel Classification Using Acoustic Features
    Dihingia, Leena
    Bannulmath, Prashant
    Chowdhury, Amartya Roy
    Prasanna, S. R. M.
    Deepak, K. T.
    Sheikh, Tehreem
    SPEECH AND COMPUTER, SPECOM 2023, PT II, 2023, 14339 : 195 - 207
  • [46] An Approach to Lexical Stress Detection from Transcribed Continuous Speech Using Acoustic Features
    Domokos, Jozsef
    Stan, Adriana
    Giurgiu, Mircea
    2014 22ND TELECOMMUNICATIONS FORUM TELFOR (TELFOR), 2014, : 525 - 528
  • [47] Automatic question detection from acoustic and phonetic features using feature-wise pre-training
    Ando, Atsushi
    Asakawa, Reine
    Masumura, Ryo
    Kamiyama, Hosana
    Kobashikawa, Satoshi
    Aono, Yushi
    19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 1731 - 1735
  • [48] Excitation Source Features for Improving the Detection of Vowel Onset and Offset Points in a Speech Sequence
    Pradhan, Gayadhar
    Kumar, Avinash
    Shahnawazuddin, S.
    18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 1884 - 1888
  • [49] Can we decode phonetic features in inner speech using surface electromyography?
    Nalborczyk, Ladislas
    Grandchamp, Romain
    Koster, Ernst H. W.
    Perrone-Bertolotti, Marcela
    Loevenbruck, Helene
    PLOS ONE, 2020, 15 (05):
  • [50] Acoustic Features Characterization of Autism Speech for Automated Detection and Classification
    Mohanta, Abhijit
    Mukherjee, Prerana
    Mirtal, Vinay Kumar
    2020 TWENTY SIXTH NATIONAL CONFERENCE ON COMMUNICATIONS (NCC 2020), 2020,