Dialectal Assamese Vowel Speech Detection using Acoustic Phonetic Features, KNN and RNN

被引：0

作者：

Sharma, Mridusmita ^{[1
]}

Sarma, Kandarpa Kumar ^{[1
]}

机构：

[1] Gauhati Univ, Dept Elect & Commun Engn, Gauhati 14, Assam, India

来源：

2ND INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND INTEGRATED NETWORKS (SPIN) 2015 | 2015年

关键词：

Vowels; Dialect; Acoustic Phonetic Features; Recurrent Neural Network (RNN); K-Nearest Neighbor (KNN; Recognition;

D O I：

暂无

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

The recognition of vowel phonemes plays an important role in the field of speech processing. Assamese is the major language of Assam and also the mother-tongue of the of the largest segment of the population of Assam. The standard Assamese language has four major dialects namely Central dialect, Eastern dialect, Goalpariya dialect and Kamrupi dialect. It has eight vowel phonemes which are /i/, /e/, /epsilon/, /a/, /n/, /c/, /o/ and /u/. In this paper, a comparative analysis between the Recurrent Neural Network (RNN) based algorithm and K-Nearest Neighbor (KNN) based algorithm is carried out for the recognition of the vowel sounds using the Acoustic Phonetic Features as the feature vector. Dialect wise recognition of the vowels is also carried out using the same feature vectors. A recognition rate of 97 % is obtained by using the KNN based algorithm for vowel recognition and an overall rate of 84.3% and 87% is obtained by RNN based algorithm and KNN based algorithm respectively for the dialectal Assamese vowel recognition. K-NN based approach gives better recognition rate than the ANN based approach.

引用

页码：674 / 678

页数：5

共 50 条

[41] Towards capturing fine phonetic variation in speech using articulatory features
Scharenborg, Odette
Wan, Vincent
Moore, Roger K.
SPEECH COMMUNICATION, 2007, 49 (10-11) : 811 - 826
[42] Speech Emotion Classification using Acoustic Features
Chen, Shizhe
Jin, Qin
Li, Xirong
Yang, Gang
Xu, Jieping
2014 9TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2014, : 579 - 583
[43] SOME FEATURES OF ACOUSTIC-PHONETIC SEGMENTS OF VOICELESS PLOSIVES AND THEIR RELATION TO SPEECH CONTEXT.
Hayamizu, Satoru
Tanaka, Kazuyo
Ohta, Kozo
Denshi Gijutsu Sogo Kenkyusho Iho/Bulletin of the Electrotechnical Laboratory, 1988, 52 (03): : 38 - 42
[44] Preliminary Analysis of Lambani Vowels and Vowel Classification Using Acoustic Features
Dihingia, Leena
Bannulmath, Prashant
Chowdhury, Amartya Roy
Prasanna, S.R.M.
Deepak, K.T.
Sheikh, Tehreem
Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2023, 14339 LNAI : 195 - 207
[45] Preliminary Analysis of Lambani Vowels and Vowel Classification Using Acoustic Features
Dihingia, Leena
Bannulmath, Prashant
Chowdhury, Amartya Roy
Prasanna, S. R. M.
Deepak, K. T.
Sheikh, Tehreem
SPEECH AND COMPUTER, SPECOM 2023, PT II, 2023, 14339 : 195 - 207
[46] An Approach to Lexical Stress Detection from Transcribed Continuous Speech Using Acoustic Features
Domokos, Jozsef
Stan, Adriana
Giurgiu, Mircea
2014 22ND TELECOMMUNICATIONS FORUM TELFOR (TELFOR), 2014, : 525 - 528
[47] Automatic question detection from acoustic and phonetic features using feature-wise pre-training
Ando, Atsushi
Asakawa, Reine
Masumura, Ryo
Kamiyama, Hosana
Kobashikawa, Satoshi
Aono, Yushi
19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 1731 - 1735
[48] Excitation Source Features for Improving the Detection of Vowel Onset and Offset Points in a Speech Sequence
Pradhan, Gayadhar
Kumar, Avinash
Shahnawazuddin, S.
18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 1884 - 1888
[49] Can we decode phonetic features in inner speech using surface electromyography?
Nalborczyk, Ladislas
Grandchamp, Romain
Koster, Ernst H. W.
Perrone-Bertolotti, Marcela
Loevenbruck, Helene
PLOS ONE, 2020, 15 (05):
[50] Acoustic Features Characterization of Autism Speech for Automated Detection and Classification
Mohanta, Abhijit
Mukherjee, Prerana
Mirtal, Vinay Kumar
2020 TWENTY SIXTH NATIONAL CONFERENCE ON COMMUNICATIONS (NCC 2020), 2020,

← 1 2 3 4 5 →