Spotting and Recognition of Consonant-Vowel Units from Continuous Speech Using Accurate Detection of Vowel Onset Points

被引：0

作者：

Anil Kumar Vuppala

K. Sreenivasa Rao

Saswat Chakrabarti

机构：

[1] GSSST,

[2] IIT Kharagpur,undefined

[3] SIT,undefined

[4] IIT Kharagpur,undefined

来源：

Circuits, Systems, and Signal Processing | 2012年 / 31卷

关键词：

Spotting consonant-vowel (CV) units; Vowel onset point (VOP); Epoch locations; Hidden Markov model (HMM); Support vector machine (SVM);

D O I：

暂无

中图分类号：

学科分类号：

摘要：

In this paper, we propose an efficient approach to spotting and recognition of consonant-vowel (CV) units from continuous speech using accurate detection of vowel onset points (VOPs). Existing methods for VOP detection suffer from lack of high accuracy, spurious VOPs, and missed VOPs. The proposed VOP detection is designed to overcome most of the shortcomings of the existing methods and provide accurate detection of VOPs for improving the performance of spotting and recognition of CV units. The proposed method for VOP detection is carried out in two levels. At the first level, VOPs are detected by combining the complementary evidence from excitation source, spectral peaks, and modulation spectrum. At the second level, hypothesized VOPs are verified (genuine or spurious), and their positions are corrected using the uniform epoch intervals present in the vowel regions. The spotted CV units are recognized using a two-stage CV recognizer. Two-stage CV recognition system consists of hidden Markov models (HMMs) at the first stage for recognizing the vowel category of a CV unit and support vector machines (SVMs) for recognizing the consonant category of a CV unit at the second stage. Performance of spotting and recognition of CV units from continuous speech is evaluated using Telugu broadcast news speech corpus.

引用

页码：1459 / 1474

页数：15

共 41 条

[31] Homomorphic Filtered Spectral Peaks Energy for Automatic Detection of Vowel Onset Point in Continuous Speech
Zang, Xian
Chong, Kil To
IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2013, E96D (04): : 949 - 956
[32] Issues in Formant Analysis of Emotive Speech Using Vowel-Like Region Onset Points
Surya, R.
Ashwini, R.
Pravena, D.
Govind, D.
INTELLIGENT SYSTEMS TECHNOLOGIES AND APPLICATIONS, VOL 1, 2016, 384 : 139 - 146
[33] Vowel onset point detection for noisy speech using spectral energy at formant frequencies
Vuppala A.K.
Rao K.S.
International Journal of Speech Technology, 2013, 16 (02) : 229 - 235
[34] Vowel Recognition from Telephonic Speech Using MFCCs and Gaussian Mixture Models
Koolagudi, Shashidhar G.
Thakur, Sujata Negi
Barthwal, Anurag
Singh, Manoj Kumar
Rawat, Ramesh
Rao, K. Sreenivasa
ECO-FRIENDLY COMPUTING AND COMMUNICATION SYSTEMS, 2012, 305 : 170 - +
[35] Detection of Vowel Offset Points Using Non-Local Similarity Between Speech Samples
Kumar, Avinash
Shahnawazuddin, S.
Pradhan, Gayadhar
2018 INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND COMMUNICATIONS (SPCOM 2018), 2018, : 252 - 256
[36] Emotion recognition from spontaneous speech using emotional vowel-like regions
Fahad, Md Shah
Singh, Shreya
Abhinav
Ranjan, Ashish
Deepak, Akshay
MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (10) : 14025 - 14043
[37] Emotion recognition from spontaneous speech using emotional vowel-like regions
Md Shah Fahad
Shreya Singh
Ashish Abhinav
Akshay Ranjan
Multimedia Tools and Applications, 2022, 81 : 14025 - 14043
[38] Vowel, digit and continuous speech recognition based on statistical, neural and hybrid modelling by using ASRS_RL
Dumitru, Corneliu Octavian
Gavat, Inge
EUROCON 2007: THE INTERNATIONAL CONFERENCE ON COMPUTER AS A TOOL, VOLS 1-6, 2007, : 670 - 677
[39] Detection of vowel onset and offset points using non-local similarity between DWT approximation coefficients
Kumar, A.
Pradhan, G.
ELECTRONICS LETTERS, 2018, 54 (11) : 722 - 723
[40] Vowel speech recognition from rat electroencephalography using long short-term memory neural network
Ham, Jinsil
Yoo, Hyun-Joon
Kim, Jongin
Lee, Boreom
PLOS ONE, 2022, 17 (06):

← 1 2 3 4 5 →