Spotting and Recognition of Consonant-Vowel Units from Continuous Speech Using Accurate Detection of Vowel Onset Points

被引:0
|
作者
Anil Kumar Vuppala
K. Sreenivasa Rao
Saswat Chakrabarti
机构
[1] GSSST,
[2] IIT Kharagpur,undefined
[3] SIT,undefined
[4] IIT Kharagpur,undefined
关键词
Spotting consonant-vowel (CV) units; Vowel onset point (VOP); Epoch locations; Hidden Markov model (HMM); Support vector machine (SVM);
D O I
暂无
中图分类号
学科分类号
摘要
In this paper, we propose an efficient approach to spotting and recognition of consonant-vowel (CV) units from continuous speech using accurate detection of vowel onset points (VOPs). Existing methods for VOP detection suffer from lack of high accuracy, spurious VOPs, and missed VOPs. The proposed VOP detection is designed to overcome most of the shortcomings of the existing methods and provide accurate detection of VOPs for improving the performance of spotting and recognition of CV units. The proposed method for VOP detection is carried out in two levels. At the first level, VOPs are detected by combining the complementary evidence from excitation source, spectral peaks, and modulation spectrum. At the second level, hypothesized VOPs are verified (genuine or spurious), and their positions are corrected using the uniform epoch intervals present in the vowel regions. The spotted CV units are recognized using a two-stage CV recognizer. Two-stage CV recognition system consists of hidden Markov models (HMMs) at the first stage for recognizing the vowel category of a CV unit and support vector machines (SVMs) for recognizing the consonant category of a CV unit at the second stage. Performance of spotting and recognition of CV units from continuous speech is evaluated using Telugu broadcast news speech corpus.
引用
收藏
页码:1459 / 1474
页数:15
相关论文
共 41 条
  • [31] Homomorphic Filtered Spectral Peaks Energy for Automatic Detection of Vowel Onset Point in Continuous Speech
    Zang, Xian
    Chong, Kil To
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2013, E96D (04): : 949 - 956
  • [32] Issues in Formant Analysis of Emotive Speech Using Vowel-Like Region Onset Points
    Surya, R.
    Ashwini, R.
    Pravena, D.
    Govind, D.
    INTELLIGENT SYSTEMS TECHNOLOGIES AND APPLICATIONS, VOL 1, 2016, 384 : 139 - 146
  • [33] Vowel onset point detection for noisy speech using spectral energy at formant frequencies
    Vuppala A.K.
    Rao K.S.
    International Journal of Speech Technology, 2013, 16 (02) : 229 - 235
  • [34] Vowel Recognition from Telephonic Speech Using MFCCs and Gaussian Mixture Models
    Koolagudi, Shashidhar G.
    Thakur, Sujata Negi
    Barthwal, Anurag
    Singh, Manoj Kumar
    Rawat, Ramesh
    Rao, K. Sreenivasa
    ECO-FRIENDLY COMPUTING AND COMMUNICATION SYSTEMS, 2012, 305 : 170 - +
  • [35] Detection of Vowel Offset Points Using Non-Local Similarity Between Speech Samples
    Kumar, Avinash
    Shahnawazuddin, S.
    Pradhan, Gayadhar
    2018 INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND COMMUNICATIONS (SPCOM 2018), 2018, : 252 - 256
  • [36] Emotion recognition from spontaneous speech using emotional vowel-like regions
    Fahad, Md Shah
    Singh, Shreya
    Abhinav
    Ranjan, Ashish
    Deepak, Akshay
    MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (10) : 14025 - 14043
  • [37] Emotion recognition from spontaneous speech using emotional vowel-like regions
    Md Shah Fahad
    Shreya Singh
    Ashish Abhinav
    Akshay Ranjan
    Multimedia Tools and Applications, 2022, 81 : 14025 - 14043
  • [38] Vowel, digit and continuous speech recognition based on statistical, neural and hybrid modelling by using ASRS_RL
    Dumitru, Corneliu Octavian
    Gavat, Inge
    EUROCON 2007: THE INTERNATIONAL CONFERENCE ON COMPUTER AS A TOOL, VOLS 1-6, 2007, : 670 - 677
  • [39] Detection of vowel onset and offset points using non-local similarity between DWT approximation coefficients
    Kumar, A.
    Pradhan, G.
    ELECTRONICS LETTERS, 2018, 54 (11) : 722 - 723
  • [40] Vowel speech recognition from rat electroencephalography using long short-term memory neural network
    Ham, Jinsil
    Yoo, Hyun-Joon
    Kim, Jongin
    Lee, Boreom
    PLOS ONE, 2022, 17 (06):