Spotting and Recognition of Consonant-Vowel Units from Continuous Speech Using Accurate Detection of Vowel Onset Points

Cited by: 0
Authors
Anil Kumar Vuppala
K. Sreenivasa Rao
Saswat Chakrabarti
Affiliations
[1] GSSST
[2] IIT Kharagpur
[3] SIT
[4] IIT Kharagpur
Keywords
Spotting consonant-vowel (CV) units; Vowel onset point (VOP); Epoch locations; Hidden Markov model (HMM); Support vector machine (SVM)
DOI
Not available
Chinese Library Classification number
Subject classification number
Abstract
In this paper, we propose an efficient approach to spotting and recognizing consonant-vowel (CV) units in continuous speech using accurate detection of vowel onset points (VOPs). Existing VOP detection methods suffer from limited accuracy, spurious VOPs, and missed VOPs. The proposed VOP detection is designed to overcome most of these shortcomings and to provide accurate VOPs that improve the spotting and recognition of CV units. The proposed method operates at two levels. At the first level, VOPs are detected by combining the complementary evidence from the excitation source, spectral peaks, and modulation spectrum. At the second level, the hypothesized VOPs are verified as genuine or spurious, and their positions are corrected using the uniform epoch intervals present in the vowel regions. The spotted CV units are recognized using a two-stage CV recognizer, which uses hidden Markov models (HMMs) at the first stage to recognize the vowel category of a CV unit and support vector machines (SVMs) at the second stage to recognize its consonant category. The performance of spotting and recognition of CV units from continuous speech is evaluated using a Telugu broadcast news speech corpus.
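The two-level idea in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the equal-weight averaging of the three evidence contours, the peak-picking rule, the uniformity test on epoch intervals, and all names and thresholds (`hypothesize_vops`, `is_genuine`, `threshold`, `min_gap`, `max_rel_spread`) are assumptions made for illustration. The position-correction step mentioned in the abstract is not covered.

```python
from statistics import mean, pstdev


def normalize(evidence):
    """Scale an evidence contour to the range [0, 1]."""
    lo, hi = min(evidence), max(evidence)
    if hi == lo:
        return [0.0 for _ in evidence]
    return [(x - lo) / (hi - lo) for x in evidence]


def hypothesize_vops(source, spectral, modulation, threshold=0.5, min_gap=3):
    """Level 1 (assumed form): average three normalized evidence contours
    and keep local peaks above a threshold as hypothesized VOPs."""
    contours = zip(normalize(source), normalize(spectral), normalize(modulation))
    combined = [sum(values) / 3.0 for values in contours]
    peaks = []
    for i in range(1, len(combined) - 1):
        is_peak = combined[i - 1] < combined[i] >= combined[i + 1]
        if is_peak and combined[i] >= threshold:
            # Enforce a minimum spacing between accepted peaks.
            if not peaks or i - peaks[-1] >= min_gap:
                peaks.append(i)
    return peaks


def is_genuine(vop, epochs, n_intervals=4, max_rel_spread=0.1):
    """Level 2 (assumed form): accept a hypothesized VOP only if the epoch
    intervals that follow it are nearly uniform, as expected in a vowel.
    `vop` and `epochs` share one time unit; `epochs` is strictly increasing."""
    following = [e for e in epochs if e >= vop][: n_intervals + 1]
    if len(following) < n_intervals + 1:
        return False
    intervals = [b - a for a, b in zip(following, following[1:])]
    return pstdev(intervals) / mean(intervals) <= max_rel_spread


# Toy contours (hypothetical values): all three agree on a peak at index 3.
source = [0, 0, 1, 5, 1, 0, 0, 0, 0, 0]
spectral = [0, 1, 2, 6, 2, 1, 0, 0, 0, 0]
modulation = [0, 0, 1, 4, 1, 0, 0, 1, 0, 0]
vops = hypothesize_vops(source, spectral, modulation)
```

With these toy contours the small bump that appears only in `modulation` stays below the threshold, which is the intended benefit of combining complementary evidence. A candidate followed by evenly spaced epochs passes `is_genuine`, while one followed by irregular epochs is rejected as spurious.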
Pages: 1459 - 1474
Page count: 15
Related papers
41 in total
  • [1] Spotting and Recognition of Consonant-Vowel Units from Continuous Speech Using Accurate Detection of Vowel Onset Points
    Vuppala, Anil Kumar
    Rao, K. Sreenivasa
    Chakrabarti, Saswat
    CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2012, 31 (04) : 1459 - 1474
  • [2] Spotting multilingual consonant-vowel units of speech using neural network models
    Gangashetty, SV
    Sekhar, CC
    Yegnanarayana, B
    NONLINEAR ANALYSES AND ALGORITHMS FOR SPEECH PROCESSING, 2005, 3817 : 303 - 317
  • [3] Spotting consonant-vowel units in continuous speech using autoassociative neural networks and support vector machines
    Gangashetty, SV
    Sekhar, CC
    Yegnanarayana, B
    MACHINE LEARNING FOR SIGNAL PROCESSING XIV, 2004, : 401 - 410
  • [4] Effect of Speech Coding on Recognition of Consonant-Vowel (CV) Units
    Vuppala, Anil Kumar
    Chakrabarti, Saswat
    Rao, K. Sreenivasa
    CONTEMPORARY COMPUTING, PT 1, 2010, 94 : 284 - +
  • [5] Neural network models for spotting Stop Consonant-Vowel (SCV) segments in continuous speech
    Sekhar, CC
    Yegnanarayana, B
    ICNN - 1996 IEEE INTERNATIONAL CONFERENCE ON NEURAL NETWORKS, VOLS. 1-4, 1996, : 2003 - 2008
  • [6] Combining evidence from multiple modular networks for recognition of consonant-vowel units of speech
    Gangashetty, SV
    Rao, KS
    Khan, AN
    Sekhar, CC
    Yegnanarayana, B
    PROCEEDINGS OF THE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS 2003, VOLS 1-4, 2003, : 686 - 691
  • [7] Effect of Noise on Recognition of Consonant-Vowel (CV) Units
    Vuppala, Anil Kumar
    Rao, K. Sreenivasa
    Chakrabarti, Saswat
    CONTEMPORARY COMPUTING, 2011, 168 : 191 - +
  • [8] Improved vowel region detection from a continuous speech using post processing of vowel onset points and vowel end-points
    Thirumuru, Ramakrishna
    Gangashetty, Suryakanth V.
    Vuppala, Anil Kumar
    MULTIMEDIA TOOLS AND APPLICATIONS, 2018, 77 (04) : 4753 - 4767
  • [9] Combining evidence from multiple classifiers for recognition of consonant-vowel units of speech in multiple languages
    Gangashetty, SV
    Sekhar, CC
    Yegnanarayana, B
    2005 International Conference on Intelligent Sensing and Information Processing, Proceedings, 2005, : 387 - 391