Spotting and Recognition of Consonant-Vowel Units from Continuous Speech Using Accurate Detection of Vowel Onset Points

被引:0
|
作者
Anil Kumar Vuppala
K. Sreenivasa Rao
Saswat Chakrabarti
机构
[1] GSSST,
[2] IIT Kharagpur,undefined
[3] SIT,undefined
[4] IIT Kharagpur,undefined
关键词
Spotting consonant-vowel (CV) units; Vowel onset point (VOP); Epoch locations; Hidden Markov model (HMM); Support vector machine (SVM);
D O I
暂无
中图分类号
学科分类号
摘要
In this paper, we propose an efficient approach to spotting and recognition of consonant-vowel (CV) units from continuous speech using accurate detection of vowel onset points (VOPs). Existing methods for VOP detection suffer from lack of high accuracy, spurious VOPs, and missed VOPs. The proposed VOP detection is designed to overcome most of the shortcomings of the existing methods and provide accurate detection of VOPs for improving the performance of spotting and recognition of CV units. The proposed method for VOP detection is carried out in two levels. At the first level, VOPs are detected by combining the complementary evidence from excitation source, spectral peaks, and modulation spectrum. At the second level, hypothesized VOPs are verified (genuine or spurious), and their positions are corrected using the uniform epoch intervals present in the vowel regions. The spotted CV units are recognized using a two-stage CV recognizer. Two-stage CV recognition system consists of hidden Markov models (HMMs) at the first stage for recognizing the vowel category of a CV unit and support vector machines (SVMs) for recognizing the consonant category of a CV unit at the second stage. Performance of spotting and recognition of CV units from continuous speech is evaluated using Telugu broadcast news speech corpus.
引用
收藏
页码:1459 / 1474
页数:15
相关论文
共 41 条
  • [21] Recognition of Stop-Consonant-Vowel (SCV) segments in continuous speech using neural network models
    Sekhar, CC
    Yegnanarayana, B
    JOURNAL OF THE INSTITUTION OF ELECTRONICS AND TELECOMMUNICATION ENGINEERS, 1996, 42 (4-5): : 269 - 280
  • [22] A Study on Vowel Region Detection from a Continuous Speech
    Thirumuru, Ramakrishna
    Vydana, Harikrishna
    Gangashetty, Suryakanth V.
    Vuppala, Anil Kumar
    MINING INTELLIGENCE AND KNOWLEDGE EXPLORATION (MIKE 2016), 2017, 10089 : 74 - 82
  • [23] AUTOMATIC DETECTION OF VOWEL CENTERS FROM CONTINUOUS SPEECH.
    Kasuya, Hideki
    Wakita, Hisashi
    Transactions of the Institute of Electronics and Communication Engineers of Japan. Section E, 1981, E64 (10): : 640 - 645
  • [24] Improved Vowel Onset and Offset Points Detection Using Bessel Features
    Sarma, Biswajit Dev
    Prajwal, Supreeth S.
    Prasanna, S. R. Mahadeva
    2014 INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND COMMUNICATIONS (SPCOM), 2014,
  • [25] Excitation Source Features for Improving the Detection of Vowel Onset and Offset Points in a Speech Sequence
    Pradhan, Gayadhar
    Kumar, Avinash
    Shahnawazuddin, S.
    18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 1884 - 1888
  • [26] Classification of stop place in consonant-vowel contexts using feature extrapolation of acoustic-phonetic features in telephone speech
    Lee, Jung-Won
    Choi, Jeung-Yoon
    Kang, Hong-Goo
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2012, 131 (02): : 1536 - 1546
  • [27] VOWEL AND CONSONANT RECOGNITION OF COCHLEAR IMPLANT PATIENTS USING FORMANT-ESTIMATING SPEECH PROCESSORS
    BLAMEY, PJ
    DOWELL, RC
    BROWN, AM
    CLARK, GM
    SELIGMAN, PM
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1987, 82 (01): : 48 - 57
  • [28] Automatic syllabification of speech signal using short time energy and vowel onset points
    Mary, Leena
    Antony, Anil P.
    Babu, Ben P.
    Prasanna, S. R. Mahadeva
    INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2018, 21 (03) : 571 - 579
  • [29] The differential effects of vowel and onset consonant lengthening on speech segmentation: Evidence from Taiwanese Southern Min
    Ou, Shu-chen
    Guo, Zhe-chen
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2021, 149 (03): : 1866 - 1877
  • [30] Using auditory-visual speech to probe the basis of noise-impaired consonant-vowel perception in dyslexia and auditory neuropathy
    Ramirez, J
    Mann, V
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2005, 118 (02): : 1122 - 1133