Improved vowel region detection from a continuous speech using post processing of vowel onset points and vowel end-points

被引:0
|
作者
Ramakrishna Thirumuru
Suryakanth V. Gangashetty
Anil Kumar Vuppala
机构
[1] International Institute of Information Technology Hyderabad,Language Technology Research Center
来源
关键词
Vowel onset point (VOP); Vowel end-point (VEP); Zero frequency filtering; Magnitude spectrum; Epoch intervals; Strength of the excitation;
D O I
暂无
中图分类号
学科分类号
摘要
Vowels are produced with an open configuration of the vocal tract, without any audible friction. The acoustic signal is relatively loud with varying strength of impulse-like excitation. Vowels possess significant energy content in the low-frequency bands of the speech signal. Acoustic events such as vowel onset point (VOP) and vowel end-point (VEP) can be used as landmarks to detect vowel regions in a speech signal. In this paper, a two-stage algorithm is proposed to detect precise vowel regions. In the first level, the speech signal is processed using zero frequency filtering to emphasize energy content in low-frequency bands of speech. Zero frequency filtered signal predominantly contains low-frequency content of the speech signal as it is filtered around 0 Hz. This process is followed by the extraction of dominant spectral peaks from the magnitude spectrum around glottal closure regions of the speech signal. The vowel onset points and vowel end-points are obtained by convolving the enhanced spectral contour of zero frequency filtered signal with first order Gaussian differentiator. In the next level, a post-processing is carried out in the regions around VOP and VEP to remove spurious vowel regions based on uniformity of epoch intervals. In addition, the positions of VOPs and VEPs are also corrected using the strength of the excitation of the speech signal. The performance of the proposed vowel region detection method is compared with the existing state of art methods on TIMIT acoustic-phonetic speech corpus. It is reported that this method produced significant improvement in vowel region detection in clean and noisy environments.
引用
收藏
页码:4753 / 4767
页数:14
相关论文
共 50 条
  • [31] Detection of Vowel-Like Speech Using Variance of Sample Magnitudes
    Srinivas, Nagapuri
    Pradhan, Gayadhar
    Kumar, Puli Kishore
    2019 25TH NATIONAL CONFERENCE ON COMMUNICATIONS (NCC), 2019,
  • [32] Perception of Emotional Valences and Activity Levels from Vowel Segments of Continuous Speech
    Waaramaa, Teija
    Laukkanen, Anne-Maria
    Airas, Matti
    Alku, Paavo
    JOURNAL OF VOICE, 2010, 24 (01) : 30 - 38
  • [33] Character Region Detection Using Structure of Hangul Vowel Graphemes from Mobile Image
    Park, Jong-Cheon
    Jun, Byoung-Min
    Oh, Myoung-Kwan
    GRID AND DISTRIBUTED COMPUTING, 2011, 261 : 228 - +
  • [34] Non-Local Estimation of Speech Signal for Vowel Onset Point Detection in Varied Environments
    Kumar, Avinash
    Shahnawazuddin, S.
    Pradhan, Gayadhar
    18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 429 - 433
  • [35] Syllable Segmentation of Tamil Speech Signals Using Vowel Onset Point and Spectral Transition Measure
    Geetha K.
    Vadivel R.
    Automatic Control and Computer Sciences, 2018, 52 (1) : 25 - 31
  • [36] A pre-processing method for improvement of vowel onset point detection under noisy conditions
    Saha, P.
    Laskar, R. H.
    Laskar, A.
    SPEECH COMMUNICATION, 2016, 80 : 71 - 83
  • [37] Vowel Recognition from Telephonic Speech Using MFCCs and Gaussian Mixture Models
    Koolagudi, Shashidhar G.
    Thakur, Sujata Negi
    Barthwal, Anurag
    Singh, Manoj Kumar
    Rawat, Ramesh
    Rao, K. Sreenivasa
    ECO-FRIENDLY COMPUTING AND COMMUNICATION SYSTEMS, 2012, 305 : 170 - +
  • [38] Dialectal Assamese Vowel Speech Detection using Acoustic Phonetic Features, KNN and RNN
    Sharma, Mridusmita
    Sarma, Kandarpa Kumar
    2ND INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND INTEGRATED NETWORKS (SPIN) 2015, 2015, : 674 - 678
  • [39] Vowel Onset Point Detection Using Source, Spectral Peaks, and Modulation Spectrum Energies
    Prasanna, S. R. Mahadeva
    Reddy, B. V. Sandeep
    Krishnamoorthy, P.
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2009, 17 (04): : 556 - 565
  • [40] Emotions in vowel segments of continuous speech: Analysis of the glottal flow using the normalised amplitude quotient
    Airas, M
    Alku, P
    PHONETICA, 2006, 63 (01) : 26 - 46