Improved vowel region detection from a continuous speech using post processing of vowel onset points and vowel end-points

被引:5
|
作者
Thirumuru, Ramakrishna [1 ]
Gangashetty, Suryakanth V. [1 ]
Vuppala, Anil Kumar [1 ]
机构
[1] Int Inst Informat Technol Hyderabad, Language Technol Res Ctr, Hyderabad, Andhra Pradesh, India
关键词
Vowel onset point (VOP); Vowel end-point (VEP); Zero frequency filtering; Magnitude spectrum; Epoch intervals; Strength of the excitation; EXCITATION; SIGNALS;
D O I
10.1007/s11042-017-5044-8
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Vowels are produced with an open configuration of the vocal tract, without any audible friction. The acoustic signal is relatively loud with varying strength of impulse-like excitation. Vowels possess significant energy content in the low-frequency bands of the speech signal. Acoustic events such as vowel onset point (VOP) and vowel end-point (VEP) can be used as landmarks to detect vowel regions in a speech signal. In this paper, a two-stage algorithm is proposed to detect precise vowel regions. In the first level, the speech signal is processed using zero frequency filtering to emphasize energy content in low-frequency bands of speech. Zero frequency filtered signal predominantly contains low-frequency content of the speech signal as it is filtered around 0 Hz. This process is followed by the extraction of dominant spectral peaks from the magnitude spectrum around glottal closure regions of the speech signal. The vowel onset points and vowel end-points are obtained by convolving the enhanced spectral contour of zero frequency filtered signal with first order Gaussian differentiator. In the next level, a post-processing is carried out in the regions around VOP and VEP to remove spurious vowel regions based on uniformity of epoch intervals. In addition, the positions of VOPs and VEPs are also corrected using the strength of the excitation of the speech signal. The performance of the proposed vowel region detection method is compared with the existing state of art methods on TIMIT acoustic-phonetic speech corpus. It is reported that this method produced significant improvement in vowel region detection in clean and noisy environments.
引用
收藏
页码:4753 / 4767
页数:15
相关论文
共 50 条
  • [21] Robust vowel region detection method for multimode speech
    Tripathi, Kumud
    Rao, K. Sreenivasa
    MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (09) : 13615 - 13637
  • [22] Detection of vowel onset and offset points using non-local similarity between DWT approximation coefficients
    Kumar, A.
    Pradhan, G.
    ELECTRONICS LETTERS, 2018, 54 (11) : 722 - 723
  • [23] Vowel Onset Point Detection using Sonority Information
    Sharma, Bidisha
    Prasanna, S. R. Mahadeva
    18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 444 - 448
  • [24] Homomorphic Filtered Spectral Peaks Energy for Automatic Detection of Vowel Onset Point in Continuous Speech
    Zang, Xian
    Chong, Kil To
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2013, E96D (04): : 949 - 956
  • [25] Detection of Vowel Offset Point From Speech Signal
    Yadav, Jainath
    Rao, K. Sreenivasa
    IEEE SIGNAL PROCESSING LETTERS, 2013, 20 (04) : 299 - 302
  • [26] Vowel onset point detection for noisy speech using spectral energy at formant frequencies
    Vuppala A.K.
    Rao K.S.
    International Journal of Speech Technology, 2013, 16 (02) : 229 - 235
  • [27] Vowel Onset Point Detection for Low Bit Rate Coded Speech
    Vuppala, Anil Kumar
    Yadav, Jainath
    Chakrabarti, Saswat
    Rao, K. Sreenivasa
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2012, 20 (06): : 1894 - 1903
  • [28] Significance of Automatic Detection of Vowel Regions for Automatic Shout Detection in Continuous Speech
    Mittal, Vinay Kumar
    Vuppala, Anil Kumar
    2016 10TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2016,
  • [29] Non-uniform time scale modification using instants of significant excitation and vowel onset points
    Rao, K. Sreenivasa
    Vuppala, Anil Kumar
    SPEECH COMMUNICATION, 2013, 55 (06) : 745 - 756
  • [30] Semi-automatic Syllable Labelling for Assamese Language Using HMM and Vowel Onset-Offset Points
    Sarma, Biswajit Dev
    Sarma, Mousmita
    Prasanna, S. R. M.
    ADVANCES IN COMMUNICATION AND COMPUTING, 2015, 347 : 139 - 147