Vowel Onset Point Detection using Sonority Information

Cited by: 3
Authors
Sharma, Bidisha [1]
Prasanna, S. R. Mahadeva [1]
Affiliations
[1] Indian Inst Technol Guwahati, Dept Elect & Elect Engn, Gauhati 781039, India
Keywords
Vowel onset point; Sonority; Vocal-tract system; Excitation source; Suprasegmental; C/V SEGMENTATION ALGORITHM; SPEAKER VERIFICATION; SPEECH; MODEL;
DOI
10.21437/Interspeech.2017-790
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Vowel onset point (VOP) refers to the starting event of a vowel, which may be reflected in different aspects of the speech signal. The major issue in VOP detection using existing methods is the confusion between vowels and the other categories of sounds preceding them. This work explores the usefulness of sonority information to reduce this confusion and improve VOP detection. Vowels are the most sonorant sounds, followed by semivowels, nasals, voiced fricatives, and voiced stops. The sonority feature is derived from the vocal-tract system, excitation source, and suprasegmental aspects. As this feature has the capability to discriminate among different sonorant sound units, it reduces the confusion between the onsets of vowels and those of other sonorant sounds. This results in improved detection and resolution of VOPs in continuous speech. The performance of the proposed sonority-information-based VOP detection is found to be 92.4%, compared to 85.2% for the existing method. The resolution of localizing VOPs within 10 ms is also significantly enhanced: a performance of 73.0% is achieved, as opposed to 60.2% for the existing method.
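The percentages in the abstract are detection rates measured against reference VOP locations within a time tolerance. The abstract does not give the scoring procedure, so the following is only a minimal sketch under stated assumptions: a reference VOP counts as detected if at least one hypothesized VOP lies within the tolerance, and the function name, the sample timestamps, and the 40 ms tolerance are all hypothetical (only the 10 ms tolerance appears in the abstract).

```python
from bisect import bisect_left


def vop_detection_rate(reference_ms, detected_ms, tolerance_ms):
    """Percentage of reference VOPs that have a detection within tolerance.

    Assumed matching rule (not taken from the paper): a reference VOP is a
    hit if the nearest detected VOP is at most `tolerance_ms` away.
    """
    if not reference_ms:
        return 0.0
    detected = sorted(detected_ms)
    hits = 0
    for ref in reference_ms:
        i = bisect_left(detected, ref)
        # The nearest detection is either just before or just after `ref`.
        neighbours = detected[max(i - 1, 0): i + 1]
        nearest = min((abs(d - ref) for d in neighbours), default=float("inf"))
        if nearest <= tolerance_ms:
            hits += 1
    return 100.0 * hits / len(reference_ms)


# Hypothetical timestamps in milliseconds, for illustration only.
reference = [100, 250, 400, 600]
detected = [105, 262, 395, 700]

print(vop_detection_rate(reference, detected, tolerance_ms=10))
print(vop_detection_rate(reference, detected, tolerance_ms=40))
```

Tightening the tolerance can only lower the rate, which matches the pattern in the abstract where the 10 ms figure is below the overall detection figure. This sketch does not penalize spurious detections; a fuller evaluation would also count them.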
Pages: 444-448
Page count: 5
Related papers
50 in total
  • [1] Detection of vowel onset point in speech
    Prasanna, SRM
    Zachariah, JM
    2002 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-IV, PROCEEDINGS, 2002, : 4159 - 4159
  • [2] Improved vowel onset point detection using epoch intervals
    Vuppala, Anil Kumar
    Rao, K. Sreenivasa
    Chakrabarti, Saswat
    AEU-INTERNATIONAL JOURNAL OF ELECTRONICS AND COMMUNICATIONS, 2012, 66 (08) : 697 - 700
  • [3] Effect of Noise on Vowel Onset Point Detection
    Vuppala, Anil Kumar
    Yadav, Jainath
    Rao, K. Sreenivasa
    Chakrabarti, Saswat
    CONTEMPORARY COMPUTING, 2011, 168 : 201 - +
  • [4] Vowel onset point detection for noisy speech using spectral energy at formant frequencies
    Vuppala A.K.
    Rao K.S.
    International Journal of Speech Technology, 2013, 16 (02) : 229 - 235
  • [5] Vowel Onset Point Detection Using Source, Spectral Peaks, and Modulation Spectrum Energies
    Prasanna, S. R. Mahadeva
    Reddy, B. V. Sandeep
    Krishnamoorthy, P.
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2009, 17 (04) : 556 - 565
  • [6] Vowel Onset Point Detection for Low Bit Rate Coded Speech
    Vuppala, Anil Kumar
    Yadav, Jainath
    Chakrabarti, Saswat
    Rao, K. Sreenivasa
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2012, 20 (06) : 1894 - 1903
  • [7] VOWEL-ONSET DETECTION
    HERMES, DJ
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1990, 87 (02) : 866 - 873
  • [8] Robust analysis for improvement of vowel onset point detection under noisy conditions
    Saha P.
    Baruah U.
    Laskar R.H.
    Mishra S.
    Choudhury S.P.
    Das T.K.
    International Journal of Speech Technology, 2016, 19 (3) : 433 - 448
  • [9] Vowel Onset Point Based Characterization of Velopharyngeal Activity Using Imaging Techniques
    Sudro, Protima Nomo
    Vikram, C. M.
    Prasanna, S. R. Mahadeva
    2017 TWENTY-THIRD NATIONAL CONFERENCE ON COMMUNICATIONS (NCC), 2017,
  • [10] Improved Vowel Onset and Offset Points Detection Using Bessel Features
    Sarma, Biswajit Dev
    Prajwal, Supreeth S.
    Prasanna, S. R. Mahadeva
    2014 INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND COMMUNICATIONS (SPCOM), 2014,