Significance of Automatic Detection of Vowel Regions for Automatic Shout Detection in Continuous Speech

Cited by: 0
Authors
Mittal, Vinay Kumar [1]
Vuppala, Anil Kumar [2]
Affiliations
[1] Indian Inst Informat Technol Chittoor, Sri City, Andhra Pradesh, India
[2] Int Inst Informat Technol, Hyderabad, Telangana, India
Keywords
automatic vowel detection; shout detection; vowel onset point; zero-frequency filtering; dominant frequency; onset point detection
DOI
Not available
Chinese Library Classification (CLC) number
TP301 [Theory, Methods]
Discipline classification code
081202
Abstract
Automatic detection of shout prosody in a continuous speech signal involves examining changes in its production characteristics. Our recent study of electroglottograph signals highlighted that significant changes occur in the glottal excitation source characteristics during the production of shouted speech, especially in vowel contexts. However, the differences between normal and shouted speech in production features derived over utterances or word segments may sometimes be masked by variations related to pauses or unvoiced regions. Moreover, for a real-time system, these vowel regions need to be found automatically. In this paper, changes in the shout production features are examined in automatically detected vowel regions. Production of a vowel involves periodic impulse-like excitation and relatively high signal energy. Hence, knowledge of epochs obtained using zero-frequency filtering, together with accurate vowel onset points, can be used for detecting these regions. Changes in two excitation source features, the instantaneous fundamental frequency and the strength of excitation, and in one vocal tract filter feature, the dominant frequency, are examined for five steady vowel regions. Larger changes in these distinguishing features are observed in the automatically found vowel regions than in word segments. This approach can help improve systems for automatic detection of shout regions in continuous speech, and paralinguistic applications that involve detection of prosody or emotions.
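The abstract names zero-frequency filtering as the source of epochs and lists the instantaneous fundamental frequency and the strength of excitation as excitation features. The following is a minimal NumPy sketch of the commonly described form of that technique, not the authors' implementation: the function name `zff_epochs`, the default `mean_pitch_ms` of 10 ms, the window of 1.5 pitch periods, and the three trend-removal passes are all assumptions chosen for illustration. Epochs are taken at negative-to-positive zero crossings, which presumes the usual speech polarity; for an inverted signal the crossings shift by roughly half a pitch period.

```python
import numpy as np

def zff_epochs(signal, fs, mean_pitch_ms=10.0, passes=3):
    """Illustrative zero-frequency filtering (hypothetical helper).

    Returns epoch sample indices, a slope-based strength of excitation
    at each epoch, and instantaneous F0 (Hz) between successive epochs.
    """
    s = np.asarray(signal, dtype=np.float64)
    # Difference the signal to remove any DC offset.
    x = np.diff(s, prepend=s[0])
    # Cascade of two zero-frequency resonators,
    # y[n] = 2*y[n-1] - y[n-2] + x[n], applied twice.
    # Each resonator equals two cumulative sums, hence four in total.
    y = x
    for _ in range(4):
        y = np.cumsum(y)
    # Remove the polynomial trend by repeatedly subtracting a local mean
    # over about 1.5 assumed pitch periods (odd length keeps it symmetric).
    win = int(round(1.5 * mean_pitch_ms * 1e-3 * fs)) | 1
    kernel = np.ones(win) / win
    for _ in range(passes):
        y = y - np.convolve(y, kernel, mode="same")
    # Samples near the edges are corrupted by zero padding; skip them.
    margin = passes * win
    core = y[margin:len(y) - margin]
    # Epochs: negative-to-positive zero crossings of the filtered signal.
    idx = np.flatnonzero((core[:-1] < 0) & (core[1:] >= 0))
    epochs = idx + 1 + margin
    # Strength of excitation: slope of the filtered signal at each epoch.
    strength = core[idx + 1] - core[idx]
    # Instantaneous F0 from the interval between successive epochs.
    f0 = fs / np.diff(epochs)
    return epochs, strength, f0

# Usage on a synthetic 100 Hz impulse train sampled at 8 kHz.
fs = 8000
sig = np.zeros(fs)
sig[::80] = 1.0
epochs, strength, f0 = zff_epochs(sig, fs)
```

On real speech the trend-removal window should follow the speaker's average pitch period, which this sketch leaves to the caller through `mean_pitch_ms`.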
Pages: 5