Significance of Automatic Detection of Vowel Regions for Automatic Shout Detection in Continuous Speech

Authors
Mittal, Vinay Kumar [1]
Vuppala, Anil Kumar [2]
Affiliations
[1] Indian Inst Informat Technol Chittoor, Sri City, Andhra Pradesh, India
[2] Int Inst Informat Technol, Hyderabad, Telangana, India
Keywords
automatic vowel detection; shout detection; vowel onset point; zero-frequency filtering; dominant frequency; onset point detection
DOI
Not available
Chinese Library Classification number
TP301 [Theory, Methods]
Discipline classification code
081202
Abstract
Automatic detection of shout prosody in a continuous speech signal involves examining changes in its production characteristics. Our recent study of electroglottograph signals showed that significant changes occur in the glottal excitation source characteristics during production of shouted speech, especially in vowel contexts. However, the differences between normal and shouted speech in production features derived over utterances or word segments may sometimes be masked by variations related to pauses or unvoiced regions. Moreover, for a real-time system, these vowel regions need to be found automatically. In this paper, changes in the shout production features are examined in automatically detected vowel regions. Production of a vowel involves periodic impulse-like excitation and relatively high signal energy. Hence, knowledge of epochs obtained using zero-frequency filtering, together with accurate vowel onset points, can be used to detect these regions. Changes in two excitation source features, the instantaneous fundamental frequency and the strength of excitation, and in one vocal tract filter feature, the dominant frequency, are examined for five steady vowel regions. Larger changes in these distinguishing features are observed in the automatically detected vowel regions than in word segments. This approach can help improve systems for automatic detection of shout regions in continuous speech, as well as paralinguistic applications that involve detection of prosody or emotions.
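The abstract names zero-frequency filtering as the source of epochs, from which the instantaneous fundamental frequency and strength of excitation are derived. The following is a minimal sketch of that general technique, not the authors' implementation. The function names, the 10 ms trend-removal window, the three trend-removal passes, and the choice of positive-going zero crossings as epochs are all assumptions made for illustration. Vowel onset point detection and the dominant frequency feature are not covered.

```python
import numpy as np


def zero_frequency_filter(speech, fs, win_ms=10.0, passes=3):
    """Return a zero-frequency filtered signal and the width of its unreliable edges.

    win_ms and passes are illustrative defaults (assumptions), with the window
    chosen to be roughly one average pitch period.
    """
    s = np.asarray(speech, dtype=np.float64)
    # Difference the signal to remove any slowly varying offset.
    y = np.diff(s, prepend=s[0])
    # Two cascaded resonators with double poles at 0 Hz, i.e. four integrations.
    # The output grows polynomially, so this sketch suits short signals only.
    for _ in range(4):
        y = np.cumsum(y)
    # Remove the polynomial trend by repeatedly subtracting a local mean.
    win = int(round(win_ms * 1e-3 * fs)) | 1  # force an odd window length
    kernel = np.ones(win) / win
    for _ in range(passes):
        y = y - np.convolve(y, kernel, mode="same")
    # Zero padding in the moving average corrupts this many samples per edge.
    margin = passes * (win // 2) + 1
    return y, margin


def epochs_f0_soe(zff, fs, margin):
    """Locate epochs and derive per-epoch F0 (Hz) and strength of excitation."""
    core = zff[margin:len(zff) - margin]
    # Assumption: epochs are taken at positive-going zero crossings.
    idx = np.flatnonzero((core[:-1] < 0) & (core[1:] >= 0)) + 1 + margin
    # Strength of excitation approximated by the slope at each crossing.
    soe = zff[idx] - zff[idx - 1]
    # Instantaneous F0 from the interval between successive epochs.
    f0 = fs / np.diff(idx)
    return idx, f0, soe


if __name__ == "__main__":
    # Synthetic impulse train standing in for voiced excitation.
    fs = 8000
    train = np.zeros(4000)
    train[::80] = 1.0
    zff, margin = zero_frequency_filter(train, fs)
    epochs, f0, soe = epochs_f0_soe(zff, fs, margin)
    print(len(epochs), float(np.median(f0)))
```

In a setting like the one the abstract describes, the per-epoch values would then be compared between normal and shouted speech inside the detected vowel regions only, which keeps pauses and unvoiced regions out of the comparison.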
Pages: 5