Significance of Automatic Detection of Vowel Regions for Automatic Shout Detection in Continuous Speech

被引:0
|
作者
Mittal, Vinay Kumar [1 ]
Vuppala, Anil Kumar [2 ]
机构
[1] Indian Inst Informat Technol Chittoor, Sri City, Andhra Pradesh, India
[2] Int Inst Informat Technol, Hyderabad, Telangana, India
关键词
automatic vowel detection; shout detection; vowel onset point; zero-frequency filtering; dominant frequency; ONSET POINT DETECTION;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Automatic detection of shout prosody in continuous speech signal involves examining changes in its production characteristics. Our recent study of electroglottograph signals highlighted that significant changes occur in the glottal excitation source characteristics during production of shouted speech, especially in the vowel contexts. But the differences between normal and shouted speech, in the production features derived over utterances or word segments, may be masked sometimes by pauses or unvoiced regions related variations. Also, for such a real-time system, these vowel regions need to be found automatically. In this paper, changes in the shout production features are examined in the automatically detected vowel regions. Production of a vowel involves periodic impulse-like excitation and relatively high signal energy. Hence, the knowledge of epochs using zero-frequency filtering, and accurate vowel onset points can be used for detecting these regions. Changes in two excitation source features, the instantaneous fundamental frequency and strength of excitation, and in a vocal tract filter feature the dominant frequency, are examined for five steady vowel regions. Larger changes in these distinguishing features are observed in the automatically found vowel regions, than in word segments. This approach can help improving the systems for automatic detection of shout regions in continuous speech, and in paralinguistic applications that involve detection of prosody or emotions.
引用
收藏
页数:5
相关论文
共 50 条
  • [1] AUTOMATIC DETECTION OF VOWEL CENTERS FROM CONTINUOUS SPEECH.
    Kasuya, Hideki
    Wakita, Hisashi
    Transactions of the Institute of Electronics and Communication Engineers of Japan. Section E, 1981, E64 (10): : 640 - 645
  • [2] An Automatic Shout Detection System Using Speech Production Features
    Mittal, Vinay Kumar
    Yegnanarayana, Bayya
    MULTIMODAL ANALYSES ENABLING ARTIFICIAL AGENTS IN HUMAN-MACHINE INTERACTION, 2015, 8757 : 88 - 98
  • [3] Homomorphic Filtered Spectral Peaks Energy for Automatic Detection of Vowel Onset Point in Continuous Speech
    Zang, Xian
    Chong, Kil To
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2013, E96D (04): : 949 - 956
  • [4] Automatic Emotion Variation Detection in Continuous Speech
    Fan, Yuchao
    Xu, Mingxing
    Wu, Zhiyong
    Cai, Lianhong
    2014 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA), 2014,
  • [5] Automatic Detection of Irregular Phonation in Continuous Speech
    Vishnubhotla, Srikanth
    Espy-Wilson, Carol
    INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 949 - +
  • [6] Automatic Detection of Retroflex Approximants in a Continuous Tamil Speech
    Ramakrishna Thirumuru
    Anil Kumar Vuppala
    Circuits, Systems, and Signal Processing, 2018, 37 : 2837 - 2851
  • [7] AUTOMATIC DETECTION AND DESCRIPTION OF SYLLABIC FEATURES IN CONTINUOUS SPEECH
    DEMORI, R
    LAFACE, P
    PICCOLO, E
    IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1976, 24 (05): : 365 - 379
  • [8] Features for Automatic Detection of Voice Bars in Continuous Speech
    Dhananjaya, N.
    Rajendran, S.
    Yegnanarayana, B.
    INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 1321 - +
  • [9] Automatic Detection of Retroflex Approximants in a Continuous Tamil Speech
    Thirumuru, Ramakrishna
    Vuppala, Anil Kumar
    CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2018, 37 (07) : 2837 - 2851
  • [10] Automatic Detection of Hyperarticulated Speech
    Ribeiro, Eugenio
    Batista, Fernando
    Trancoso, Isabel
    Ribeiro, Ricardo
    de Matos, David Martins
    ADVANCES IN SPEECH AND LANGUAGE TECHNOLOGIES FOR IBERIAN LANGUAGES, IBERSPEECH 2016, 2016, 10077 : 182 - 191