Utterance and Syllable Level Prosodic Features for Automatic Emotion Recognition

被引:0
|
作者
Ben Alex, Starlet [1 ]
Babu, Ben P. [2 ]
Mary, Leena [2 ]
机构
[1] Rajiv Gandhi Inst Technol, Dept Elect & Commun, Kottayam, Kerala, India
[2] Govt Coll Engn, Dept Elect & Commun, Idukki, Kerala, India
关键词
automatic emotion recognition; prosodic features; syllable level segmentation;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper describes an automatic emotion recognition (AER) system that combines prosodic features extracted at utterance level and syllable level to recognize its emotional content. The prosodic features are extracted after identifying speech/non-speech intervals, followed by syllable level segmentation. Prosodic features chosen include parameters for representing dynamics of pitch and energy, along with duration information. Two separate classifiers are built using Deep Neural Networks (DNN). The decision scores based on both levels are fused to identify the emotion of a test utterance from the German Emotion Database (Emo-DB) which contains seven emotions, namely anger, boredom, disgust, fear, happiness, sadness and neutral. The proposed system gives a Weighted Average Recall (WAR) of 58.88% for both utterance level and syllable level prosodic features. Fusion of scores by merely adding the scores gives an overall WAR of 61.68%.
引用
收藏
页码:31 / 35
页数:5
相关论文
共 50 条
  • [41] COMMUNICATING EMOTION - THE ROLE OF PROSODIC FEATURES
    FRICK, RW
    PSYCHOLOGICAL BULLETIN, 1985, 97 (03) : 412 - 429
  • [42] Speech emotion recognition with utterance-level FFT-based spectral features and Its Application to an Interactive Pet Robot
    Huang, Yongming
    Zhang, Guobao
    Da, Feipeng
    2011 INTERNATIONAL CONFERENCE ON COMPUTER, ELECTRICAL, AND SYSTEMS SCIENCES, AND ENGINEERING (CESSE 2011), 2011, : 134 - 137
  • [43] Hierarchical emotion recognition from speech using source, power spectral and prosodic features
    Arijul Haque
    K. Sreenivasa Rao
    Multimedia Tools and Applications, 2024, 83 : 19629 - 19661
  • [44] Hierarchical emotion recognition from speech using source, power spectral and prosodic features
    Haque, Arijul
    Rao, K. Sreenivasa
    MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (07) : 19629 - 19661
  • [45] Emotion recognition from mid-level features
    Sanchez-Mendoza, David
    Masip, David
    Lapedriza, Agata
    PATTERN RECOGNITION LETTERS, 2015, 67 : 66 - 74
  • [46] Class-level spectral features for emotion recognition
    Bitouk, Dmitri
    Verma, Ragini
    Nenkova, Ani
    SPEECH COMMUNICATION, 2010, 52 (7-8) : 613 - 625
  • [47] Automatic speech based emotion recognition using paralinguistics features
    Hook, J.
    Noroozi, F.
    Toygar, O.
    Anbarjafari, G.
    BULLETIN OF THE POLISH ACADEMY OF SCIENCES-TECHNICAL SCIENCES, 2019, 67 (03) : 479 - 488
  • [48] Automatic speech emotion recognition using modulation spectral features
    Wu, Siqing
    Falk, Tiago H.
    Chan, Wai-Yip
    SPEECH COMMUNICATION, 2011, 53 (05) : 768 - 785
  • [49] On the Correlation and Transferability of Features between Automatic Speech Recognition and Speech Emotion Recognition
    Fayek, Haytham M.
    Lech, Margaret
    Cavedon, Lawrence
    17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 3618 - 3622
  • [50] Acoustic-Prosodic Recognition of Emotion in Speech
    Montenegro, Chuchi S.
    Maravillas, Elmer A.
    2015 INTERNATIONAL CONFERENCE ON HUMANOID, NANOTECHNOLOGY, INFORMATION TECHNOLOGY,COMMUNICATION AND CONTROL, ENVIRONMENT AND MANAGEMENT (HNICEM), 2015, : 527 - +