Emotion Recognition from Speech Signals using Excitation Source and Spectral Features

被引:0
|
作者
Choudhury, Akash Roy [1 ]
Ghosh, Anik [1 ]
Pandey, Rahul [1 ]
Barman, Subhas [1 ]
机构
[1] Jalpaiguri Govt Engn Coll, Dept Comp Sci & Engn, Jalpaiguri, W Bengal, India
关键词
Emotion Recognition; Spectral Features; Prosodic Features; Excitation Source Features; SMO; Random Forest; LINEAR PREDICTION; SPEAKER; CLASSIFICATION;
D O I
暂无
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
The task of recognition of emotions from speech signals is one that has been going on for a long time. In the previous works, the dominance of prosodic and spectral features have been observed when it comes to recognition of emotions. But a speech signal also consists of Source level information which gets lost during this process. In this work, we have combined several spectral features with several excitation source features to see how well the model can perform the emotion recognition task. For the task in hand we have taken 3 databases namely, Berlin Emotional Database (Berlin Emo-DB), Surrey Audio-Visual Expressed Emotion (SAVEE) Database and Toronto emotional speech set (TESS) Database. The reason behind taking these databases is that the variation they offer is effective to judge the robustness of the recognition model. We chose Sequential Minimal Optimization (SMO)and Random Forest to perform classification.
引用
收藏
页码:257 / 261
页数:5
相关论文
共 50 条
  • [1] Analysis of Excitation Source Features of Speech for Emotion Recognition
    Kadiri, Sudarsana Reddy
    Gangamohan, P.
    Gangashetty, Suryakanth V.
    Yegnanarayana, B.
    16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 1324 - 1328
  • [2] Hierarchical emotion recognition from speech using source, power spectral and prosodic features
    Arijul Haque
    K. Sreenivasa Rao
    Multimedia Tools and Applications, 2024, 83 : 19629 - 19661
  • [3] Hierarchical emotion recognition from speech using source, power spectral and prosodic features
    Haque, Arijul
    Rao, K. Sreenivasa
    MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (07) : 19629 - 19661
  • [4] Recognition of Emotions from Speech using Excitation Source Features
    Koolagudi, Shashidhar G.
    Devliyal, Swati
    Chawla, Bhavna
    Barthwal, Anurag
    Rao, K. Sreenivasa
    INTERNATIONAL CONFERENCE ON MODELLING OPTIMIZATION AND COMPUTING, 2012, 38 : 3409 - 3417
  • [5] Emotion recognition from speech signals using new harmony features
    Yang, B.
    Lugger, M.
    SIGNAL PROCESSING, 2010, 90 (05) : 1415 - 1423
  • [6] Emotion Recognition from Semi Natural Speech Using Artificial Neural Networks and Excitation Source Features
    Koolagudi, Shashidhar G.
    Devliyal, Swati
    Barthwal, Anurag
    Rao, K. Sreenivasa
    CONTEMPORARY COMPUTING, 2012, 306 : 273 - +
  • [7] Excitation Features of Speech for Emotion Recognition Using Neutral Speech as Reference
    Kadin, Sudarsana Reddy
    Gangamohan, P.
    Gangashetty, Suryakanth, V
    Alku, Paavo
    Yegnanarayana, B.
    CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2020, 39 (09) : 4459 - 4481
  • [8] Excitation Features of Speech for Emotion Recognition Using Neutral Speech as Reference
    Sudarsana Reddy Kadiri
    P. Gangamohan
    Suryakanth V. Gangashetty
    Paavo Alku
    B. Yegnanarayana
    Circuits, Systems, and Signal Processing, 2020, 39 : 4459 - 4481
  • [9] Emotion recognition from speech using source, system, and prosodic features
    Koolagudi, Shashidhar G.
    Rao, K. Sreenivasa
    INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2012, 15 (02) : 265 - 289
  • [10] Significance of incorporating excitation source parameters for improved emotion recognition from speech and electroglottographic signals
    Pravena D.
    Govind D.
    Govind, D. (d_govind@cb.amrita.edu), 1600, Springer Science and Business Media, LLC (20): : 787 - 797