Emotion Recognition from Human Speech Using Temporal Information and Deep Learning

被引:31
|
作者
Kim, John W. [1 ]
Saurous, Rif A. [2 ]
机构
[1] Menlo Sch, Atherton, CA USA
[2] Google Inc, Mountain View, CA USA
关键词
emotion recognition; temporal information; deep learning; CNN; LSTM;
D O I
10.21437/Interspeech.2018-1132
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Emotion recognition by machine is a challenging task, but it has great potential to make empathic human-machine communications possible. In conventional approaches that consist of feature extraction and classifier stages, extensive studies have devoted their effort to developing good feature representations, but relatively little effort was made to make proper use of the important temporal information in these features. In this paper, we propose a model combining features known to be useful for emotion recognition and deep neural networks to exploit temporal information when recognizing emotion status. A benchmark evaluation on EMO-DB demonstrates that the proposed model achieves a state-of-the-art performance of 88.9% recognition rate.
引用
收藏
页码:937 / 940
页数:4
相关论文
共 50 条
  • [41] LEARNING TEMPORAL INFORMATION FROM SPATIAL INFORMATION USING CAPSNETS FOR HUMAN ACTION RECOGNITION
    Algamdi, Abdullah M.
    Sanchez, Victor
    Li, Chang-Tsun
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 3867 - 3871
  • [42] EMOTION RECOGNITION USING DEEP LEARNING
    Priya, R. N. Beena
    Hanmandlu, M.
    Vasikarla, Shantaram
    2021 IEEE APPLIED IMAGERY PATTERN RECOGNITION WORKSHOP (AIPR), 2021,
  • [43] Deep Multimodal Emotion Recognition on Human Speech: A Review
    Koromilas, Panagiotis
    Giannakopoulos, Theodoros
    APPLIED SCIENCES-BASEL, 2021, 11 (17):
  • [44] Speech Emotion Recognition Using Transfer Learning
    Song, Peng
    Jin, Yun
    Zhao, Li
    Xin, Minghai
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2014, E97D (09): : 2530 - 2532
  • [45] Speech Emotion Classification Using Deep Learning
    Mishra, Siba Prasad
    Warule, Pankaj
    Deb, Suman
    PROCEEDINGS OF 27TH INTERNATIONAL SYMPOSIUM ON FRONTIERS OF RESEARCH IN SPEECH AND MUSIC, FRSM 2023, 2024, 1455 : 19 - 31
  • [46] Emotion Recognition from Children Speech Signals Using Attention Based Time Series Deep Learning
    Cao, Guitao
    Tang, Yunming
    Sheng, Jiyu
    Cao, Wenming
    2019 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2019, : 1296 - 1300
  • [47] Emotion Recognition based on Human Gesture and Speech Information using RT Middleware
    Vu, H. A.
    Yamazaki, Y.
    Dong, F.
    Hirota, K.
    IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS (FUZZ 2011), 2011, : 787 - 791
  • [48] Speech Emotion Recognition using Deep Dropout Autoencoders
    Pal, Arghya
    Baskar, S.
    Baskar, S.
    2015 IEEE INTERNATIONAL CONFERENCE ON ENGINEERING AND TECHNOLOGY (ICETECH), 2015, : 124 - 129
  • [49] Deep learning based Affective Model for Speech Emotion Recognition
    Zhou, Xi
    Guo, Junqi
    Bie, Rongfang
    2016 INT IEEE CONFERENCES ON UBIQUITOUS INTELLIGENCE & COMPUTING, ADVANCED & TRUSTED COMPUTING, SCALABLE COMPUTING AND COMMUNICATIONS, CLOUD AND BIG DATA COMPUTING, INTERNET OF PEOPLE, AND SMART WORLD CONGRESS (UIC/ATC/SCALCOM/CBDCOM/IOP/SMARTWORLD), 2016, : 841 - 846
  • [50] Deep Learning Approach towards Emotion Recognition Based on Speech
    Butala, Padmanabh
    Pawar, Rajendra
    Jadhav, Nagesh
    Kalangan, Manas
    Dhumal, Aniket
    Kakad, Sahil
    JOURNAL OF ADVANCED APPLIED SCIENTIFIC RESEARCH, 2024, 6 (03): : 16 - 24