EMOCEPTION: AN INCEPTION INSPIRED EFFICIENT SPEECH EMOTION RECOGNITION NETWORK

被引:0
|
作者
Singh, Chirag [1 ]
Kumar, Abhay [1 ]
Nagar, Ajay [1 ]
Tripathi, Suraj [1 ]
Yenigalla, Promod [1 ]
机构
[1] Samsung R&D Inst India, Bangalore, Karnataka, India
关键词
Speech Emotion Recognition; Inception; Multi-Task Learning; CNN;
D O I
10.1109/asru46091.2019.9004020
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This research proposes a Deep Neural Network architecture for Speech Emotion Recognition called Emoception, which takes inspiration from Inception modules. The network takes speech features like Mel-Frequency Spectral Coefficients (MFSC) or Mel-Frequency Cepstral Coefficients (MFCC) as input and recognizes the relevant emotion in the speech. We use USC-IEMOCAP dataset for training but the limited amount of training data and large depth of the network makes the network prone to overfitting, reducing validation accuracy. The Emoception network overcomes this problem by extending in width without increase in computational cost. We also employ a powerful regularization technique, Multi-Task Learning (MTL) to make the network robust. The model using MFSC input with MTL increases the accuracy by 1.6% vis-a-vis Emoception without MTL. We report an overall accuracy improvement of around 4.6% compared to the existing state-of-art methods for four emotion classes on IEMOCAP dataset.
引用
收藏
页码:787 / 791
页数:5
相关论文
共 50 条
  • [21] A New Network Structure for Speech Emotion Recognition Research
    Xu, Chunsheng
    Liu, Yunqing
    Song, Wenjun
    Liang, Zonglin
    Chen, Xing
    SENSORS, 2024, 24 (05)
  • [22] CONVOLUTIONAL NEURAL NETWORK TECHNIQUES FOR SPEECH EMOTION RECOGNITION
    Parthasarathy, Srinivas
    Tashev, Ivan
    2018 16TH INTERNATIONAL WORKSHOP ON ACOUSTIC SIGNAL ENHANCEMENT (IWAENC), 2018, : 121 - 125
  • [23] Speech Emotion Recognition Based on Deep Neural Network
    Zhu, Zijiang
    Hu, Yi
    Li, Junshan
    Li, Jianjun
    Wang, Junhua
    BASIC & CLINICAL PHARMACOLOGY & TOXICOLOGY, 2020, 126 : 154 - 154
  • [24] Accurate Speech Emotion Recognition by using Brain-Inspired Decision-Making Spiking Neural Network
    Jain, Madhu
    Shukla, Shilpi
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2019, 10 (12) : 370 - 379
  • [25] Inception inspired CNN-GRU hybrid network for human activity recognition
    Dua, Nidhi
    Singh, Shiva Nand
    Semwal, Vijay Bhaskar
    Challa, Sravan Kumar
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (04) : 5369 - 5403
  • [26] Inception inspired CNN-GRU hybrid network for human activity recognition
    Nidhi Dua
    Shiva Nand Singh
    Vijay Bhaskar Semwal
    Sravan Kumar Challa
    Multimedia Tools and Applications, 2023, 82 : 5369 - 5403
  • [27] Quantum-inspired Neural Network for Conversational Emotion Recognition
    Li, Qiuchi
    Gkoumas, Dimitris
    Sordoni, Alessandro
    Nie, Jian-Yun
    Melucci, Massimo
    THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 13270 - 13278
  • [28] An Efficient Graph Learning System for Emotion Recognition Inspired by the Cognitive Prior Graph of EEG Brain Network
    Li, Cunbo
    Tang, Tian
    Pan, Yue
    Yang, Lei
    Zhang, Shuhan
    Chen, Zhaojin
    Li, Peiyang
    Gao, Dongrui
    Chen, Huafu
    Li, Fali
    Yao, Dezhong
    Cao, Zehong
    Xu, Peng
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, : 1 - 15
  • [29] Improvement of Speech Emotion Recognition by Deep Convolutional Neural Network and Speech Features
    Mohanty, Aniruddha
    Cherukuri, Ravindranath C.
    Prusty, Alok Ranjan
    THIRD CONGRESS ON INTELLIGENT SYSTEMS, CIS 2022, VOL 1, 2023, 608 : 117 - 129
  • [30] Improvement Of Speech Emotion Recognition with Neural Network Classifier by Using Speech Spectrogram
    Prasomphan, Sathit
    2015 INTERNATIONAL CONFERENCE ON SYSTEMS, SIGNALS AND IMAGE PROCESSING (IWSSIP 2015), 2015, : 73 - 76