STUDY OF DENSE NETWORK APPROACHES FOR SPEECH EMOTION RECOGNITION

被引:0
|
作者
Abdelwahab, Mohammed [1 ]
Busso, Carlos [1 ]
机构
[1] Univ Texas Dallas, Dept Elect Comp Engn, Multimodal Signal Proc MSP Lab, Richardson, TX 75080 USA
关键词
Speech emotion recognition; Deep Neural Networks; NEURAL-NETWORKS;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Deep neural networks have been proven to be very effective in various classification problems and show great promise for emotion recognition from speech. Studies have proposed various architectures that further improve the performance of emotion recognition systems. However, there are still various open questions regarding the best approach to building a speech emotion recognition system. Would the system's performance improve if we have more labeled data? How much do we benefit from data augmentation? What activation and regularization schemes are more beneficial? How does the depth of the network affect the performance? We are collecting the MSP-Podcast corpus, a large dataset with over 30 hours of data, which provides an ideal resource to address these questions. This study explores various dense architectures to predict arousal, valence and dominance scores. We investigate varying the training set size, width, and depth of the network, as well as the activation functions used during training. We also study the effect of data augmentation on the network's performance. We find that bigger training set improves the performance. Batch normalization is crucial to achieving a good performance for deeper networks. We do not observe significant differences in the performance in residual networks compared to dense networks.
引用
收藏
页码:5084 / 5088
页数:5
相关论文
共 50 条
  • [31] Temporal Relation Inference Network for Multimodal Speech Emotion Recognition
    Dong, Guan-Nan
    Pun, Chi-Man
    Zhang, Zheng
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (09) : 6472 - 6485
  • [32] A multi-dilated convolution network for speech emotion recognition
    Madanian, Samaneh
    Adeleye, Olayinka
    Templeton, John Michael
    Chen, Talen
    Poellabauer, Christian
    Zhang, Enshi
    Schneider, Sandra L.
    SCIENTIFIC REPORTS, 2025, 15 (01):
  • [33] Attention Based Fully Convolutional Network for Speech Emotion Recognition
    Zhang, Yuanyuan
    Du, Jun
    Wang, Zirui
    Zhang, Jianshu
    Tu, Yanhui
    2018 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2018, : 1771 - 1775
  • [34] Speech Emotion Recognition Using Neural Network and Wavelet Features
    Roy, Tanmoy
    Marwala, Tshilidzi
    Chakraverty, S.
    RECENT TRENDS IN WAVE MECHANICS AND VIBRATIONS, WMVC 2018, 2020, : 427 - 438
  • [35] A Speech Emotion Recognition Method Based on Lightweight Capsule Network
    Wang Y.
    Gao S.
    Dianzi Keji Daxue Xuebao/Journal of the University of Electronic Science and Technology of China, 2023, 52 (03): : 423 - 429
  • [36] An improved speech emotion recognition method based on RepVGG network
    Huang, Chuan-Bao
    Zhu, Kai
    Hu, Zhen
    PROCEEDINGS OF 2024 3RD INTERNATIONAL CONFERENCE ON CRYPTOGRAPHY, NETWORK SECURITY AND COMMUNICATION TECHNOLOGY, CNSCT 2024, 2024, : 451 - 457
  • [37] Relative Speech Emotion Recognition Based Artificial Neural Network
    Fu, Liqin
    Mao, Xia
    Chen, Lijiang
    PACIIA: 2008 PACIFIC-ASIA WORKSHOP ON COMPUTATIONAL INTELLIGENCE AND INDUSTRIAL APPLICATION, VOLS 1-3, PROCEEDINGS, 2008, : 1111 - 1115
  • [38] EMOCEPTION: AN INCEPTION INSPIRED EFFICIENT SPEECH EMOTION RECOGNITION NETWORK
    Singh, Chirag
    Kumar, Abhay
    Nagar, Ajay
    Tripathi, Suraj
    Yenigalla, Promod
    2019 IEEE AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING WORKSHOP (ASRU 2019), 2019, : 787 - 791
  • [39] Bidirectional parallel echo state network for speech emotion recognition
    Ibrahim, Hemin
    Loo, Chu Kiong
    Alnajjar, Fady
    NEURAL COMPUTING & APPLICATIONS, 2022, 34 (20): : 17581 - 17599
  • [40] Speech Emotion Recognition Based on Deep Residual Shrinkage Network
    Han, Tian
    Zhang, Zhu
    Ren, Mingyuan
    Dong, Changchun
    Jiang, Xiaolin
    Zhuang, Quansheng
    ELECTRONICS, 2023, 12 (11)