Fusing traditionally extracted features with deep learned features from the speech spectrogram for anger and stress detection using convolution neural network

被引:0
|
作者
Shalini Kapoor
Tarun Kumar
机构
[1] Research Scholar,Department of Computer Science & Engineering
[2] Dr. A.P.J Abdul Kalam Technical University,undefined
[3] Radha Govind Group of Institution,undefined
来源
关键词
Speech emotion recognition; Convolutional neural networks; Deep learning; Emotion change detection; Spectrograms;
D O I
暂无
中图分类号
学科分类号
摘要
Stress and anger are two negative emotions that affect individuals both mentally and physically; there is a need to tackle them as soon as possible. Automated systems are highly required to monitor mental states and to detect early signs of emotional health issues. In the present work convolutional neural network is proposed for anger and stress detection using handcrafted features and deep learned features from the spectrogram. The objective of using a combined feature set is gathering information from two different representations of speech signals to obtain more prominent features and to boost the accuracy of recognition. The proposed method of emotion assessment is more computationally efficient than similar approaches used for emotion assessment. The preliminary results obtained on experimental evaluation of the proposed approach on three datasets Toronto Emotional Speech Set (TESS), Ryerson Audio-Visual Database of Emotional Speech and Song (RAVDESS), and Berlin Emotional Database (EMO-DB) indicate that categorical accuracy is boosted and cross-entropy loss is reduced to a considerable extent. The proposed convolutional neural network (CNN) obtains training (T) and validation (V) categorical accuracy of T = 93.7%, V = 95.6% for TESS, T = 97.5%, V = 95.6% for EMO-DB and T = 96.7%, V = 96.7% for RAVDESS dataset.
引用
收藏
页码:31107 / 31128
页数:21
相关论文
共 50 条
  • [41] Fabric Defect Detection Using Deep Convolution Neural Network
    Fan, Junjun
    Wong, Wai Keung
    Wen, Jiajun
    Gao, Can
    Mo, Dongmei
    Lai, Zhihui
    AATCC JOURNAL OF RESEARCH, 2021, 8 (1_SUPPL) : 144 - 151
  • [42] Speech enhancement from fused features based on deep neural network and gated recurrent unit network
    Wang, Youming
    Han, Jiali
    Zhang, Tianqi
    Qing, Didi
    EURASIP JOURNAL ON ADVANCES IN SIGNAL PROCESSING, 2021, 2021 (01)
  • [43] Speech enhancement from fused features based on deep neural network and gated recurrent unit network
    Youming Wang
    Jiali Han
    Tianqi Zhang
    Didi Qing
    EURASIP Journal on Advances in Signal Processing, 2021
  • [44] Sepsis Detection Using Features Extracted from Photoplethysmography
    Adelucci, Elena
    Falagiani, Martina
    Lombardi, Sara
    Francia, Piergiorgio
    Bocchi, Leonardo
    MEDICON 2023 AND CMBEBIH 2023, VOL 1, 2024, 93 : 636 - 646
  • [45] Differentiation of thyroid nodules on US using features learned and extracted from various convolutional neural networks
    Lee, Eunjung
    Ha, Heonkyu
    Kim, Hye Jung
    Moon, Hee Jung
    Byon, Jung Hee
    Huh, Sun
    Son, Jinwoo
    Yoon, Jiyoung
    Han, Kyunghwa
    Kwak, Jin Young
    SCIENTIFIC REPORTS, 2019, 9 (1)
  • [46] Differentiation of thyroid nodules on US using features learned and extracted from various convolutional neural networks
    Eunjung Lee
    Heonkyu Ha
    Hye Jung Kim
    Hee Jung Moon
    Jung Hee Byon
    Sun Huh
    Jinwoo Son
    Jiyoung Yoon
    Kyunghwa Han
    Jin Young Kwak
    Scientific Reports, 9
  • [47] NEURAL-NETWORK APPROACH FOR CLASSIFICATION USING FEATURES EXTRACTED BY A MAPPING
    SUN, Y
    PATTERN RECOGNITION LETTERS, 1993, 14 (10) : 749 - 752
  • [48] ENTRAINMENT ANALYSIS FOR ASSESSMENT OF AUTISTIC SPEECH PROSODY USING BOTTLENECK FEATURES OF DEEP NEURAL NETWORK
    Ochi, Keiko
    Ono, Nobutaka
    Owada, Keiho
    Kuroda, Miho
    Sagayama, Shigeki
    Yamasue, Hidenori
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 8492 - 8496
  • [49] A Novel Adaptive Affective Cognition Analysis Model for College Students Using a Deep Convolution Neural Network and Deep Features
    Feng, Huali
    COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2022, 2022
  • [50] Dialect Identification in Telugu Language Speech Utterance Using Modified Features with Deep Neural Network
    Satla, Shivaprasad
    Manchala, Sadanandam
    TRAITEMENT DU SIGNAL, 2021, 38 (06) : 1793 - 1799