Fusing traditionally extracted features with deep learned features from the speech spectrogram for anger and stress detection using convolution neural network

被引:0
|
作者
Shalini Kapoor
Tarun Kumar
机构
[1] Research Scholar,Department of Computer Science & Engineering
[2] Dr. A.P.J Abdul Kalam Technical University,undefined
[3] Radha Govind Group of Institution,undefined
来源
关键词
Speech emotion recognition; Convolutional neural networks; Deep learning; Emotion change detection; Spectrograms;
D O I
暂无
中图分类号
学科分类号
摘要
Stress and anger are two negative emotions that affect individuals both mentally and physically; there is a need to tackle them as soon as possible. Automated systems are highly required to monitor mental states and to detect early signs of emotional health issues. In the present work convolutional neural network is proposed for anger and stress detection using handcrafted features and deep learned features from the spectrogram. The objective of using a combined feature set is gathering information from two different representations of speech signals to obtain more prominent features and to boost the accuracy of recognition. The proposed method of emotion assessment is more computationally efficient than similar approaches used for emotion assessment. The preliminary results obtained on experimental evaluation of the proposed approach on three datasets Toronto Emotional Speech Set (TESS), Ryerson Audio-Visual Database of Emotional Speech and Song (RAVDESS), and Berlin Emotional Database (EMO-DB) indicate that categorical accuracy is boosted and cross-entropy loss is reduced to a considerable extent. The proposed convolutional neural network (CNN) obtains training (T) and validation (V) categorical accuracy of T = 93.7%, V = 95.6% for TESS, T = 97.5%, V = 95.6% for EMO-DB and T = 96.7%, V = 96.7% for RAVDESS dataset.
引用
收藏
页码:31107 / 31128
页数:21
相关论文
共 50 条
  • [21] Deep Clustering Network for Steganographer Detection Using Latent Features Extracted from a Novel Convolutional Autoencoder
    Amrutha, E.
    Arivazhagan, S.
    Jebarani, W. Sylvia Lilly
    NEURAL PROCESSING LETTERS, 2023, 55 (03) : 2953 - 2964
  • [22] Deepfake Speech Detection: Approaches from Acoustic Features to Deep Neural Networks
    Unoki, Masashi
    Li, Kai
    Chaiwongyen, Anuwat
    Nguyen, Quoc-Huy
    Zaman, Khalid
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2025, E108D (04) : 300 - 310
  • [23] Combining Information from Multi-Stream Features Using Deep Neural Network in Speech Recognition
    Zhou, Pan
    Dai, Lirong
    Liu, Qingfeng
    Jiang, Hui
    PROCEEDINGS OF 2012 IEEE 11TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING (ICSP) VOLS 1-3, 2012, : 557 - +
  • [24] Improvement of Speech Emotion Recognition by Deep Convolutional Neural Network and Speech Features
    Mohanty, Aniruddha
    Cherukuri, Ravindranath C.
    Prusty, Alok Ranjan
    THIRD CONGRESS ON INTELLIGENT SYSTEMS, CIS 2022, VOL 1, 2023, 608 : 117 - 129
  • [25] NEURAL NETWORK APPROACH TO CLASSIFICATION USING FEATURES EXTRACTED BY MAPPING
    SUN, Y
    ELECTRONICS LETTERS, 1992, 28 (13) : 1263 - 1264
  • [26] Deep Neural Network for Disease Detection in Rice Plant Using the Texture and Deep Features
    Daniya, T.
    Vigneshwari, S.
    COMPUTER JOURNAL, 2022, 65 (07): : 1812 - 1825
  • [27] Deep Convolutional Neural Network-based Speech Signal Enhancement Using Extensive Speech Features
    Garg, Anil
    Sahu, O. P.
    INTERNATIONAL JOURNAL OF COMPUTATIONAL METHODS, 2022, 19 (08)
  • [28] Ensemble Boosted Tree based Mammogram image classification using Texture features and extracted smart features of Deep Neural Network
    Sharma, Bhanu Prakash
    Purwar, Ravindra Kumar
    ADCAIJ-ADVANCES IN DISTRIBUTED COMPUTING AND ARTIFICIAL INTELLIGENCE JOURNAL, 2021, 10 (04): : 419 - 434
  • [29] Deep Features using Convolutional Neural Network for Early Stage Cancer Detection
    Pratiher, Sawon
    Bhattacharya, Shubhobrata
    Mukhopadhyay, Sabyasachi
    Ghosh, Nirmalya
    Pasupuleti, Gautham
    Panigrahi, Prasanta K.
    OPTICS, PHOTONICS, AND DIGITAL TECHNOLOGIES FOR IMAGING APPLICATIONS V, 2018, 10679
  • [30] Detection of Twitter Spam Using GLoVe Vocabulary Features, Bidirectional LSTM and Convolution Neural Network
    Manasa P.
    Malik A.
    Batra I.
    SN Computer Science, 5 (2)