A Novel Optimized Recurrent Network-Based Automatic System for Speech Emotion Identification

被引:5
|
作者
Koppula, Neeraja [1 ]
Rao, Koppula Srinivas [2 ]
Nabi, Shaik Abdul [3 ]
Balaram, Allam [4 ]
机构
[1] Geetanjali Coll Engn & Technol, Dept Comp Sci & Engn, Hyderabad 501301, Telangana, India
[2] MLR Inst Technol, Dept Comp Sci & Engn, Hyderabad 500043, Telangana, India
[3] Sreyas Inst Engn & Technol, Dept Comp Sci & Engn, Hyderabad 500068, Telangana, India
[4] MLR Inst Technol, Dept Informat Technol, Hyderabad 500043, Telangana, India
关键词
Firefly algorithm; Recurrent neural network; Speech recognition; Speech emotion identification; Speech signal; RECOGNITION;
D O I
10.1007/s11277-022-10040-5
中图分类号
TN [电子技术、通信技术];
学科分类号
0809 ;
摘要
Speech is a unique characteristic of humans that expresses one's emotional viewpoint to others. Speech emotion recognition (SER) identifies the speaker's emotion from the speech signal. Nowadays, (SER) plays a vital role in real-time applications such as human-machine interface, lie detection, virtual reality, security, audio mining, etc. But in SER, filtering the noise content and extracting the emotional features is complex. Moreover, incorporating digital filters increases the cost and complexity of the system. Thus, a novel hybrid firefly-based recurrent neural speech recognition (FbRNSR) was developed with preprocessing and a feature analysis module to classify human emotions based on the speech input. The extracted features from the feature extraction module are trained to classify the emotions as happy, sad, or average. Moreover, the incorporation of firefly fitness improves the classification rate. The presented model is executed in Python, and the results are estimated. The performance of the presented approach is analyzed using the confusion matrix. The designed model achieved high true positive rate of 99.34%, true negative rate of 99.12%, false positive of 99.21%, and false negative rate of 99.07%. The designed model achieved 99.2% accuracy, 98.9% recall, and precision value for the speech signal dataset. Finally, the effectiveness and robustness of the proposed approach are proved by comparing it with the existing techniques. Hence, this method is applicable in various sectors such as medicine, security, etc., to identify the state of emotions among the people.
引用
收藏
页码:2217 / 2243
页数:27
相关论文
共 50 条
  • [31] Advanced recurrent network-based hybrid acoustic models for low resource speech recognition
    Jian Kang
    Wei-Qiang Zhang
    Wei-Wei Liu
    Jia Liu
    Michael T. Johnson
    EURASIP Journal on Audio, Speech, and Music Processing, 2018
  • [32] Advanced recurrent network-based hybrid acoustic models for low resource speech recognition
    Kang, Jian
    Zhang, Wei-Qiang
    Liu, Wei-Wei
    Liu, Jia
    Johnson, Michael T.
    EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2018,
  • [33] Low-dimensional recurrent neural network-based Kalman filter for speech enhancement
    Xia, Youshen
    Wang, Jun
    NEURAL NETWORKS, 2015, 67 : 131 - 139
  • [34] Deep recurrent neural network-based Aquila optimization-based online shaming emotion analysis
    Aarthi, B.
    Chelliah, Balika J.
    Concurrency and Computation: Practice and Experience, 2022, 34 (11)
  • [35] Deep recurrent neural network-based Aquila optimization-based online shaming emotion analysis
    Aarthi, B.
    Chelliah, Balika J.
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2022, 34 (11):
  • [36] Investigating Modulation Spectrogram Features for Deep Neural Network-based Automatic Speech Recognition
    Baby, Deepak
    Van Hamme, Hugo
    16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 2479 - 2483
  • [37] A novel convolutional neural network with gated recurrent unit for automated speech emotion recognition and classification
    Prakash, P. Ravi
    Anuradha, D.
    Iqbal, Javid
    Galety, Mohammad Gouse
    Singh, Ruby
    Neelakandan, S.
    JOURNAL OF CONTROL AND DECISION, 2023, 10 (01) : 54 - 63
  • [38] Intrusion Detection System for Network Security Using Novel Adaptive Recurrent Neural Network-Based Fox Optimizer Concept
    Manivannan, R.
    Senthilkumar, S.
    INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE SYSTEMS, 2025, 18 (01)
  • [39] Speech emotion recognition based on improved masking EMD and convolutional recurrent neural network
    Sun, Congshan
    Li, Haifeng
    Ma, Lin
    FRONTIERS IN PSYCHOLOGY, 2023, 13
  • [40] Speech Emotion Recognition Based on Dual-Channel Convolutional Gated Recurrent Network
    Sun, Hanyu
    Huang, Lixia
    Zhang, Xueying
    Li, Juan
    Computer Engineering and Applications, 2024, 59 (02) : 170 - 177