A Novel Optimized Recurrent Network-Based Automatic System for Speech Emotion Identification

被引：5

作者：

Koppula, Neeraja ^{[1
]}

Rao, Koppula Srinivas ^{[2
]}

Nabi, Shaik Abdul ^{[3
]}

Balaram, Allam ^{[4
]}

机构：

[1] Geetanjali Coll Engn & Technol, Dept Comp Sci & Engn, Hyderabad 501301, Telangana, India

[2] MLR Inst Technol, Dept Comp Sci & Engn, Hyderabad 500043, Telangana, India

[3] Sreyas Inst Engn & Technol, Dept Comp Sci & Engn, Hyderabad 500068, Telangana, India

[4] MLR Inst Technol, Dept Informat Technol, Hyderabad 500043, Telangana, India

来源：

WIRELESS PERSONAL COMMUNICATIONS | 2023年 / 128卷 / 03期

关键词：

Firefly algorithm; Recurrent neural network; Speech recognition; Speech emotion identification; Speech signal; RECOGNITION;

D O I：

10.1007/s11277-022-10040-5

中图分类号：

TN [电子技术、通信技术];

学科分类号：

0809 ;

摘要：

Speech is a unique characteristic of humans that expresses one's emotional viewpoint to others. Speech emotion recognition (SER) identifies the speaker's emotion from the speech signal. Nowadays, (SER) plays a vital role in real-time applications such as human-machine interface, lie detection, virtual reality, security, audio mining, etc. But in SER, filtering the noise content and extracting the emotional features is complex. Moreover, incorporating digital filters increases the cost and complexity of the system. Thus, a novel hybrid firefly-based recurrent neural speech recognition (FbRNSR) was developed with preprocessing and a feature analysis module to classify human emotions based on the speech input. The extracted features from the feature extraction module are trained to classify the emotions as happy, sad, or average. Moreover, the incorporation of firefly fitness improves the classification rate. The presented model is executed in Python, and the results are estimated. The performance of the presented approach is analyzed using the confusion matrix. The designed model achieved high true positive rate of 99.34%, true negative rate of 99.12%, false positive of 99.21%, and false negative rate of 99.07%. The designed model achieved 99.2% accuracy, 98.9% recall, and precision value for the speech signal dataset. Finally, the effectiveness and robustness of the proposed approach are proved by comparing it with the existing techniques. Hence, this method is applicable in various sectors such as medicine, security, etc., to identify the state of emotions among the people.

引用

页码：2217 / 2243

页数：27

共 50 条

[31] Advanced recurrent network-based hybrid acoustic models for low resource speech recognition
Jian Kang
Wei-Qiang Zhang
Wei-Wei Liu
Jia Liu
Michael T. Johnson
EURASIP Journal on Audio, Speech, and Music Processing, 2018
[32] Advanced recurrent network-based hybrid acoustic models for low resource speech recognition
Kang, Jian
Zhang, Wei-Qiang
Liu, Wei-Wei
Liu, Jia
Johnson, Michael T.
EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2018,
[33] Low-dimensional recurrent neural network-based Kalman filter for speech enhancement
Xia, Youshen
Wang, Jun
NEURAL NETWORKS, 2015, 67 : 131 - 139
[34] Deep recurrent neural network-based Aquila optimization-based online shaming emotion analysis
Aarthi, B.
Chelliah, Balika J.
Concurrency and Computation: Practice and Experience, 2022, 34 (11)
[35] Deep recurrent neural network-based Aquila optimization-based online shaming emotion analysis
Aarthi, B.
Chelliah, Balika J.
CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2022, 34 (11):
[36] Investigating Modulation Spectrogram Features for Deep Neural Network-based Automatic Speech Recognition
Baby, Deepak
Van Hamme, Hugo
16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 2479 - 2483
[37] A novel convolutional neural network with gated recurrent unit for automated speech emotion recognition and classification
Prakash, P. Ravi
Anuradha, D.
Iqbal, Javid
Galety, Mohammad Gouse
Singh, Ruby
Neelakandan, S.
JOURNAL OF CONTROL AND DECISION, 2023, 10 (01) : 54 - 63
[38] Intrusion Detection System for Network Security Using Novel Adaptive Recurrent Neural Network-Based Fox Optimizer Concept
Manivannan, R.
Senthilkumar, S.
INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE SYSTEMS, 2025, 18 (01)
[39] Speech emotion recognition based on improved masking EMD and convolutional recurrent neural network
Sun, Congshan
Li, Haifeng
Ma, Lin
FRONTIERS IN PSYCHOLOGY, 2023, 13
[40] Speech Emotion Recognition Based on Dual-Channel Convolutional Gated Recurrent Network
Sun, Hanyu
Huang, Lixia
Zhang, Xueying
Li, Juan
Computer Engineering and Applications, 2024, 59 (02) : 170 - 177

← 1 2 3 4 5 →