DESCU: Dyadic emotional speech corpus and recognition system for Urdu language

被引：3

作者：

Qasim, Muhammad ^{[1
]}

Habib, Tania ^{[1
]}

Urooj, Saba ^{[2
]}

Mumtaz, Benazir ^{[2
]}

机构：

[1] Univ Engn & Technol, Dept Comp Engn, Lahore, Pakistan

[2] Univ Engn & Technol, Ctr Language Engn, Lahore, Pakistan

来源：

SPEECH COMMUNICATION | 2023年 / 148卷

关键词：

Speech emotion recognition; Speech databases; Speech processing; Classification; FEATURES;

D O I：

10.1016/j.specom.2023.02.002

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

Speech signal contains the emotional state of a speaker along with the message. The recognition of the emotional state of a speaker helps in determining the true meaning of a message and allows for more natural communication between humans and machines. This paper presents the design and development of a dyadic emotional speech corpus for the Urdu language. The corpus is developed by recording dialog scenarios for anger, happy, neutral, and sad emotions. The performance of frame-level features, utterance -level features, and spectrograms have been evaluated in this work. Emotion recognition experiments have been conducted using classifiers including Support Vector Machine, Hidden Markov Models and Convolutional Neural Networks. Experimental results show that the utterance-level features outperform the frame-level features and spectrograms. The combined feature set of cepstral, spectral, prosodic, and voice quality features performs better than the individual feature sets. The unweighted average recalls of 84.1%, 80.2%, 84.7% have been achieved for speaker-dependent and speaker-independent and text-independent emotion recognition, respectively.

引用

页码：40 / 52

页数：13

共 50 条

[41] Urdu Speech Emotion Recognition: A Systematic Literature Review
Taj, Soonh
Mujtaba, Ghulam
Daudpota, Sher Muhammad
Mughal, Muhammad Hussain
ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2023, 22 (07)
[42] Speaker Independent Urdu Speech Recognition Using HMM
Ashraf, Javed
Iqbal, Naveed
Khattak, Naveed Sarfraz
Zaidi, Ather Mohsin
NATURAL LANGUAGE PROCESSING AND INFORMATION SYSTEMS, 2010, 6177 : 140 - 148
[43] Improving Large Vocabulary Urdu Speech Recognition System using Deep Neural Networks
Farooq, Muhammad Umar
Adeeba, Farah
Rauf, Sahar
Hussain, Sarmad
INTERSPEECH 2019, 2019, : 2978 - 2982
[44] Introducing the Urdu-Sindhi Speech Emotion Corpus: A Novel Dataset of Speech Recordings for Emotion Recognition for Two Low-Resource Languages
Syed, Zafi Sherhan
Memon, Sajjad Ali
Shah, Muhammad Shehram
Syed, Abbas Shah
INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2020, 11 (04) : 805 - 810
[45] The Korean Speech Recognition Sentences: A Large Corpus for Evaluating Semantic Context and Language Experience in Speech Perception
Song, Jieun
Kim, Byungjun
Kim, Minjeong
Iverson, Paul
JOURNAL OF SPEECH LANGUAGE AND HEARING RESEARCH, 2023, 66 (09): : 3399 - 3412
[46] A new emotional corpus for the Romanian Language
Monica, Feraru Silvia
Monica, Fira
Dan, Zbancioc Marius
2016 13TH INTERNATIONAL CONFERENCE ON DEVELOPMENT AND APPLICATION SYSTEMS (DAS 2016), 2016, : 260 - 263
[47] The Makerere Radio Speech Corpus: A Luganda Radio Corpus for Automatic Speech Recognition
Mukiibi, Jonathan
Katumba, Andrew
Nakatumba-Nabende, Joyce
Hussein, Ali
Meyer, Josh
LREC 2022: THIRTEEN INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2022, : 1945 - 1954
[48] A computer vision-based system for recognition and classification of Urdu sign language dataset
Zahid, Hira
Rashid, Munaf
Syed, Sidra Abid
Ullah, Rafi
Asif, Muhammad
Khan, Muzammil
Mujeeb, Amenah Abdul
Khan, Ali Haider
PEERJ COMPUTER SCIENCE, 2022, 8
[49] Language-Independent Socio-Emotional Role Recognition in the AMI Meetings Corpus
Valente, Fabio
Vinciarelli, Alessandro
12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 3084 - +
[50] Forming the set of recognition units for the speech recognition system for the Azerbaijani language
Abbasov, A.
Fatullayev, A.
APPLIED AND COMPUTATIONAL MATHEMATICS, 2007, 6 (02) : 181 - 191

← 1 2 3 4 5 →