DESCU: Dyadic emotional speech corpus and recognition system for Urdu language

Cited by: 3
Authors
Qasim, Muhammad [1]
Habib, Tania [1]
Urooj, Saba [2]
Mumtaz, Benazir [2]
Affiliations
[1] Univ Engn & Technol, Dept Comp Engn, Lahore, Pakistan
[2] Univ Engn & Technol, Ctr Language Engn, Lahore, Pakistan
Keywords
Speech emotion recognition; Speech databases; Speech processing; Classification; FEATURES
DOI
10.1016/j.specom.2023.02.002
Chinese Library Classification
O42 [Acoustics]
Discipline classification codes
070206; 082403
Abstract
A speech signal carries the emotional state of the speaker along with the message. Recognizing the speaker's emotional state helps in determining the true meaning of a message and allows for more natural communication between humans and machines. This paper presents the design and development of a dyadic emotional speech corpus for the Urdu language. The corpus is developed by recording dialog scenarios for anger, happy, neutral, and sad emotions. The performance of frame-level features, utterance-level features, and spectrograms has been evaluated in this work. Emotion recognition experiments have been conducted using classifiers including Support Vector Machines, Hidden Markov Models, and Convolutional Neural Networks. Experimental results show that the utterance-level features outperform the frame-level features and spectrograms. The combined feature set of cepstral, spectral, prosodic, and voice quality features performs better than the individual feature sets. Unweighted average recalls of 84.1%, 80.2%, and 84.7% have been achieved for speaker-dependent, speaker-independent, and text-independent emotion recognition, respectively.
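The abstract reports results as unweighted average recall (UAR) without defining it. Assuming the standard definition (the mean of per-class recalls, so each emotion counts equally regardless of how many utterances it has), a minimal sketch is shown here. The function name and the toy labels are illustrative assumptions and are not taken from the paper or the corpus.

```python
from collections import defaultdict


def unweighted_average_recall(y_true, y_pred):
    """Mean of per-class recalls; every class counts equally regardless of size."""
    total = defaultdict(int)    # number of true samples per class
    correct = defaultdict(int)  # number of correctly predicted samples per class
    for t, p in zip(y_true, y_pred):
        total[t] += 1
        if t == p:
            correct[t] += 1
    recalls = [correct[c] / total[c] for c in total]
    return sum(recalls) / len(recalls)


# Toy labels for the four emotion classes (hypothetical, not corpus data).
y_true = ["anger", "anger", "happy", "happy", "neutral", "neutral", "sad", "sad"]
y_pred = ["anger", "happy", "happy", "happy", "neutral", "sad", "sad", "sad"]
uar = unweighted_average_recall(y_true, y_pred)
print(f"UAR = {uar:.3f}")
```

Unlike plain accuracy, this measure does not reward a classifier for favoring the most frequent class, which is why it is commonly used when emotion classes are imbalanced.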
Pages: 40-52
Page count: 13