DESCU: Dyadic emotional speech corpus and recognition system for Urdu language

被引:3
|
作者
Qasim, Muhammad [1 ]
Habib, Tania [1 ]
Urooj, Saba [2 ]
Mumtaz, Benazir [2 ]
机构
[1] Univ Engn & Technol, Dept Comp Engn, Lahore, Pakistan
[2] Univ Engn & Technol, Ctr Language Engn, Lahore, Pakistan
关键词
Speech emotion recognition; Speech databases; Speech processing; Classification; FEATURES;
D O I
10.1016/j.specom.2023.02.002
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Speech signal contains the emotional state of a speaker along with the message. The recognition of the emotional state of a speaker helps in determining the true meaning of a message and allows for more natural communication between humans and machines. This paper presents the design and development of a dyadic emotional speech corpus for the Urdu language. The corpus is developed by recording dialog scenarios for anger, happy, neutral, and sad emotions. The performance of frame-level features, utterance -level features, and spectrograms have been evaluated in this work. Emotion recognition experiments have been conducted using classifiers including Support Vector Machine, Hidden Markov Models and Convolutional Neural Networks. Experimental results show that the utterance-level features outperform the frame-level features and spectrograms. The combined feature set of cepstral, spectral, prosodic, and voice quality features performs better than the individual feature sets. The unweighted average recalls of 84.1%, 80.2%, 84.7% have been achieved for speaker-dependent and speaker-independent and text-independent emotion recognition, respectively.
引用
收藏
页码:40 / 52
页数:13
相关论文
共 50 条
  • [41] Urdu Speech Emotion Recognition: A Systematic Literature Review
    Taj, Soonh
    Mujtaba, Ghulam
    Daudpota, Sher Muhammad
    Mughal, Muhammad Hussain
    ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2023, 22 (07)
  • [42] Speaker Independent Urdu Speech Recognition Using HMM
    Ashraf, Javed
    Iqbal, Naveed
    Khattak, Naveed Sarfraz
    Zaidi, Ather Mohsin
    NATURAL LANGUAGE PROCESSING AND INFORMATION SYSTEMS, 2010, 6177 : 140 - 148
  • [43] Improving Large Vocabulary Urdu Speech Recognition System using Deep Neural Networks
    Farooq, Muhammad Umar
    Adeeba, Farah
    Rauf, Sahar
    Hussain, Sarmad
    INTERSPEECH 2019, 2019, : 2978 - 2982
  • [44] Introducing the Urdu-Sindhi Speech Emotion Corpus: A Novel Dataset of Speech Recordings for Emotion Recognition for Two Low-Resource Languages
    Syed, Zafi Sherhan
    Memon, Sajjad Ali
    Shah, Muhammad Shehram
    Syed, Abbas Shah
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2020, 11 (04) : 805 - 810
  • [45] The Korean Speech Recognition Sentences: A Large Corpus for Evaluating Semantic Context and Language Experience in Speech Perception
    Song, Jieun
    Kim, Byungjun
    Kim, Minjeong
    Iverson, Paul
    JOURNAL OF SPEECH LANGUAGE AND HEARING RESEARCH, 2023, 66 (09): : 3399 - 3412
  • [46] A new emotional corpus for the Romanian Language
    Monica, Feraru Silvia
    Monica, Fira
    Dan, Zbancioc Marius
    2016 13TH INTERNATIONAL CONFERENCE ON DEVELOPMENT AND APPLICATION SYSTEMS (DAS 2016), 2016, : 260 - 263
  • [47] The Makerere Radio Speech Corpus: A Luganda Radio Corpus for Automatic Speech Recognition
    Mukiibi, Jonathan
    Katumba, Andrew
    Nakatumba-Nabende, Joyce
    Hussein, Ali
    Meyer, Josh
    LREC 2022: THIRTEEN INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2022, : 1945 - 1954
  • [48] A computer vision-based system for recognition and classification of Urdu sign language dataset
    Zahid, Hira
    Rashid, Munaf
    Syed, Sidra Abid
    Ullah, Rafi
    Asif, Muhammad
    Khan, Muzammil
    Mujeeb, Amenah Abdul
    Khan, Ali Haider
    PEERJ COMPUTER SCIENCE, 2022, 8
  • [49] Language-Independent Socio-Emotional Role Recognition in the AMI Meetings Corpus
    Valente, Fabio
    Vinciarelli, Alessandro
    12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 3084 - +
  • [50] Forming the set of recognition units for the speech recognition system for the Azerbaijani language
    Abbasov, A.
    Fatullayev, A.
    APPLIED AND COMPUTATIONAL MATHEMATICS, 2007, 6 (02) : 181 - 191