Analysis of Speaker Recognition in Blended Emotional Environment Using Deep Learning Approaches

Citations: 1
Authors
Tomar, Shalini [1]
Koolagudi, Shashidhar G. [1]
Affiliations
[1] Natl Inst Technol, Dept Comp Sci & Engn, Mangalore, Karnataka, India
Keywords
Blended emotion; Mel Frequency Cepstral Coefficients; Convolutional Neural Network; Speaker Recognition; Speaker Recognition in Blended Emotion Environment; Valence
DOI
10.1007/978-3-031-45170-6_72
Chinese Library Classification
TP18 [Artificial Intelligence Theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Human conversation generally carries some emotion, and natural emotions are often blended. Today's Speaker Recognition (SR) systems do not account for emotion. This work proposes a Speaker Recognition in Blended Emotion Environment (SRBEE) system to improve SR in an emotional context. SR algorithms nearly always achieve perfect performance on neutral speech, but this does not hold for emotional speech. This work attempts to recognize speakers in blended emotion using Mel-Frequency Cepstral Coefficients (MFCC) feature extraction and a Conv2D classifier. In a blended emotional environment, calculating the accuracy of the SR task is complex. Utterances blending the four basic natural emotions (happy, sad, angry, and fearful) are tested in the proposed system to reduce the complexity of SR in a blended emotional environment. The proposed system achieves an average accuracy of 99.3% for blended emotion with neutral speech and 92.8% for the four basic blended natural emotions (happy, sad, angry, and fearful). The dataset was prepared by blending two emotions in one utterance.
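The abstract names the pipeline (blend two emotions in one utterance, extract MFCC features, classify with a Conv2D network) but gives no parameters. The following is only a minimal NumPy sketch of the first two steps under stated assumptions: the blending rule (first half of one utterance joined to the second half of another), the 16 kHz sample rate, the frame sizes, and the function names `blend` and `mfcc` are all hypothetical and are not taken from the paper. Random noise stands in for recorded speech.

```python
import numpy as np

def hz_to_mel(f):
    return 2595.0 * np.log10(1.0 + f / 700.0)

def mel_to_hz(m):
    return 700.0 * (10.0 ** (m / 2595.0) - 1.0)

def mel_filterbank(n_filters, n_fft, sr):
    # Triangular filters spaced evenly on the mel scale from 0 Hz to sr/2.
    mel_pts = np.linspace(hz_to_mel(0.0), hz_to_mel(sr / 2.0), n_filters + 2)
    bins = np.floor((n_fft + 1) * mel_to_hz(mel_pts) / sr).astype(int)
    fb = np.zeros((n_filters, n_fft // 2 + 1))
    for i in range(1, n_filters + 1):
        left, center, right = bins[i - 1], bins[i], bins[i + 1]
        for k in range(left, center):
            fb[i - 1, k] = (k - left) / max(center - left, 1)
        for k in range(center, right):
            fb[i - 1, k] = (right - k) / max(right - center, 1)
    return fb

def mfcc(signal, sr=16000, n_fft=512, hop=160, n_filters=26, n_coeffs=13):
    # Split the signal into overlapping frames and apply a Hamming window.
    n_frames = 1 + (len(signal) - n_fft) // hop
    idx = np.arange(n_fft)[None, :] + hop * np.arange(n_frames)[:, None]
    frames = signal[idx] * np.hamming(n_fft)
    # Power spectrum, then log mel-band energies.
    power = np.abs(np.fft.rfft(frames, n_fft)) ** 2 / n_fft
    log_mel = np.log(power @ mel_filterbank(n_filters, n_fft, sr).T + 1e-10)
    # Unnormalized DCT-II over the mel bands; keep the first n_coeffs.
    n = np.arange(n_filters)
    q = np.arange(n_coeffs)[:, None]
    basis = np.cos(np.pi * q * (2 * n + 1) / (2 * n_filters))
    return log_mel @ basis.T  # shape: (n_frames, n_coeffs)

def blend(utt_a, utt_b):
    # ASSUMPTION: one possible reading of "two emotions in one utterance";
    # the paper's actual blending procedure is not given in the abstract.
    return np.concatenate([utt_a[: len(utt_a) // 2], utt_b[len(utt_b) // 2:]])

# Placeholder signals (1 s each at 16 kHz) standing in for two emotional utterances.
rng = np.random.default_rng(0)
happy = rng.standard_normal(16000)
sad = rng.standard_normal(16000)

features = mfcc(blend(happy, sad))
# A 2-D convolutional classifier would typically take this matrix with added
# batch and channel axes, e.g. features[None, ..., None].
print(features.shape)
```

The resulting frames-by-coefficients matrix is what makes a 2-D convolution applicable: the network can treat it like a single-channel image, with time on one axis and cepstral index on the other.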
Pages: 691-698
Page count: 8
Related papers
  • [1] Speaker Recognition with Deep Learning Approaches: A Review
    Alenizi, Abdulrahman S.
    Al-Karawi, Khamis A.
    PROCEEDINGS OF NINTH INTERNATIONAL CONGRESS ON INFORMATION AND COMMUNICATION TECHNOLOGY, VOL 5, ICICT 2024, 2024, 1000 : 481 - 499
  • [2] Speaker Recognition in Emotional Environment
    Koolagudi, Shashidhar G.
    Sharma, Kritika
    Rao, K. Sreenivasa
    ECO-FRIENDLY COMPUTING AND COMMUNICATION SYSTEMS, 2012, 305 : 117 - +
  • [3] Speaker Recognition Systems in the Emotional Environment
    Shahin, Ismail
    2008 3RD INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATION TECHNOLOGIES: FROM THEORY TO APPLICATIONS, VOLS 1-5, 2008, : 652 - 656
  • [4] A deep learning approach for speaker recognition
    Hourri, Soufiane
    Kharroubi, Jamal
    INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2020, 23 (01) : 123 - 131
  • [5] A deep learning approach for speaker recognition
    Soufiane Hourri
    Jamal Kharroubi
    International Journal of Speech Technology, 2020, 23 : 123 - 131
  • [6] A DISCRIMINATIVE UNSUPERVISED METHOD FOR SPEAKER RECOGNITION USING DEEP LEARNING
    Saleem, Muhammad Muneeb
    Hansen, John H. L.
    2016 IEEE 26TH INTERNATIONAL WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING (MLSP), 2016,
  • [7] A comparative study in emotional speaker recognition in noisy environment
    Mansour, Asma
    Lachiri, Zied
    2017 IEEE/ACS 14TH INTERNATIONAL CONFERENCE ON COMPUTER SYSTEMS AND APPLICATIONS (AICCSA), 2017, : 980 - 986
  • [8] Deep Learning Analysis Models for Speech and Emotional Recognition
    Wu, Jun
    Zhu, Tianliang
    Yu, Chengtian
    Wang, Chunzhi
    Zhou, Xianjing
    Liu, Hu
    2021 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2021, : 1541 - 1545
  • [9] Automatic Speaker Recognition using Transfer Learning Approach of Deep Learning Models
    Ganvir, Sonal
    Lal, Nidhi
    PROCEEDINGS OF THE 6TH INTERNATIONAL CONFERENCE ON INVENTIVE COMPUTATION TECHNOLOGIES (ICICT 2021), 2021, : 595 - 601
  • [10] Approaches to learning in a blended learning environment: preliminary results
    Bralic, Antonia
    2018 41ST INTERNATIONAL CONVENTION ON INFORMATION AND COMMUNICATION TECHNOLOGY, ELECTRONICS AND MICROELECTRONICS (MIPRO), 2018, : 777 - 782