Speaker identification in emotional talking environments based on CSPHMM2s

被引:24
|
作者
Shahin, Ismail [1 ]
机构
[1] Univ Sharjah, Dept Elect & Comp Engn, Sharjah, U Arab Emirates
关键词
Emotional talking environments; Hidden Markov models; Second-order circular suprasegmental hidden Markov models; Speaker identification; Suprasegmental hidden Markov models; RECOGNITION; SPEECH;
D O I
10.1016/j.engappai.2013.03.013
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Speaker recognition systems perform almost ideal in neutral talking environments; however, these systems perform poorly in emotional talking environments. This research is devoted to enhancing the low performance of text-independent and emotion-dependent speaker identification in emotional talking environments based on employing Second-Order Circular Suprasegmental Hidden Markov Models (CSPHMM2s) as classifiers. This work has been tested on our speech database which is composed of 50 speakers talking in six different emotional states. These states are neutral, angry, sad, happy, disgust, and fear. Our results show that the average speaker identification performance in these talking environments based on CSPHMM25 is 81.50% with an improvement rate of 5.61%, 339%, and 3.06% compared, respectively, to First-Order Left-to-Right Suprasegmental Hidden Markov Models (LTRSPHMM1s), Second-Order Left-to-Right Suprasegmental Hidden Markov Models (LTRSPHMM2s), and First-Order Circular Suprasegmental Hidden Markov Models (CSPHMM1s). Our results based on subjective evaluation by human judges fall within 2.26% of those obtained based on CSPHMM2s. (C) 2013 Elsevier Ltd. All rights reserved.
引用
收藏
页码:1652 / 1659
页数:8
相关论文
共 50 条
  • [41] Real-Time Speaker Identification using the AEREAR2 Event-Based Silicon Cochlea
    Li, Cheng-Han
    Delbruck, Tobi
    Liu, Shih-Chii
    2012 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS 2012), 2012, : 1159 - 1162
  • [42] Emirati Speaker Verification Based on HMM1s, HMM2s, and HMM3s
    Shahin, Ismail
    PROCEEDINGS OF 2016 IEEE 13TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING (ICSP 2016), 2016, : 562 - 567
  • [43] A lightweight 2D CNN based approach for speaker-independent emotion recognition from speech with new Indian Emotional Speech Corpora
    Youddha Beer Singh
    Shivani Goel
    Multimedia Tools and Applications, 2023, 82 : 23055 - 23073
  • [44] A lightweight 2D CNN based approach for speaker-independent emotion recognition from speech with new Indian Emotional Speech Corpora
    Singh, Youddha Beer
    Goel, Shivani
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (15) : 23055 - 23073
  • [45] Identification and Height Localization of Sugarcane Tip Bifurcation Points in Complex Environments Based on Improved YOLO v5s
    Li S.
    Bian J.
    Li K.
    Ren H.
    Nongye Jixie Xuebao/Transactions of the Chinese Society for Agricultural Machinery, 2023, 54 (11): : 247 - 258
  • [46] DiffV2S: Diffusion-based Video-to-Speech Synthesis with Vision-guided Speaker Embedding
    Choi, Jeongsoo
    Hong, Joanna
    Ro, Yong Man
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION, ICCV, 2023, : 7778 - 7787
  • [47] Corrosion behaviours two nickel-based coatings in H2S-containing environments
    Li, XF
    SURFACE & COATINGS TECHNOLOGY, 2004, 183 (2-3): : 212 - 215
  • [48] Load Identification Based on S-Transform and (2D)(2)PCA of Transient Current
    Lu Wei
    Cai Zhiqiang
    Chu Jinghui
    LASER & OPTOELECTRONICS PROGRESS, 2018, 55 (08)
  • [49] A novel fluorescent probe based on naphthimide for H2S identification and application
    Zhang, Cheng-lu
    Liu, Chang
    Ding, Yan-wei
    Wang, Hai-tao
    Nie, Shi-ru
    Zhang, Yan-peng
    ANALYTICAL BIOCHEMISTRY, 2023, 677
  • [50] The Corrosion Behavior about two Ni-based alloys in CO2 / H2S Environments
    Zhao, XueHui
    Bai, ZhenQuan
    Lin, Kai
    Han, Yan
    NEW MATERIALS AND ADVANCED MATERIALS, PTS 1 AND 2, 2011, 152-153 : 1624 - 1631