Speaker identification in emotional talking environments based on CSPHMM2s

Cited by: 24

Authors:

Shahin, Ismail [1]

Affiliations:

[1] Univ Sharjah, Dept Elect & Comp Engn, Sharjah, U Arab Emirates

Source:

ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE | 2013 / Vol. 26 / Issue 07

Keywords:

Emotional talking environments; Hidden Markov models; Second-order circular suprasegmental hidden Markov models; Speaker identification; Suprasegmental hidden Markov models; RECOGNITION; SPEECH;

DOI:

10.1016/j.engappai.2013.03.013

Chinese Library Classification (CLC) number:

TP [Automation technology, computer technology];

Discipline classification code:

0812;

Abstract:

Speaker recognition systems perform almost ideally in neutral talking environments; however, they perform poorly in emotional talking environments. This research aims to improve the low performance of text-independent, emotion-dependent speaker identification in emotional talking environments by employing Second-Order Circular Suprasegmental Hidden Markov Models (CSPHMM2s) as classifiers. The approach was tested on our speech database, which comprises 50 speakers talking in six emotional states: neutral, angry, sad, happy, disgust, and fear. Our results show that the average speaker identification performance in these talking environments based on CSPHMM2s is 81.50%, with an improvement rate of 5.61%, 3.39%, and 3.06% compared, respectively, to First-Order Left-to-Right Suprasegmental Hidden Markov Models (LTRSPHMM1s), Second-Order Left-to-Right Suprasegmental Hidden Markov Models (LTRSPHMM2s), and First-Order Circular Suprasegmental Hidden Markov Models (CSPHMM1s). Results from subjective evaluation by human judges fall within 2.26% of those obtained with CSPHMM2s. (C) 2013 Elsevier Ltd. All rights reserved.

Pages: 1652-1659

Page count: 8
