Speaker Identification Based on Robust Sparse Coding with Limited Data

被引：0

作者：

Wang, Taolin ^{[1
]}

Cheng, Jian ^{[1
]}

机构：

[1] Univ Elect Sci & Technol China, Sch Elect Engn, Chengdu 610054, Peoples R China

来源：

2012 5TH INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING (CISP) | 2012年

关键词：

speaker identification; limited data; robust sparse coding; GMM supervector;

D O I：

暂无

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

The sparse representation classifier has achieved interesting classification results in face recognition. In speaker identification task, we intend to form an over complete dictionary using the GMM supervector for the training data. Then, the sparse representation is shaped as a sparsity-restricted robust regression problem. By supposing that the representation residuary and the representation coefficient are respectively independent, we use robust sparse coding (RSC) based on maximum likelihood estimation (MLE) solution to solve the sparse representation problem. In RSC, the collaborative representation strategy, taking the training utterances from all the extra classes as the nonlocal utterances of one class, is quite suitable for speaker recognition with limited data. Finally, experiments were carried out to evaluate the RSC on the ELSDSR database. The results have shown the performance of the proposed algorithm is much effective than the state-of-the-art methods of speaker identification.

引用

页码：1611 / 1614

页数：4

共 50 条

[1] Limited data speaker identification
H. S. Jayanna
S. R. Mahadeva Prasanna
Sadhana, 2010, 35 : 525 - 546
[2] Limited data speaker identification
Jayanna, H. S.
Prasanna, S. R. Mahadeva
SADHANA-ACADEMY PROCEEDINGS IN ENGINEERING SCIENCES, 2010, 35 (05): : 525 - 546
[3] Noise Robust Speaker Recognition with Convolutive Sparse Coding
Hurmalainen, Antti
Saeidi, Rahim
Virtanen, Tuomas
16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 244 - 248
[4] Sparse Coding Based Lip Texture Representation For Visual Speaker Identification
Lai, Jun-Yao
Wang, Shi-Lin
Shi, Xing-Jian
Liew, Alan Wee-Chung
2014 19TH INTERNATIONAL CONFERENCE ON DIGITAL SIGNAL PROCESSING (DSP), 2014, : 607 - 610
[5] Robust Speaker Verification With Joint Sparse Coding Over Learned Dictionaries
Haris, B. C.
Sinha, Rohit
IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2015, 10 (10) : 2143 - 2157
[6] Speaker Identification Based on Sparse Subspace Model
Xu, Longting
Yang, Zhen
2013 19TH ASIA-PACIFIC CONFERENCE ON COMMUNICATIONS (APCC): SMART COMMUNICATIONS TO ENHANCE THE QUALITY OF LIFE, 2013, : 37 - 41
[7] A robust feature based on sparse representation for speaker recognition
Xie, Yining
Huang, Jinjie
Wang, Xinlei
Journal of Computational Information Systems, 2013, 9 (09): : 3553 - 3561
[8] Enrollee-constrained sparse coding of test data for speaker verification
Kumar, Nagendra
Sinha, Rohit
PATTERN RECOGNITION LETTERS, 2018, 116 : 15 - 21
[9] Visual speaker identification and authentication by joint spatiotemporal sparse coding and hierarchical pooling
Lai, Jun-Yao
Wang, Shi-Lin
Liew, Alan Wee-Chung
Shi, Xing-Jian
INFORMATION SCIENCES, 2016, 373 : 219 - 232
[10] Intrinsic Variation Robust Speaker Verification based on Sparse Representation
Nie, Yi
Xu, Mingxing
Xianyu, Haishu
2014 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA), 2014,

← 1 2 3 4 5 →