Speaker Identification Based on Robust Sparse Coding with Limited Data

被引:0
|
作者
Wang, Taolin [1 ]
Cheng, Jian [1 ]
机构
[1] Univ Elect Sci & Technol China, Sch Elect Engn, Chengdu 610054, Peoples R China
关键词
speaker identification; limited data; robust sparse coding; GMM supervector;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
The sparse representation classifier has achieved interesting classification results in face recognition. In speaker identification task, we intend to form an over complete dictionary using the GMM supervector for the training data. Then, the sparse representation is shaped as a sparsity-restricted robust regression problem. By supposing that the representation residuary and the representation coefficient are respectively independent, we use robust sparse coding (RSC) based on maximum likelihood estimation (MLE) solution to solve the sparse representation problem. In RSC, the collaborative representation strategy, taking the training utterances from all the extra classes as the nonlocal utterances of one class, is quite suitable for speaker recognition with limited data. Finally, experiments were carried out to evaluate the RSC on the ELSDSR database. The results have shown the performance of the proposed algorithm is much effective than the state-of-the-art methods of speaker identification.
引用
收藏
页码:1611 / 1614
页数:4
相关论文
共 50 条
  • [1] Limited data speaker identification
    H. S. Jayanna
    S. R. Mahadeva Prasanna
    Sadhana, 2010, 35 : 525 - 546
  • [2] Limited data speaker identification
    Jayanna, H. S.
    Prasanna, S. R. Mahadeva
    SADHANA-ACADEMY PROCEEDINGS IN ENGINEERING SCIENCES, 2010, 35 (05): : 525 - 546
  • [3] Noise Robust Speaker Recognition with Convolutive Sparse Coding
    Hurmalainen, Antti
    Saeidi, Rahim
    Virtanen, Tuomas
    16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 244 - 248
  • [4] Sparse Coding Based Lip Texture Representation For Visual Speaker Identification
    Lai, Jun-Yao
    Wang, Shi-Lin
    Shi, Xing-Jian
    Liew, Alan Wee-Chung
    2014 19TH INTERNATIONAL CONFERENCE ON DIGITAL SIGNAL PROCESSING (DSP), 2014, : 607 - 610
  • [5] Robust Speaker Verification With Joint Sparse Coding Over Learned Dictionaries
    Haris, B. C.
    Sinha, Rohit
    IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2015, 10 (10) : 2143 - 2157
  • [6] Speaker Identification Based on Sparse Subspace Model
    Xu, Longting
    Yang, Zhen
    2013 19TH ASIA-PACIFIC CONFERENCE ON COMMUNICATIONS (APCC): SMART COMMUNICATIONS TO ENHANCE THE QUALITY OF LIFE, 2013, : 37 - 41
  • [7] A robust feature based on sparse representation for speaker recognition
    Xie, Yining
    Huang, Jinjie
    Wang, Xinlei
    Journal of Computational Information Systems, 2013, 9 (09): : 3553 - 3561
  • [8] Enrollee-constrained sparse coding of test data for speaker verification
    Kumar, Nagendra
    Sinha, Rohit
    PATTERN RECOGNITION LETTERS, 2018, 116 : 15 - 21
  • [9] Visual speaker identification and authentication by joint spatiotemporal sparse coding and hierarchical pooling
    Lai, Jun-Yao
    Wang, Shi-Lin
    Liew, Alan Wee-Chung
    Shi, Xing-Jian
    INFORMATION SCIENCES, 2016, 373 : 219 - 232
  • [10] Intrinsic Variation Robust Speaker Verification based on Sparse Representation
    Nie, Yi
    Xu, Mingxing
    Xianyu, Haishu
    2014 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA), 2014,