Speaker Identification Based on Robust Sparse Coding with Limited Data

被引:0
|
作者
Wang, Taolin [1 ]
Cheng, Jian [1 ]
机构
[1] Univ Elect Sci & Technol China, Sch Elect Engn, Chengdu 610054, Peoples R China
关键词
speaker identification; limited data; robust sparse coding; GMM supervector;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
The sparse representation classifier has achieved interesting classification results in face recognition. In speaker identification task, we intend to form an over complete dictionary using the GMM supervector for the training data. Then, the sparse representation is shaped as a sparsity-restricted robust regression problem. By supposing that the representation residuary and the representation coefficient are respectively independent, we use robust sparse coding (RSC) based on maximum likelihood estimation (MLE) solution to solve the sparse representation problem. In RSC, the collaborative representation strategy, taking the training utterances from all the extra classes as the nonlocal utterances of one class, is quite suitable for speaker recognition with limited data. Finally, experiments were carried out to evaluate the RSC on the ELSDSR database. The results have shown the performance of the proposed algorithm is much effective than the state-of-the-art methods of speaker identification.
引用
收藏
页码:1611 / 1614
页数:4
相关论文
共 50 条
  • [21] Exploring Data-Independent Dimensionality Reduction in Sparse Representation-Based Speaker Identification
    Haris, B. C.
    Sinha, Rohit
    CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2014, 33 (08) : 2521 - 2538
  • [22] Exploring Data-Independent Dimensionality Reduction in Sparse Representation-Based Speaker Identification
    B. C. Haris
    Rohit Sinha
    Circuits, Systems, and Signal Processing, 2014, 33 : 2521 - 2538
  • [23] Auditory Sparse Representation for Robust Speaker Recognition Based on Tensor Structure
    Qiang Wu
    Liqing Zhang
    EURASIP Journal on Audio, Speech, and Music Processing, 2008
  • [24] Auditory Sparse Representation for Robust Speaker Recognition Based on Tensor Structure
    Wu, Qiang
    Zhang, Liqing
    EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2008, 2008 (1)
  • [25] Noise-robust feature based on sparse representation for speaker recognition
    Qi, Hongzhuo
    Metallurgical and Mining Industry, 2015, 7 (04): : 64 - 69
  • [26] A robust speaker identification system based on wavelet transform
    Hsieh, CT
    Wang, YC
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2001, E84D (07): : 839 - 846
  • [27] A robust sparse auditory feature for speaker verification
    Han, J. (jqhan@hit.edu.cn), 1600, Binary Information Press, P.O. Box 162, Bethel, CT 06801-0162, United States (09):
  • [28] Robust speaker identification and verification
    Wang, Jia-Ching
    Yang, Chung-Hsien
    Wang, Jhing-Fa
    Lee, Hsiao-Ping
    IEEE COMPUTATIONAL INTELLIGENCE MAGAZINE, 2007, 2 (02) : 52 - 59
  • [29] Adaptive and Robust Sparse Coding for Laser Range Data Denoising and Inpainting
    Gao, Zhi
    Li, Qingquan
    Zhai, Ruifang
    Shan, Mo
    Lin, Feng
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2016, 26 (12) : 2165 - 2175
  • [30] Robust identification from band-limited data
    Baratchart, L
    Leblond, J
    Partington, JR
    Torkhani, N
    IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 1997, 42 (09) : 1318 - 1325