Speaker Identification Based on Robust Sparse Coding with Limited Data

被引:0
|
作者
Wang, Taolin [1 ]
Cheng, Jian [1 ]
机构
[1] Univ Elect Sci & Technol China, Sch Elect Engn, Chengdu 610054, Peoples R China
来源
2012 5TH INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING (CISP) | 2012年
关键词
speaker identification; limited data; robust sparse coding; GMM supervector;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
The sparse representation classifier has achieved interesting classification results in face recognition. In speaker identification task, we intend to form an over complete dictionary using the GMM supervector for the training data. Then, the sparse representation is shaped as a sparsity-restricted robust regression problem. By supposing that the representation residuary and the representation coefficient are respectively independent, we use robust sparse coding (RSC) based on maximum likelihood estimation (MLE) solution to solve the sparse representation problem. In RSC, the collaborative representation strategy, taking the training utterances from all the extra classes as the nonlocal utterances of one class, is quite suitable for speaker recognition with limited data. Finally, experiments were carried out to evaluate the RSC on the ELSDSR database. The results have shown the performance of the proposed algorithm is much effective than the state-of-the-art methods of speaker identification.
引用
收藏
页码:1611 / 1614
页数:4
相关论文
共 50 条
  • [41] Robust sparse coding for subspace learning
    School of Three Gorges Artificial Intelligence, Chongqing Three Gorges University, Wanzhou, Chongqing
    404100, China
    Ital. J. Pure Appl. Math., 2020, (986-994):
  • [42] Improved MFCC-Based Feature for Robust Speaker Identification
    吴尊敬
    曹志刚
    TsinghuaScienceandTechnology, 2005, (02) : 158 - 161
  • [43] Robust Access based on Speaker Identification for Optical Communications Security
    Zao, L.
    Alcaim, A.
    Coelho, R.
    2009 16TH INTERNATIONAL CONFERENCE ON DIGITAL SIGNAL PROCESSING, VOLS 1 AND 2, 2009, : 770 - +
  • [44] Robust sparse coding for subspace learning
    Dai, Xiangguang
    Tao, Yingyin
    Xiong, Jiang
    Feng, Yuming
    ITALIAN JOURNAL OF PURE AND APPLIED MATHEMATICS, 2020, (44): : 986 - 994
  • [45] Robust Sparse Coding for Face Recognition
    Yang, Meng
    Zhang, Lei
    Yang, Jian
    Zhang, David
    2011 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2011, : 625 - 632
  • [46] Speaker Identification based on Robust AM-FM Features
    Deshpande, Mangesh S.
    Holambe, Raghunath S.
    2009 SECOND INTERNATIONAL CONFERENCE ON EMERGING TRENDS IN ENGINEERING AND TECHNOLOGY (ICETET 2009), 2009, : 62 - +
  • [47] PSO Based Optimized Reliability for Robust Multimodal Speaker Identification
    Tariquzzaman, Md.
    Kim, Jin Young
    Na, Seung You
    CISST'10: PROCEEDINGS OF THE 4TH WSEAS INTERNATIONAL CONFERENCE ON CIRCUITS, SYSTEMS, SIGNAL AND TELECOMMUNICATIONS, 2009, : 157 - 162
  • [48] Speaker adaptations in sparse training data for improved speaker verification
    Ahn, S
    Ko, H
    ELECTRONICS LETTERS, 2000, 36 (04) : 371 - 373
  • [49] Robust sparse coding via self-paced learning for data representation
    Feng, Xiaodong
    Wu, Sen
    INFORMATION SCIENCES, 2021, 546 : 448 - 468
  • [50] Speaker identification based on the frame linear predictive coding spectrum technique
    Wu, Jian-Da
    Lin, Bing-Fu
    EXPERT SYSTEMS WITH APPLICATIONS, 2009, 36 (04) : 8056 - 8063