Speaker identification by anchor models with PCA/LDA post-processing

Authors
Mami, Y [1]
Charlet, D [1]
Affiliation
[1] France Telecom, R&D, DIH IPS, F-22307 Lannion, France
DOI
Not available
Chinese Library Classification number
O42 [Acoustics]
Discipline classification code
070206; 082403
Abstract
Speaker representation by location is a new technique for speaker recognition and adaptation. It consists of representing a new speaker not in absolute terms, but relative to a set of well-trained speaker models. Each new speaker is represented by its location in an optimal representation space. This paper addresses the location task. It describes a representation space built either by clustering speakers or by selecting an optimal subset of them. In this representation space, speaker location is then performed with the anchor models technique to find the speaker's vector of coordinates. An orthogonalization process is then applied to the vector of coordinates so that distances can be computed properly. This orthogonalization process (PCA or LDA) is shown experimentally to improve recognition significantly.
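The abstract does not give implementation details, so the following is only a minimal sketch of the general idea, under stated assumptions: each speaker is described by a vector of coordinates (here, hypothetical scores against 8 anchor models, simulated with random numbers), the vectors are projected onto an orthogonal PCA basis, and identification picks the enrolled speaker at the smallest Euclidean distance. The function names, the number of anchors, and the number of retained components are illustrative choices, not taken from the paper, and the LDA variant is not shown.

```python
import numpy as np

def fit_pca(X, n_components):
    """Return the mean of X and an orthonormal PCA basis (one column per component)."""
    mean = X.mean(axis=0)
    # Rows of Vt are the principal directions of the centered data.
    _, _, Vt = np.linalg.svd(X - mean, full_matrices=False)
    return mean, Vt[:n_components].T

def project(X, mean, basis):
    """Express coordinate vectors in the orthogonal PCA basis."""
    return (X - mean) @ basis

def identify(test_vec, enrolled, mean, basis):
    """Return the index of the enrolled speaker closest to test_vec in PCA space."""
    diffs = project(enrolled, mean, basis) - project(test_vec, mean, basis)
    return int(np.argmin(np.linalg.norm(diffs, axis=1)))

# Simulated data (assumption): 5 enrolled speakers, 8 anchor models.
rng = np.random.default_rng(0)
n_speakers, n_anchors = 5, 8
enrolled = rng.normal(size=(n_speakers, n_anchors))

# A test utterance from speaker 3: the enrolled vector plus small noise.
test_vec = enrolled[3] + 0.01 * rng.normal(size=n_anchors)

mean, basis = fit_pca(enrolled, n_components=4)
predicted = identify(test_vec, enrolled, mean, basis)
print("predicted speaker index:", predicted)
```

In a real system the coordinate vectors would come from scoring speech against trained anchor models rather than from a random generator, and the PCA basis would typically be estimated on a larger set of speakers than the enrolled ones.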
Pages: 180-183
Number of pages: 4