Speaker identification by anchor models with PCA/LDA post-processing

被引:0
|
作者
Mami, Y [1 ]
Charlet, D [1 ]
机构
[1] France Telecom, R&D, DIH IPS, F-22307 Lannion, France
关键词
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Speaker representation by location is a new technique of speaker recognition and adaptation. It consists in representing a new speaker not in an absolute manner, but relatively to a set of well trained speaker models. Each new speaker is represented by its location in an optimal representation space. This paper addresses the location task. It describes a representation space built either by clustering speakers or by selecting an optimal subset of them. In this representation space, speaker location is then performed by the anchor models technique to find vector of coordinates. An orthogonalization process is then applied to the vector of coordinates, so as to compute the distance properly. This orthogonalization process (PCA or LDA) proves experimentally to improve significantly the recognition.
引用
收藏
页码:180 / 183
页数:4
相关论文
共 50 条
  • [21] A Review of Methods for the Geometric Post-Processing of Topology Optimized Models
    Subedi, Subodh C.
    Verma, Chaman Singh
    Suresh, Krishnan
    JOURNAL OF COMPUTING AND INFORMATION SCIENCE IN ENGINEERING, 2020, 20 (06)
  • [22] IMPROVING SEPARATION-BASED SPEAKER DIARIZATION VIA ITERATIVE MODEL REFINEMENT AND SPEAKER EMBEDDING BASED POST-PROCESSING
    Niu, Shu-Tong
    Du, Jun
    Sun, Lei
    Lee, Chin-Hui
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 8387 - 8391
  • [23] Similarity-based semilocal estimation of post-processing models
    Lerch, Sebastian
    Baran, Sandor
    JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES C-APPLIED STATISTICS, 2017, 66 (01) : 29 - 51
  • [24] Speaker tracking by anchor models using speaker segment cluster information
    Collet, Mikael
    Charlet, Delphine
    Bimbot, Frederic
    2006 IEEE International Conference on Acoustics, Speech and Signal Processing, Vols 1-13, 2006, : 1009 - 1012
  • [25] PCA fuzzy mixture model for speaker identification
    Lee, Y
    Lee, J
    Lee, KY
    INTELLIGENT DATA ENGINEERING AND AUTOMATED LEARNING, 2003, 2690 : 992 - 999
  • [26] GMM based on local PCA for speaker identification
    Seo, CW
    Lee, KY
    Lee, J
    ELECTRONICS LETTERS, 2001, 37 (24) : 1486 - 1488
  • [27] The ISAR Image Post-Processing for Multi-Point Target Identification
    Konovalyuk, Maxim
    Gorbunova, Anastasia
    Kuznetsov, Yury
    Baev, Andrey
    INTERNATIONAL JOURNAL OF ELECTRONICS AND TELECOMMUNICATIONS, 2011, 57 (04) : 433 - 436
  • [28] Efficient OCR Post-Processing Combining Language, Hypothesis and Error Models
    Llobet, Rafael
    Ramon Navarro-Cerdan, J.
    Perez-Cortes, Juan-Carlos
    Arlandis, Joaquim
    STRUCTURAL, SYNTACTIC, AND STATISTICAL PATTERN RECOGNITION, 2010, 6218 : 728 - 737
  • [29] Perceptrons with polynomial post-processing
    Sanzogni, L
    Bonner, RF
    Chan, R
    Vaccaro, JA
    EIGHTH IEEE INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 1996, : 472 - 474
  • [30] Post-processing and tapering of PCFs
    Leon-Saval, Sergio G.
    Birks, Tim A.
    Witkowska, Agata
    Lai, Ke
    Wadsworth, William J.
    2008 IEEE/LEOS WINTER TOPICAL MEETING SERIES, 2008, : 204 - 205