Speaker identification by anchor models with PCA/LDA post-processing

被引：0

作者：

Mami, Y ^{[1
]}

Charlet, D ^{[1
]}

机构：

[1] France Telecom, R&D, DIH IPS, F-22307 Lannion, France

来源：

2003 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING I | 2003年

关键词：

D O I：

暂无

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

Speaker representation by location is a new technique of speaker recognition and adaptation. It consists in representing a new speaker not in an absolute manner, but relatively to a set of well trained speaker models. Each new speaker is represented by its location in an optimal representation space. This paper addresses the location task. It describes a representation space built either by clustering speakers or by selecting an optimal subset of them. In this representation space, speaker location is then performed by the anchor models technique to find vector of coordinates. An orthogonalization process is then applied to the vector of coordinates, so as to compute the distance properly. This orthogonalization process (PCA or LDA) proves experimentally to improve significantly the recognition.

引用

页码：180 / 183

页数：4

共 50 条

[1] Post-processing techniques for a speaker diarization system
Tavarez, David
Navas, Eva
Erro, Daniel
Saratxaga, Ibon
Hernaez, Inma
PROCESAMIENTO DEL LENGUAJE NATURAL, 2012, (49): : 109 - 115
[2] Regularization of LDA for face recognition: A post-processing approach
Zuo, WM
Wang, KQ
Zhang, D
Yang, J
ANALYSIS AND MODELLING OF FACES AND GESTURES, PROCEEDINGS, 2005, 3723 : 377 - 391
[3] END-TO-END SPEAKER DIARIZATION AS POST-PROCESSING
Horiguchi, Shota
Garcia, Paola
Fujita, Yusuke
Watanabe, Shinji
Nagamatsu, Kenji
2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 7188 - 7192
[4] PURE SEGMENT SELECTION AS SPEAKER DIARIZATION POST-PROCESSING
Ben-Harush, Oshry
Guterman, Hugo
Lapidot, Itshak
2008 IEEE 25TH CONVENTION OF ELECTRICAL AND ELECTRONICS ENGINEERS IN ISRAEL, VOLS 1 AND 2, 2008, : 461 - +
[5] Post-processing on LDA's discriminant vectors for facial feature extraction
Wang, KQ
Zuo, WM
Zhang, D
AUDIO AND VIDEO BASED BIOMETRIC PERSON AUTHENTICATION, PROCEEDINGS, 2005, 3546 : 346 - 354
[6] Robust text-independent speaker identification using hybrid PCA&LDA
Kim, Min-Seok
Yu, Ha-Jin
Kwak, Keun-Chang
Chi, Su-Young
MICAI 2006: ADVANCES IN ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2006, 4293 : 1067 - +
[7] Improvement of a speaker authentication system through MLP's post-processing
Rodríguez-Liñares, L
García-Mateo, C
Alba-Castro, JL
NEURAL NETWORKS FOR SIGNAL PROCESSING XI, 2001, : 461 - 470
[8] PCA/LDA Approach for Text-Independent Speaker Recognition
Ge, Zhenhao
Sharma, Sudhendu R.
Smith, Mark J. T.
INDEPENDENT COMPONENT ANALYSES, COMPRESSIVE SAMPLING, WAVELETS, NEURAL NET, BIOSYSTEMS, AND NANOENGINEERING X, 2012, 8401
[9] Post-processing options and their effects on target identification performance
Sylvester, VB
Cohen, MN
SIGNAL PROCESSING, SENSOR FUSION, AND TARGET RECOGNITION VI, 1997, 3068 : 522 - 531
[10] On correlator pre- and post-processing for object identification
Coffield, PC
OPTICAL PATTERN RECOGNITION XI, 2000, 4043 : 317 - 328

← 1 2 3 4 5 →