On the use of nearest feature line for speaker identification

被引:22
|
作者
Chen, K [1 ]
Wu, TY
Zhang, HJ
机构
[1] Univ Birmingham, Sch Comp Sci, Birmingham B15 2TT, W Midlands, England
[2] Peking Univ, Ctr Informat Sci, Natl Lab Machine Percept, Beijing 100871, Peoples R China
[3] Microsoft Res Asia, Sigma Ctr, Beijing 100080, Peoples R China
基金
中国国家自然科学基金;
关键词
nearest feature line; speaker identification; dynamic time warping; vector quantization; nearest neighboring measure;
D O I
10.1016/S0167-8655(02)00147-2
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
As a new pattern classification method, nearest feature line (NFL) provides an effective way to tackle the sort of pattern recognition problems where only limited data are available for training. In this paper, we explore the use of NFL for speaker identification in terms of limited data and examine how the NFL performs in such a vexing problem of various mismatches between training and test. In order to speed up NFL in decision-making, we propose an alternative method for similarity measure. We have applied the improved NFL to speaker identification of different operating modes. Its text-dependent performance is better than the dynamic time warping (DTW) on the Ti46 corpus, while its computational load is much lower than that of DTW. Moreover, we propose an utterance partitioning strategy used in the NFL for better performance. For the text-independent mode, we employ the NFL to be a new similarity measure in vector quantization (VQ), which causes the VQ to perform better on the KING corpus. Some computational issues on the NFL are also discussed in this paper. (C) 2002 Elsevier Science B.V. All rights reserved.
引用
收藏
页码:1735 / 1746
页数:12
相关论文
共 50 条
  • [21] Avoiding the interpolation inaccuracy in nearest feature line classifier by spectral feature analysis
    Altincay, Hakan
    Erenel, Zafer
    PATTERN RECOGNITION LETTERS, 2013, 34 (12) : 1372 - 1380
  • [22] Nearest feature line classifier based on collaborative representation with nearest neighbour search algorithm
    Zhuang, Zhongjie
    Pan, Jeng-Shyang
    Chu, Shu-Chuan
    Luo, Hao
    ELECTRONICS LETTERS, 2021, 57 (01) : 20 - 22
  • [23] Nearest Feature Line Discriminant Analysis in DFRCT Domain for Image Feature Extraction
    Yan, Lijun
    Wang, Cong
    Pan, Jeng-Shyang
    COMPUTATIONAL COLLECTIVE INTELLIGENCE - TECHNOLOGIES AND APPLICATIONS, PT II, 2012, 7654 : 264 - 272
  • [24] MISSING FEATURE RECONSTRUCTION METHODS FOR ROBUST SPEAKER IDENTIFICATION
    Zhang, Xueliang
    Zhang, Hui
    Gao, Guanglai
    2014 PROCEEDINGS OF THE 22ND EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2014, : 1482 - 1486
  • [25] Audio-Visual Feature Fusion for Speaker Identification
    Almaadeed, Noor
    Aggoun, Amar
    Amira, Abbes
    NEURAL INFORMATION PROCESSING, ICONIP 2012, PT I, 2012, 7663 : 56 - 67
  • [26] PHYSIOLOGICALLY-MOTIVATED FEATURE EXTRACTION FOR SPEAKER IDENTIFICATION
    Wang, Jianglin
    Johnson, Michael T.
    2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
  • [27] Incorporating auditory feature uncertainties in robust speaker identification
    Shao, Yang
    Srinivasan, Soundararajan
    Wang, DeLiang
    2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL IV, PTS 1-3, 2007, : 277 - +
  • [28] Acoustic feature extraction method for robust speaker identification
    Zuoqiang Li
    Yong Gao
    Multimedia Tools and Applications, 2016, 75 : 7391 - 7406
  • [29] A Feature Level Fusion Scheme for Robust Speaker Identification
    Sekkate, Sara
    Khalil, Mohammed
    Adib, Abdellah
    BIG DATA, CLOUD AND APPLICATIONS, BDCA 2018, 2018, 872 : 289 - 300
  • [30] Acoustic feature extraction method for robust speaker identification
    Li, Zuoqiang
    Gao, Yong
    MULTIMEDIA TOOLS AND APPLICATIONS, 2016, 75 (12) : 7391 - 7406