On the use of nearest feature line for speaker identification

被引:22
|
作者
Chen, K [1 ]
Wu, TY
Zhang, HJ
机构
[1] Univ Birmingham, Sch Comp Sci, Birmingham B15 2TT, W Midlands, England
[2] Peking Univ, Ctr Informat Sci, Natl Lab Machine Percept, Beijing 100871, Peoples R China
[3] Microsoft Res Asia, Sigma Ctr, Beijing 100080, Peoples R China
基金
中国国家自然科学基金;
关键词
nearest feature line; speaker identification; dynamic time warping; vector quantization; nearest neighboring measure;
D O I
10.1016/S0167-8655(02)00147-2
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
As a new pattern classification method, nearest feature line (NFL) provides an effective way to tackle the sort of pattern recognition problems where only limited data are available for training. In this paper, we explore the use of NFL for speaker identification in terms of limited data and examine how the NFL performs in such a vexing problem of various mismatches between training and test. In order to speed up NFL in decision-making, we propose an alternative method for similarity measure. We have applied the improved NFL to speaker identification of different operating modes. Its text-dependent performance is better than the dynamic time warping (DTW) on the Ti46 corpus, while its computational load is much lower than that of DTW. Moreover, we propose an utterance partitioning strategy used in the NFL for better performance. For the text-independent mode, we employ the NFL to be a new similarity measure in vector quantization (VQ), which causes the VQ to perform better on the KING corpus. Some computational issues on the NFL are also discussed in this paper. (C) 2002 Elsevier Science B.V. All rights reserved.
引用
收藏
页码:1735 / 1746
页数:12
相关论文
共 50 条
  • [41] Robust feature based on speech harmonic structure for speaker identification
    College of Communication and Information Engineering, Nanjing Univ. of Posts and Telecom., Nanjing 210003, China
    Dianzi Yu Xinxi Xuebao, 2006, 10 (1786-1789):
  • [42] A new feature transformation method based on rotation for speaker identification
    Kim, Min-Seok
    Yu, Ha-Jin
    19TH IEEE INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE, VOL I, PROCEEDINGS, 2007, : 68 - 73
  • [43] ROBUST SPEAKER IDENTIFICATION USING AN AUDITORY-BASED FEATURE
    Li, Qi
    Huang, Yan
    2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 4514 - 4517
  • [44] Audio-visual speaker identification with asynchronous articulatory feature
    Chen, Yanxiang
    Liu, M.
    ELECTRONICS LETTERS, 2010, 46 (03) : 242 - U77
  • [45] The performance comparison of fitting feature with segment model in speaker identification
    Yu, CG
    Yang, YC
    Wu, ZH
    2003 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS, VOLS 1-5, CONFERENCE PROCEEDINGS, 2003, : 4216 - 4221
  • [46] Whispered speaker identification based on feature and model hybrid compensation
    GU Xiaojiang ZHAO Heming L(U|¨) Gang (School of Electronics and Information
    ChineseJournalofAcoustics, 2012, 31 (04) : 499 - 508
  • [47] Improved MFCC-Based Feature for Robust Speaker Identification
    吴尊敬
    曹志刚
    Tsinghua Science and Technology, 2005, (02) : 158 - 161
  • [48] Fuzzy audio-visual feature maps for speaker identification
    Chibelushi, CC
    APPLICATIONS AND SCIENCE IN SOFT COMPUTING, 2004, : 317 - 322
  • [49] STATISTICAL FEATURE OF PITCH FREQUENCY DISTRIBUTIONS FOR OBUST SPEAKER IDENTIFICATION
    Zhang Linghua Zheng Baoyu Yang Zhen (Dept of Info. Eng.
    Journal of Electronics(China), 2005, (04) : 437 - 442
  • [50] A syntactic approach to automatic lip feature extraction for speaker identification
    Wark, T
    Sridharan, S
    PROCEEDINGS OF THE 1998 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-6, 1998, : 3693 - 3696