Modeling long-range dependencies in speech data for text-independent speaker recognition

Cited by: 0
Authors
Ming, Ji [1]
Lin, Jie [2]
Affiliations
[1] Queens Univ Belfast, Inst ECIT, Belfast BT7 1NN, Antrim, North Ireland
[2] Univ Elect Sci & Technol China, Sch Comp Sci, Chengdu, Peoples R China
Source
2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12 | 2008
Keywords
time dependence; segment modeling; speaker modeling; speaker recognition
DOI
Not available
Chinese Library Classification number
O42 [Acoustics]
Discipline classification codes
070206; 082403
Abstract
This paper proposes a new approach to modeling and matching long-range dependencies in free-text speech data for speaker recognition. The approach consists of a sentence model that captures dependencies up to the sentence level in the training data, and a search algorithm capable of locating matches of arbitrary-length segments between the training and testing sentences. The search algorithm is optimized to increase the probability of matching long, continuous segments rather than short, separated segments, on the assumption that long, continuous segments carry more speaker-specific information. The approach was evaluated on the NIST 1998 Speaker Recognition Evaluation database and showed improved performance.
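The idea of preferring long, continuous matches over short, separated ones can be illustrated with a toy sketch. This is not the authors' sentence model or search algorithm, which the abstract does not specify; it is a simplified stand-in under stated assumptions: frames are scalar values, two frames "match" when they differ by at most a hypothetical tolerance `tol`, and each maximal run of consecutive matches is rewarded by its length raised to a hypothetical exponent `alpha` greater than 1. The function names `diagonal_runs` and `segment_score` are illustrative only.

```python
from typing import List, Sequence


def diagonal_runs(train: Sequence[float], test: Sequence[float],
                  tol: float = 0.1) -> List[int]:
    """Return the lengths of maximal runs of consecutive matching frames.

    Assumption (not from the paper): train[i] and test[j] match when
    |train[i] - test[j]| <= tol. A run is a maximal stretch of matches
    (i, j), (i+1, j+1), ... along a diagonal of the comparison grid.
    """
    n, m = len(train), len(test)
    # run[i][j] = length of the matching run that ends at train[i-1], test[j-1]
    run = [[0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            if abs(train[i - 1] - test[j - 1]) <= tol:
                run[i][j] = run[i - 1][j - 1] + 1

    lengths = []
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            # A run is maximal when it cannot be extended one step further.
            ends_here = i == n or j == m or run[i + 1][j + 1] == 0
            if run[i][j] > 0 and ends_here:
                lengths.append(run[i][j])
    return lengths


def segment_score(train: Sequence[float], test: Sequence[float],
                  tol: float = 0.1, alpha: float = 2.0) -> float:
    """Toy score: sum of run_length ** alpha, so one long run outweighs
    several short runs covering the same number of frames (alpha > 1)."""
    return sum(length ** alpha for length in diagonal_runs(train, test, tol))


if __name__ == "__main__":
    train = [1, 2, 3, 4]
    print(segment_score(train, [1, 2, 3, 4]))     # one continuous run of 4 frames
    print(segment_score(train, [1, 2, 9, 3, 4]))  # two separated runs of 2 frames
```

In both calls four test frames find a match, but the continuous match scores higher than the interrupted one, which mirrors the assumption stated in the abstract only in spirit.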
Pages: 4825 / +
Page count: 2
Related papers
50 in total
  • [31] Text-independent speaker recognition using graph matching
    Hautamaki, Ville
    Kinnunen, Tomi
    Franti, Pasi
    PATTERN RECOGNITION LETTERS, 2008, 29 (09) : 1427 - 1432
  • [32] Searching through a speech memory for text-independent speaker verification
    Petrovska-Delacrétaz, D
    El Hannani, A
    Chollet, G
    AUDIO-BASED AND VIDEO-BASED BIOMETRIC PERSON AUTHENTICATION, PROCEEDINGS, 2003, 2688 : 95 - 103
  • [33] Text-independent speaker verification using covariance modeling
    Zilca, RD
    IEEE SIGNAL PROCESSING LETTERS, 2001, 8 (04) : 97 - 99
  • [34] FACTORED COVARIANCE MODELING FOR TEXT-INDEPENDENT SPEAKER VERIFICATION
    Wang, Eryu
    Lee, Kong Aik
    Ma, Bin
    Li, Haizhou
    Guo, Wu
    Dai, Lirong
    2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 4856 - 4859
  • [35] Text-independent speaker identification using fenonic speaker Markov modeling
    Birnbaum, M
    Brown, KL
    Bardenhagen, S
    1996 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, CONFERENCE PROCEEDINGS, VOLS 1-6, 1996, : 677 - 680
  • [36] AN ACOUSTIC SEGMENT MODEL APPROACH TO INCORPORATING TEMPORAL INFORMATION INTO SPEAKER MODELING FOR TEXT-INDEPENDENT SPEAKER RECOGNITION
    Tsao, Yu
    Sun, Hanwu
    Li, Haizhou
    Lee, Chin-Hui
    2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 4422 - 4425
  • [37] Text-independent speaker identification
    Gish, Herbert
    Schmidt, Michael
    IEEE SIGNAL PROCESSING MAGAZINE, 1994, 11 (04) : 18 - 32
  • [38] An Euclidean distance measure between covariance matrices of speech cepstra for text-independent speaker recognition
    Brummer, JNL
    Strydom, LR
    COMSIG '97 - PROCEEDINGS OF THE 1997 SOUTH AFRICAN SYMPOSIUM ON COMMUNICATIONS AND SIGNAL PROCESSING, 1997, : 167 - 172
  • [39] Multigrained modeling with pattern specific maximum likelihood transformations for text-independent speaker recognition
    Chaudhari, UV
    Navrátil, J
    Maes, SH
    IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2003, 11 (01) : 61 - 69
  • [40] Cepstral Trajectories in Linguistic Units for Text-Independent Speaker Recognition
    Franco-Pedroso, Javier
    Espinoza-Cuadros, Fernando
    Gonzalez-Rodriguez, Joaquin
    ADVANCES IN SPEECH AND LANGUAGE TECHNOLOGIES FOR IBERIAN LANGUAGES, 2012, 328 : 20 - 29