Modeling long-range dependencies in speech data for text-independent speaker recognition

被引:0
|
作者
Ming, Ji [1 ]
Lin, Jie [2 ]
机构
[1] Queens Univ Belfast, Inst ECIT, Belfast BT7 1NN, Antrim, North Ireland
[2] Univ Elect Sci & Technol China, Sch Comp Sci, Chengdu, Peoples R China
来源
2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12 | 2008年
关键词
time dependence; segment modeling; speaker modeling; speaker recognition;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In the paper, a new approach for modeling and matching long-range dependencies in free-text speech data is proposed for speaker recognition. The new approach consists of a sentence model to detail up to sentence-level dependencies in the training data, and a search algorithm that is capable of locating the matches of arbitrary-length segments between the training and testing sentences. The search algorithm is optimized to increase the probability for the match of long, continuous segments as opposed to short, separated segments, assuming that long, continuous segments contain more specific information about the speaker. The new approach has been evaluated on the NIST 1998 Speaker Recognition Evaluation database, and has shown improved performance.
引用
收藏
页码:4825 / +
页数:2
相关论文
共 50 条
  • [22] PCA/LDA Approach for Text-Independent Speaker Recognition
    Ge, Zhenhao
    Sharma, Sudhendu R.
    Smith, Mark J. T.
    INDEPENDENT COMPONENT ANALYSES, COMPRESSIVE SAMPLING, WAVELETS, NEURAL NET, BIOSYSTEMS, AND NANOENGINEERING X, 2012, 8401
  • [23] Text-independent Hakka Speaker Recognition in Noisy Environments
    Peng, Jie
    Chen, Chin-Ta
    Yang, Cheng-Fu
    SENSORS AND MATERIALS, 2025, 37 (01) : 441 - 451
  • [24] A study of variational method for text-independent speaker recognition
    He, Liang
    Tian, Yao
    Liu, Yi
    Dong, Fang
    Zhang, WeiQiang
    Liu, Jia
    2016 10TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2016,
  • [25] Attention and DCT Based Global Context Modeling for Text-Independent Speaker Recognition
    Xia, Wei
    Hansen, John H. L.
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2023, 31 : 2668 - 2679
  • [26] An investigation of dependencies between frequency components and speaker characteristics for text-independent speaker identification
    Lu, Xugang
    Dang, Jianwu
    SPEECH COMMUNICATION, 2008, 50 (04) : 312 - 322
  • [27] Compensation for domain mismatch in text-independent speaker recognition
    Bahmaninezhad, Fahimeh
    Hansen, John H. L.
    19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 1071 - 1075
  • [28] Exploring discriminative learning for text-independent speaker recognition
    Liu, Ming
    Zhang, Zhengyou
    Hasegawa-Johnson, Mark
    Huang, Thomas S.
    2007 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOLS 1-5, 2007, : 56 - 59
  • [29] A discriminative training approach for text-independent speaker recognition
    Hong, QY
    Kwong, S
    SIGNAL PROCESSING, 2005, 85 (07) : 1449 - 1463
  • [30] I-MATRIX FOR TEXT-INDEPENDENT SPEAKER RECOGNITION
    He, Liang
    Liu, Jia
    2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 7194 - 7198