Modeling long-range dependencies in speech data for text-independent speaker recognition

被引:0
|
作者
Ming, Ji [1 ]
Lin, Jie [2 ]
机构
[1] Queens Univ Belfast, Inst ECIT, Belfast BT7 1NN, Antrim, North Ireland
[2] Univ Elect Sci & Technol China, Sch Comp Sci, Chengdu, Peoples R China
来源
2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12 | 2008年
关键词
time dependence; segment modeling; speaker modeling; speaker recognition;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In the paper, a new approach for modeling and matching long-range dependencies in free-text speech data is proposed for speaker recognition. The new approach consists of a sentence model to detail up to sentence-level dependencies in the training data, and a search algorithm that is capable of locating the matches of arbitrary-length segments between the training and testing sentences. The search algorithm is optimized to increase the probability for the match of long, continuous segments as opposed to short, separated segments, assuming that long, continuous segments contain more specific information about the speaker. The new approach has been evaluated on the NIST 1998 Speaker Recognition Evaluation database, and has shown improved performance.
引用
收藏
页码:4825 / +
页数:2
相关论文
共 50 条
  • [41] Performance of Text-Independent Automatic Speaker Recognition on a Multicore System
    Kouatly, Rand
    Khan, Talha Ali
    TSINGHUA SCIENCE AND TECHNOLOGY, 2024, 29 (02): : 447 - 456
  • [42] Robust features for text-independent speaker recognition with short utterances
    Rania Chakroun
    Mondher Frikha
    Neural Computing and Applications, 2020, 32 : 13863 - 13883
  • [43] Text-independent speaker recognition using support vector machine
    Hou, FL
    Wang, BX
    2001 INTERNATIONAL CONFERENCES ON INFO-TECH AND INFO-NET PROCEEDINGS, CONFERENCE A-G: INFO-TECH & INFO-NET: A KEY TO BETTER LIFE, 2001, : C402 - C407
  • [44] A Chain of Gaussian Mixture Model for Text-independent Speaker Recognition
    Chen, Yanxiang
    Liu, Ming
    ORIENTAL COCOSDA 2009 - INTERNATIONAL CONFERENCE ON SPEECH DATABASE AND ASSESSMENTS, 2009, : 100 - +
  • [45] Adaptive fuzzy wavelet algorithm for text-independent speaker recognition
    Lung, SY
    PATTERN RECOGNITION, 2004, 37 (10) : 2095 - 2096
  • [46] A TEXT-INDEPENDENT SPEAKER RECOGNITION SYSTEM BASED ON VOWEL SPOTTING
    FAKOTAKIS, N
    TSOPANOGLOU, A
    KOKKINAKIS, G
    SPEECH COMMUNICATION, 1993, 12 (01) : 57 - 68
  • [47] TLS-NAP algorithm for text-independent speaker recognition
    He, Liang
    Yang, Yi
    Liu, Jia
    Moshi Shibie yu Rengong Zhineng/Pattern Recognition and Artificial Intelligence, 2012, 25 (06): : 916 - 921
  • [48] A Longest Matching Segment Approach for Text-Independent Speaker Recognition
    Jafari, Ayeh
    Srinivasan, Ramji
    Crookes, Danny
    Ming, Ji
    11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 1-2, 2010, : 1469 - 1472
  • [49] Spin-Image Descriptors for Text-Independent Speaker Recognition
    Mohammed, Suhaila N.
    Jabir, Adnan J.
    Abbas, Zaid Ali
    EMERGING TRENDS IN INTELLIGENT COMPUTING AND INFORMATICS: DATA SCIENCE, INTELLIGENT INFORMATION SYSTEMS AND SMART COMPUTING, 2020, 1073 : 216 - 226
  • [50] FREQUENCY AND TEMPORAL CONVOLUTIONAL ATTENTION FOR TEXT-INDEPENDENT SPEAKER RECOGNITION
    Yadav, Sarthak
    Rai, Atul
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 6794 - 6798