Modeling long-range dependencies in speech data for text-independent speaker recognition

被引:0
|
作者
Ming, Ji [1 ]
Lin, Jie [2 ]
机构
[1] Queens Univ Belfast, Inst ECIT, Belfast BT7 1NN, Antrim, North Ireland
[2] Univ Elect Sci & Technol China, Sch Comp Sci, Chengdu, Peoples R China
关键词
time dependence; segment modeling; speaker modeling; speaker recognition;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In the paper, a new approach for modeling and matching long-range dependencies in free-text speech data is proposed for speaker recognition. The new approach consists of a sentence model to detail up to sentence-level dependencies in the training data, and a search algorithm that is capable of locating the matches of arbitrary-length segments between the training and testing sentences. The search algorithm is optimized to increase the probability for the match of long, continuous segments as opposed to short, separated segments, assuming that long, continuous segments contain more specific information about the speaker. The new approach has been evaluated on the NIST 1998 Speaker Recognition Evaluation database, and has shown improved performance.
引用
收藏
页码:4825 / +
页数:2
相关论文
共 50 条
  • [1] TEXT-INDEPENDENT SPEAKER RECOGNITION
    ATAL, BS
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1972, 52 (01): : 181 - &
  • [2] A novel speech feature fusion algorithm for text-independent speaker recognition
    Ma, Biao
    Xu, Chengben
    Zhang, Ye
    MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (24) : 64139 - 64156
  • [3] Data-Model Relationship in Text-Independent Speaker Recognition
    John S. D. Mason
    Nicholas W. D. Evans
    Robert Stapert
    Roland Auckenthaler
    EURASIP Journal on Advances in Signal Processing, 2005
  • [4] Data-model relationship in text-independent speaker recognition
    Mason, JSD
    Evans, NWD
    Stapert, R
    Auckenthaler, R
    EURASIP JOURNAL ON APPLIED SIGNAL PROCESSING, 2005, 2005 (04) : 471 - 481
  • [5] Text-dependent and text-independent speaker recognition of reverberant speech based on CNN
    El-Moneim, Samia Abd
    Sedik, Ahmed
    Nassar, M. A.
    El-Fishawy, Adel S.
    Sharshar, A. M.
    Hassan, Shaimaa E. A.
    Mahmoud, Adel Zaghloul
    Dessouky, Moawd I.
    El-Banby, Ghada M.
    El-Samie, Fathi E. Abd
    El-Rabaie, El-Sayed M.
    Neyazi, Badawi
    Seddeq, H. S.
    Ismail, Nabil A.
    Khalaf, Ashraf A. M.
    Elabyad, G. S. M.
    INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2021, 24 (04) : 993 - 1006
  • [6] Investigation of the effect of data duration and speaker gender on text-independent speaker recognition
    Hanilci, Cemal
    Ertas, Figen
    COMPUTERS & ELECTRICAL ENGINEERING, 2013, 39 (02) : 441 - 452
  • [7] Effect of Spoken Text on Text-independent Speaker Recognition
    Alsulaiman, Mansour
    PROCEEDINGS FIFTH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS, MODELLING AND SIMULATION, 2014, : 279 - 284
  • [8] Text-dependent and text-independent speaker recognition of reverberant speech based on CNN
    Samia Abd El-Moneim
    Ahmed Sedik
    M. A. Nassar
    Adel S. El-Fishawy
    A. M. Sharshar
    Shaimaa E. A. Hassan
    Adel Zaghloul Mahmoud
    Moawd I. Dessouky
    Ghada M. El-Banby
    Fathi E. Abd El-Samie
    El-Sayed M. El-Rabaie
    Badawi Neyazi
    H. S. Seddeq
    Nabil A. Ismail
    Ashraf A. M. Khalaf
    G. S. M. Elabyad
    International Journal of Speech Technology, 2021, 24 : 993 - 1006
  • [9] A new approach for text-independent speaker recognition
    Lung, SY
    Chen, CCT
    PATTERN RECOGNITION, 2000, 33 (08) : 1401 - 1403
  • [10] Improving Text-independent Speaker Recognition with GMM
    Chakroun, Rania
    Zouari, Leila Beltaifa
    Frikha, Mondher
    Ben Hamida, Ahmed
    2016 2ND INTERNATIONAL CONFERENCE ON ADVANCED TECHNOLOGIES FOR SIGNAL AND IMAGE PROCESSING (ATSIP), 2016, : 693 - 696