Improved i-vector extraction technique for speaker verification with short utterances

被引:2
|
作者
Poddar A. [1 ]
Sahidullah M. [2 ]
Saha G. [1 ]
机构
[1] Department of Electronics and Electrical Communication Engineering, Indian Institute of Technology, Kharagpur
[2] Speech and Image Processing Unit, School of Computing, University of Eastern Finland, Joensuu
关键词
Baum–Welch statistics; Duration variability; GMM-UBM; i-Vector; Short utterance; Speaker recognition;
D O I
10.1007/s10772-017-9477-2
中图分类号
学科分类号
摘要
A major challenge in ASV is to improve performance with short speech segments for end-user convenience in real-world applications. In this paper, we present a detailed analysis of ASV systems to observe the duration variability effects on state-of-the-art i-vector and classical Gaussian mixture model-universal background model (GMM-UBM) based ASV systems. We observe an increase in uncertainty of model parameter estimation for i-vector based ASV with speech of shorter duration. In order to compensate the effect of duration variability in short utterances, we have proposed adaptation technique for Baum-Welch statistics estimation used to i-vector extraction. Information from pre-estimated background model parameters are used for adaptation method. The ASV performance with the proposed approach is considerably superior to the conventional i-vector based system. Furthermore, the fusion of proposed i-vector based system and GMM-UBM further improves the ASV performance, especially for short speech segments. Experiments conducted on two speech corpora, NIST SRE 2008 and 2010, have shown relative improvement in equal error rate (EER) in the range of 12–20%. © 2017, Springer Science+Business Media, LLC, part of Springer Nature.
引用
收藏
页码:473 / 488
页数:15
相关论文
共 50 条
  • [21] i-Vector with sparse representation classification for speaker verification
    Kua, Jia Min Karen
    Epps, Julien
    Ambikairajah, Eliathamby
    SPEECH COMMUNICATION, 2013, 55 (05) : 707 - 720
  • [22] FAST DISCRIMINATIVE SPEAKER VERIFICATION IN THE I-VECTOR SPACE
    Cumani, Sandro
    Bruemmer, Niko
    Burget, Lukas
    Laface, Pietro
    2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 4852 - 4855
  • [23] Discriminatively Trained i-vector Extractor for Speaker Verification
    Glembek, Ondrej
    Burget, Lukas
    Bruemmer, Niko
    Plchot, Oldrich
    Matejka, Pavel
    12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 144 - +
  • [24] A Novel Boosting Algorithm for Improved i-Vector based Speaker Verification in Noisy Environments
    Sarkar, Sourjya
    Rao, K. Sreenivasa
    15TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2014), VOLS 1-4, 2014, : 671 - 675
  • [25] Study of the Effect of I-vector Modeling on Short and Mismatch Utterance Duration for Speaker Verification
    Sarkar, A. K.
    Matrouf, D.
    Bousquet, P. M.
    Bonastre, J. F.
    13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 2661 - 2664
  • [26] Sparsity Analysis and Compensation for i-Vector Based Speaker Verification
    Li, Wei
    Fu, Tian Fan
    Zhu, Jie
    Chen, Ning
    SPEECH AND COMPUTER (SPECOM 2015), 2015, 9319 : 381 - 388
  • [27] Feature sparsity analysis for i-vector based speaker verification
    Li, Wei
    Fu, Tianfan
    You, Hanxu
    Zhu, Jie
    Chen, Ning
    SPEECH COMMUNICATION, 2016, 80 : 60 - 70
  • [28] Cosine Metric Learning for Speaker Verification in the i-Vector Space
    Bai, Zhong
    Zhang, Xiao-Lei
    Chen, Jingdong
    19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 1126 - 1130
  • [29] Geometric Discriminant Analysis for I-vector Based Speaker Verification
    Xu, Can
    Chen, Xianhong
    He, Liang
    Liu, Jia
    2019 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2019, : 1636 - 1640
  • [30] Bayesian Principal Component Analysis for I-Vector Speaker Verification
    Rong Y.-F.
    Chen C.
    Chen D.-Y.
    He Y.-J.
    Tien Tzu Hsueh Pao/Acta Electronica Sinica, 2021, 49 (11): : 2186 - 2194