Modeling long-range dependencies in speech data for text-independent speaker recognition

被引：0

作者：

Ming, Ji ^{[1
]}

Lin, Jie ^{[2
]}

机构：

[1] Queens Univ Belfast, Inst ECIT, Belfast BT7 1NN, Antrim, North Ireland

[2] Univ Elect Sci & Technol China, Sch Comp Sci, Chengdu, Peoples R China

来源：

2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12 | 2008年

关键词：

time dependence; segment modeling; speaker modeling; speaker recognition;

D O I：

暂无

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

In the paper, a new approach for modeling and matching long-range dependencies in free-text speech data is proposed for speaker recognition. The new approach consists of a sentence model to detail up to sentence-level dependencies in the training data, and a search algorithm that is capable of locating the matches of arbitrary-length segments between the training and testing sentences. The search algorithm is optimized to increase the probability for the match of long, continuous segments as opposed to short, separated segments, assuming that long, continuous segments contain more specific information about the speaker. The new approach has been evaluated on the NIST 1998 Speaker Recognition Evaluation database, and has shown improved performance.

引用

页码：4825 / +

页数：2

共 50 条

[21] TEXT-INDEPENDENT SPEAKER RECOGNITION USING NEURAL NETWORKS
HATTORI, H
IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 1993, E76D (03) : 345 - 351
[22] PCA/LDA Approach for Text-Independent Speaker Recognition
Ge, Zhenhao
Sharma, Sudhendu R.
Smith, Mark J. T.
INDEPENDENT COMPONENT ANALYSES, COMPRESSIVE SAMPLING, WAVELETS, NEURAL NET, BIOSYSTEMS, AND NANOENGINEERING X, 2012, 8401
[23] Text-independent Hakka Speaker Recognition in Noisy Environments
Peng, Jie
Chen, Chin-Ta
Yang, Cheng-Fu
SENSORS AND MATERIALS, 2025, 37 (01) : 441 - 451
[24] A study of variational method for text-independent speaker recognition
He, Liang
Tian, Yao
Liu, Yi
Dong, Fang
Zhang, WeiQiang
Liu, Jia
2016 10TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2016,
[25] Attention and DCT Based Global Context Modeling for Text-Independent Speaker Recognition
Xia, Wei
Hansen, John H. L.
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2023, 31 : 2668 - 2679
[26] An investigation of dependencies between frequency components and speaker characteristics for text-independent speaker identification
Lu, Xugang
Dang, Jianwu
SPEECH COMMUNICATION, 2008, 50 (04) : 312 - 322
[27] Compensation for domain mismatch in text-independent speaker recognition
Bahmaninezhad, Fahimeh
Hansen, John H. L.
19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 1071 - 1075
[28] Exploring discriminative learning for text-independent speaker recognition
Liu, Ming
Zhang, Zhengyou
Hasegawa-Johnson, Mark
Huang, Thomas S.
2007 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOLS 1-5, 2007, : 56 - 59
[29] A discriminative training approach for text-independent speaker recognition
Hong, QY
Kwong, S
SIGNAL PROCESSING, 2005, 85 (07) : 1449 - 1463
[30] I-MATRIX FOR TEXT-INDEPENDENT SPEAKER RECOGNITION
He, Liang
Liu, Jia
2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 7194 - 7198

← 1 2 3 4 5 →