Modeling long-range dependencies in speech data for text-independent speaker recognition

被引：0

作者：

Ming, Ji ^{[1
]}

Lin, Jie ^{[2
]}

机构：

[1] Queens Univ Belfast, Inst ECIT, Belfast BT7 1NN, Antrim, North Ireland

[2] Univ Elect Sci & Technol China, Sch Comp Sci, Chengdu, Peoples R China

来源：

2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12 | 2008年

关键词：

time dependence; segment modeling; speaker modeling; speaker recognition;

D O I：

暂无

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

In the paper, a new approach for modeling and matching long-range dependencies in free-text speech data is proposed for speaker recognition. The new approach consists of a sentence model to detail up to sentence-level dependencies in the training data, and a search algorithm that is capable of locating the matches of arbitrary-length segments between the training and testing sentences. The search algorithm is optimized to increase the probability for the match of long, continuous segments as opposed to short, separated segments, assuming that long, continuous segments contain more specific information about the speaker. The new approach has been evaluated on the NIST 1998 Speaker Recognition Evaluation database, and has shown improved performance.

引用

页码：4825 / +

页数：2

共 50 条

[1] TEXT-INDEPENDENT SPEAKER RECOGNITION
ATAL, BS
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1972, 52 (01): : 181 - &
[2] A novel speech feature fusion algorithm for text-independent speaker recognition
Ma, Biao
Xu, Chengben
Zhang, Ye
MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (24) : 64139 - 64156
[3] Data-Model Relationship in Text-Independent Speaker Recognition
John S. D. Mason
Nicholas W. D. Evans
Robert Stapert
Roland Auckenthaler
EURASIP Journal on Advances in Signal Processing, 2005
[4] Data-model relationship in text-independent speaker recognition
Mason, JSD
Evans, NWD
Stapert, R
Auckenthaler, R
EURASIP JOURNAL ON APPLIED SIGNAL PROCESSING, 2005, 2005 (04) : 471 - 481
[5] Text-dependent and text-independent speaker recognition of reverberant speech based on CNN
El-Moneim, Samia Abd
Sedik, Ahmed
Nassar, M. A.
El-Fishawy, Adel S.
Sharshar, A. M.
Hassan, Shaimaa E. A.
Mahmoud, Adel Zaghloul
Dessouky, Moawd I.
El-Banby, Ghada M.
El-Samie, Fathi E. Abd
El-Rabaie, El-Sayed M.
Neyazi, Badawi
Seddeq, H. S.
Ismail, Nabil A.
Khalaf, Ashraf A. M.
Elabyad, G. S. M.
INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2021, 24 (04) : 993 - 1006
[6] Investigation of the effect of data duration and speaker gender on text-independent speaker recognition
Hanilci, Cemal
Ertas, Figen
COMPUTERS & ELECTRICAL ENGINEERING, 2013, 39 (02) : 441 - 452
[7] Effect of Spoken Text on Text-independent Speaker Recognition
Alsulaiman, Mansour
PROCEEDINGS FIFTH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS, MODELLING AND SIMULATION, 2014, : 279 - 284
[8] Text-dependent and text-independent speaker recognition of reverberant speech based on CNN
Samia Abd El-Moneim
Ahmed Sedik
M. A. Nassar
Adel S. El-Fishawy
A. M. Sharshar
Shaimaa E. A. Hassan
Adel Zaghloul Mahmoud
Moawd I. Dessouky
Ghada M. El-Banby
Fathi E. Abd El-Samie
El-Sayed M. El-Rabaie
Badawi Neyazi
H. S. Seddeq
Nabil A. Ismail
Ashraf A. M. Khalaf
G. S. M. Elabyad
International Journal of Speech Technology, 2021, 24 : 993 - 1006
[9] A new approach for text-independent speaker recognition
Lung, SY
Chen, CCT
PATTERN RECOGNITION, 2000, 33 (08) : 1401 - 1403
[10] Improving Text-independent Speaker Recognition with GMM
Chakroun, Rania
Zouari, Leila Beltaifa
Frikha, Mondher
Ben Hamida, Ahmed
2016 2ND INTERNATIONAL CONFERENCE ON ADVANCED TECHNOLOGIES FOR SIGNAL AND IMAGE PROCESSING (ATSIP), 2016, : 693 - 696

← 1 2 3 4 5 →