Speaker recognition using convolutional Siamese neural networks

Cited by: 0

Authors:

Jung H. [1]

Yoon S. [1]

Park N. [1]

Affiliation:

[1] Dept. of Computer Science and Engineering, Konkuk University

Source:

Transactions of the Korean Institute of Electrical Engineers | 2020 / Vol. 69 / No. 1

Keywords:

Convolutional Neural Network (CNN); MFCC; Siamese Networks; Speaker Recognition

DOI:

10.5370/KIEE.2020.69.1.164

Abstract:

Recently, machine learning has been applied in a variety of fields, and speaker recognition is one of its attractive applications. In this paper, we propose a convolutional Siamese neural network for speaker recognition. The proposed model passes the speech data of two speakers through two identical convolutional neural networks to generate a feature vector for each. Similarity is measured as the Euclidean distance between the two output feature vectors. If this distance is less than a threshold, the two speakers are judged to be the same. In experiments, the proposed convolutional Siamese network achieved up to 96% accuracy on speaker recognition. The one-shot classification accuracy of the trained network was also evaluated: 10-way one-shot classification on 10 speakers not used during training yielded 92% accuracy. © 2020 Korean Institute of Electrical Engineers. All rights reserved.
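The decision rule described in the abstract can be illustrated with a minimal sketch. It assumes the feature vectors have already been produced by the trained convolutional branch, which is not shown here. The function names, the toy vectors, and the threshold value are hypothetical and are not taken from the paper. The one-shot step is written as nearest-neighbor matching against one reference vector per speaker, which is a common way to use a Siamese network for N-way one-shot classification and may differ from the authors' exact procedure.

```python
import math


def euclidean_distance(a, b):
    """Euclidean distance between two equal-length feature vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def same_speaker(emb_a, emb_b, threshold):
    """Judge two utterances to share a speaker when the distance is below the threshold."""
    return euclidean_distance(emb_a, emb_b) < threshold


def one_shot_classify(query, support):
    """Return the speaker whose single reference vector is closest to the query.

    `support` maps a speaker label to exactly one feature vector (one shot).
    """
    return min(support, key=lambda spk: euclidean_distance(query, support[spk]))


# Toy 2-D vectors standing in for CNN outputs (assumed values, for illustration only).
support_set = {
    "speaker_a": [0.0, 0.0],
    "speaker_b": [3.0, 4.0],
    "speaker_c": [10.0, 10.0],
}
query_vector = [2.5, 3.5]

print(same_speaker([0.0, 0.0], [3.0, 4.0], threshold=6.0))
print(one_shot_classify(query_vector, support_set))
```

In practice the threshold would be tuned on held-out pairs, and a 10-way evaluation would use a support set with 10 unseen speakers rather than the three shown here.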

Pages: 164-169

Page count: 5
