Speaker recognition using convolutional siamese neural networks

被引：0

作者：

Jung H. ^{[1
]}

Yoon S. ^{[1
]}

Park N. ^{[1
]}

机构：

[1] Dept. of Computer Science and Engineering, Konkuk University

来源：

Transactions of the Korean Institute of Electrical Engineers | 2020年 / 69卷 / 01期

关键词：

Convolutional Neural Netowork(CNN); MFCC; Siamese Networks; Speaker Recognition;

D O I：

10.5370/KIEE.2020.69.1.164

中图分类号：

学科分类号：

摘要：

Recently, machine learning has been applied in a variety of fields. Speaker recognition is one of attractive applications of machine learning. In this paper, we propose a convolutional Siamese neural network for speaker recognition. The proposed model generates feature vectors through the identical two convolutional neural networks for speech data of two speakers. The similarity is measured by calculating the Euclidean distance of two output feature vectors. If the calculated similarity is less than the threshold, it is judged that two speakers are the same. The experimental result of the proposed speaker recognition based on the convolutional Siamese neural network showed its accuracy was achieved up to 96%. The accuracy of one-shot classification using the trained convolutional Siamese neural network was evaluated also. For the evaluation, the 10-way one-shot classification for 10 speakers not used for learning stages were tested, resulting in 92% accuracy. © 2020 Korean Institute of Electrical Engineers. All rights reserved.

引用

页码：164 / 169

页数：5

共 50 条

[11] A deep learning approach to integrate convolutional neural networks in speaker recognition
Soufiane Hourri
Nikola S. Nikolov
Jamal Kharroubi
International Journal of Speech Technology, 2020, 23 : 615 - 623
[12] Speaker Diarization Using Deep Recurrent Convolutional Neural Networks for Speaker Embeddings
Cyrta, Pawel
Trzcinski, Tomasz
Stokowiec, Wojciech
INFORMATION SYSTEMS ARCHITECTURE AND TECHNOLOGY, PT I, 2018, 655 : 107 - 117
[13] Speaker Recognition Using Neural Networks and Conventional Classifiers
Farrell, Kevin R.
Mammone, Richard J.
Assaleh, Khaled T.
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1994, 2 (01): : 194 - 205
[14] AN APPLICATION OF SPEAKER RECOGNITION USING ARTIFICIAL NEURAL NETWORKS
Caner, Murat
Ustun, Seydi Vakkas
PAMUKKALE UNIVERSITY JOURNAL OF ENGINEERING SCIENCES-PAMUKKALE UNIVERSITESI MUHENDISLIK BILIMLERI DERGISI, 2006, 12 (02): : 279 - 284
[15] Speaker recognition using pulse coupled neural networks
Timoszczuk, Antonio Pedro
Cabral, Euvaldo F., Jr.
2007 IEEE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1-6, 2007, : 1965 - +
[16] Personality Recognition Using Convolutional Neural Networks
Gimenez, Maite
Paredes, Roberto
Rosso, Paolo
COMPUTATIONAL LINGUISTICS AND INTELLIGENT TEXT PROCESSING, CICLING 2017, PT II, 2018, 10762 : 313 - 323
[17] Locally-Connected and Convolutional Neural Networks for Small Footprint Speaker Recognition
Chen, Yu-hsin
Lopez-Moreno, Ignacio
Sainath, Tara N.
Visontai, Mirko
Alvarez, Raziel
Parada, Carolina
16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 1136 - 1140
[18] Recognition of flowers using convolutional neural networks
Alkhonin, Abdulrahman
Almutairi, Abdulelah
Alburaidi, Abdulmajeed
Saudagar, Abdul Khader Jilani
INTERNATIONAL JOURNAL OF INTELLIGENT ENGINEERING INFORMATICS, 2020, 8 (03) : 186 - 197
[19] Research on Inception Module Incorporated Siamese Convolutional Neural Networks to Realize Face Recognition
Xu, Xian-Feng
Zhang, Li
Duan, Chen-Dong
Lu, Yong
IEEE ACCESS, 2020, 8 : 12168 - 12178
[20] Research on Inception Module Incorporated Siamese Convolutional Neural Networks to Realize Face Recognition
Xu X.-F.
Zhang L.
Lang B.
Xia Z.
1600, Chinese Institute of Electronics (48): : 643 - 647

← 1 2 3 4 5 →