Speaker recognition using convolutional siamese neural networks

被引:0
|
作者
Jung H. [1 ]
Yoon S. [1 ]
Park N. [1 ]
机构
[1] Dept. of Computer Science and Engineering, Konkuk University
关键词
Convolutional Neural Netowork(CNN); MFCC; Siamese Networks; Speaker Recognition;
D O I
10.5370/KIEE.2020.69.1.164
中图分类号
学科分类号
摘要
Recently, machine learning has been applied in a variety of fields. Speaker recognition is one of attractive applications of machine learning. In this paper, we propose a convolutional Siamese neural network for speaker recognition. The proposed model generates feature vectors through the identical two convolutional neural networks for speech data of two speakers. The similarity is measured by calculating the Euclidean distance of two output feature vectors. If the calculated similarity is less than the threshold, it is judged that two speakers are the same. The experimental result of the proposed speaker recognition based on the convolutional Siamese neural network showed its accuracy was achieved up to 96%. The accuracy of one-shot classification using the trained convolutional Siamese neural network was evaluated also. For the evaluation, the 10-way one-shot classification for 10 speakers not used for learning stages were tested, resulting in 92% accuracy. © 2020 Korean Institute of Electrical Engineers. All rights reserved.
引用
收藏
页码:164 / 169
页数:5
相关论文
共 50 条
  • [21] Siamese Convolutional Neural Network for ASL Alphabet Recognition
    Fierro Radilla, Atoany Nazareth
    Perez Daniel, Karina Ruby
    COMPUTACION Y SISTEMAS, 2020, 24 (03): : 1211 - 1218
  • [22] Convolutional neural network vectors for speaker recognition
    Soufiane Hourri
    Nikola S. Nikolov
    Jamal Kharroubi
    International Journal of Speech Technology, 2021, 24 : 389 - 400
  • [23] Convolutional neural network vectors for speaker recognition
    Hourri, Soufiane
    Nikolov, Nikola S.
    Kharroubi, Jamal
    INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2021, 24 (02) : 389 - 400
  • [24] Speaker age and gender recognition using 1D and 2D convolutional neural networks
    Yucesoy, Erguen
    NEURAL COMPUTING & APPLICATIONS, 2024, 36 (06): : 3065 - 3075
  • [26] Biometric Speaker Recognition Using Neural Networks and Wavelet Transform
    Daghbosheh, Mohammed
    Hattab, Ezz
    Bisher, Ahmad
    2011 INTERNATIONAL CONFERENCE ON CIVIL ENGINEERING AND INFORMATION TECHNOLOGY (CEIT 2011), 2011, : 1 - 8
  • [27] Speaker age and gender recognition using 1D and 2D convolutional neural networks
    Ergün Yücesoy
    Neural Computing and Applications, 2024, 36 : 3065 - 3075
  • [28] Using neural networks for automatic speaker recognition: A practical approach
    Pinto, RGCP
    Pinto, HLCP
    Caloba, LP
    38TH MIDWEST SYMPOSIUM ON CIRCUITS AND SYSTEMS, PROCEEDINGS, VOLS 1 AND 2, 1996, : 1078 - 1080
  • [29] Speaker recognition using Radial Basis Function neural networks
    Deng, JP
    Venkateswarlu, R
    HYBRID INFORMATION SYSTEMS, 2002, : 57 - 64
  • [30] Speaker recognition using dynamic synapse-neural networks
    George, S
    Dibazar, A
    Berger, TW
    SECOND JOINT EMBS-BMES CONFERENCE 2002, VOLS 1-3, CONFERENCE PROCEEDINGS: BIOENGINEERING - INTEGRATIVE METHODOLOGIES, NEW TECHNOLOGIES, 2002, : 151 - 152