Speaker recognition using convolutional siamese neural networks

被引:0
|
作者
Jung H. [1 ]
Yoon S. [1 ]
Park N. [1 ]
机构
[1] Dept. of Computer Science and Engineering, Konkuk University
关键词
Convolutional Neural Netowork(CNN); MFCC; Siamese Networks; Speaker Recognition;
D O I
10.5370/KIEE.2020.69.1.164
中图分类号
学科分类号
摘要
Recently, machine learning has been applied in a variety of fields. Speaker recognition is one of attractive applications of machine learning. In this paper, we propose a convolutional Siamese neural network for speaker recognition. The proposed model generates feature vectors through the identical two convolutional neural networks for speech data of two speakers. The similarity is measured by calculating the Euclidean distance of two output feature vectors. If the calculated similarity is less than the threshold, it is judged that two speakers are the same. The experimental result of the proposed speaker recognition based on the convolutional Siamese neural network showed its accuracy was achieved up to 96%. The accuracy of one-shot classification using the trained convolutional Siamese neural network was evaluated also. For the evaluation, the 10-way one-shot classification for 10 speakers not used for learning stages were tested, resulting in 92% accuracy. © 2020 Korean Institute of Electrical Engineers. All rights reserved.
引用
收藏
页码:164 / 169
页数:5
相关论文
共 50 条
  • [11] A deep learning approach to integrate convolutional neural networks in speaker recognition
    Soufiane Hourri
    Nikola S. Nikolov
    Jamal Kharroubi
    International Journal of Speech Technology, 2020, 23 : 615 - 623
  • [12] Speaker Diarization Using Deep Recurrent Convolutional Neural Networks for Speaker Embeddings
    Cyrta, Pawel
    Trzcinski, Tomasz
    Stokowiec, Wojciech
    INFORMATION SYSTEMS ARCHITECTURE AND TECHNOLOGY, PT I, 2018, 655 : 107 - 117
  • [13] Speaker Recognition Using Neural Networks and Conventional Classifiers
    Farrell, Kevin R.
    Mammone, Richard J.
    Assaleh, Khaled T.
    IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1994, 2 (01): : 194 - 205
  • [14] AN APPLICATION OF SPEAKER RECOGNITION USING ARTIFICIAL NEURAL NETWORKS
    Caner, Murat
    Ustun, Seydi Vakkas
    PAMUKKALE UNIVERSITY JOURNAL OF ENGINEERING SCIENCES-PAMUKKALE UNIVERSITESI MUHENDISLIK BILIMLERI DERGISI, 2006, 12 (02): : 279 - 284
  • [15] Speaker recognition using pulse coupled neural networks
    Timoszczuk, Antonio Pedro
    Cabral, Euvaldo F., Jr.
    2007 IEEE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1-6, 2007, : 1965 - +
  • [16] Personality Recognition Using Convolutional Neural Networks
    Gimenez, Maite
    Paredes, Roberto
    Rosso, Paolo
    COMPUTATIONAL LINGUISTICS AND INTELLIGENT TEXT PROCESSING, CICLING 2017, PT II, 2018, 10762 : 313 - 323
  • [17] Locally-Connected and Convolutional Neural Networks for Small Footprint Speaker Recognition
    Chen, Yu-hsin
    Lopez-Moreno, Ignacio
    Sainath, Tara N.
    Visontai, Mirko
    Alvarez, Raziel
    Parada, Carolina
    16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 1136 - 1140
  • [18] Recognition of flowers using convolutional neural networks
    Alkhonin, Abdulrahman
    Almutairi, Abdulelah
    Alburaidi, Abdulmajeed
    Saudagar, Abdul Khader Jilani
    INTERNATIONAL JOURNAL OF INTELLIGENT ENGINEERING INFORMATICS, 2020, 8 (03) : 186 - 197
  • [19] Research on Inception Module Incorporated Siamese Convolutional Neural Networks to Realize Face Recognition
    Xu, Xian-Feng
    Zhang, Li
    Duan, Chen-Dong
    Lu, Yong
    IEEE ACCESS, 2020, 8 : 12168 - 12178
  • [20] Research on Inception Module Incorporated Siamese Convolutional Neural Networks to Realize Face Recognition
    Xu X.-F.
    Zhang L.
    Lang B.
    Xia Z.
    1600, Chinese Institute of Electronics (48): : 643 - 647