Convolutional neural network vectors for speaker recognition

被引:0
|
作者
Soufiane Hourri
Nikola S. Nikolov
Jamal Kharroubi
机构
[1] Laboratoire des Systèmes Intelligents et Applications,
[2] Faculté des Sciences et Techniques,undefined
[3] Université Sidi Mohamed Ben Abdellah,undefined
[4] University of Limerick,undefined
关键词
Speaker recognition; MFCC; Convolutional neural network; Restricted Boltzmann machine; Deep learning;
D O I
暂无
中图分类号
学科分类号
摘要
Deep learning models are now considered state-of-the-art in many areas of pattern recognition. In speaker recognition, several architectures have been studied, such as deep neural networks (DNNs), deep belief networks (DBNs), restricted Boltzmann machines (RBMs), and so on, while convolutional neural networks (CNNs) are the most widely used models in computer vision. The problem is that CNN is limited to the computer vision field due to its structure which is designed for two-dimensional data. To overcome this limitation, we aim at developing a customized CNN for speaker recognition. The goal of this paper is to propose a new approach to extract speaker characteristics by constructing CNN filters linked to the speaker. Besides, we propose new vectors to identify speakers, which we call in this work convVectors. Experiments have been performed with a gender-dependent corpus (THUYG-20 SRE) under three noise conditions : clean, 9db, and 0db. We compared the proposed method with our baseline system and the state-of-the-art methods. Results showed that the convVectors method was the most robust, improving the baseline system by an average of 43%, and recording an equal error rate of 1.05% EER. This is an important finding to understand how deep learning models can be adapted to the problem of speaker recognition.
引用
收藏
页码:389 / 400
页数:11
相关论文
共 50 条
  • [41] Emotion Recognition Using a Convolutional Neural Network
    Zatarain-Cabada, Ramon
    Lucia Barron-Estrada, Maria
    Gonzalez-Hernandez, Francisco
    Rodriguez-Rangel, Hector
    ADVANCES IN COMPUTATIONAL INTELLIGENCE, MICAI 2017, PT II, 2018, 10633 : 208 - 219
  • [42] Convolutional Neural Network Architecture for Semaphore Recognition
    Li, Wanchong
    Yang, Yuliang
    Wang, Mengyuan
    Zhang, Linhao
    Zhu, Mengyu
    PROCEEDINGS OF 2018 IEEE 9TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING AND SERVICE SCIENCE (ICSESS), 2018, : 559 - 562
  • [43] Dynamic Convolutional Neural Network for Activity Recognition
    You, Chih-Hsiang
    Chiang, Chen-Kuo
    2016 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA), 2016,
  • [44] Implementation of Convolutional Neural Network for Speech Recognition
    Wang, Zhichao
    Na, Xingyu
    Liu, Yong
    Pan, Jielin
    Yan, Yonghong
    INTERNATIONAL ACADEMIC CONFERENCE ON THE INFORMATION SCIENCE AND COMMUNICATION ENGINEERING (ISCE 2014), 2014, : 239 - 243
  • [45] A Hybrid convolutional neural network for sketch recognition
    Zhang, Xingyuan
    Huang, Yaping
    Zou, Qi
    Pei, Yanting
    Zhang, Runsheng
    Wang, Song
    PATTERN RECOGNITION LETTERS, 2020, 130 : 73 - 82
  • [46] Fish Recognition Using Convolutional Neural Network
    Ding, Guoqing
    Song, Yan
    Guo, Jia
    Feng, Chen
    Li, Guangliang
    He, Bo
    Yan, Tianhong
    OCEANS 2017 - ANCHORAGE, 2017,
  • [47] Iris Recognition Using Convolutional Neural Network
    Zhuang, Yuan
    Chuah, Joon Huang
    Chow, Chee Onn
    Lim, Marcus Guozong
    2020 IEEE 10TH INTERNATIONAL CONFERENCE ON SYSTEM ENGINEERING AND TECHNOLOGY (ICSET), 2020, : 134 - 138
  • [48] Face Recognition Based on Convolutional Neural Network
    Coskun, Musab
    Ucar, Aysegul
    Yildirim, Ozal
    Demir, Yakup
    2017 INTERNATIONAL CONFERENCE ON MODERN ELECTRICAL AND ENERGY SYSTEMS (MEES), 2017, : 376 - 379
  • [49] A Convolutional Neural Network for Emotion Assessment and Recognition
    Anyanwu, Comfort
    Hays, Caitlin
    2022 IEEE 19TH INTERNATIONAL CONFERENCE ON MOBILE AD HOC AND SMART SYSTEMS (MASS 2022), 2022, : 759 - 763
  • [50] A Convolutional Neural Network for Smoking Activity Recognition
    Alharbi, Fayez
    Farrahi, Katayoun
    2018 IEEE 20TH INTERNATIONAL CONFERENCE ON E-HEALTH NETWORKING, APPLICATIONS AND SERVICES (HEALTHCOM), 2018,