Speaker Recognition with Deep Learning Approaches: A Review

被引:0
|
作者
Alenizi, Abdulrahman S. [1 ]
Al-Karawi, Khamis A. [2 ]
机构
[1] PAAET, Shuwaikh Ind, Kuwait
[2] Diyala Univ, Baqubah, Diyala, Iraq
关键词
Deep learning text independence; Feature extraction; Statistical models; Discriminative models; Speaker identification; And speaker verification; MACHINES; NOISE;
D O I
10.1007/978-981-97-3289-0_39
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This article gives an overview of the methods for using deep learning to identify and verify speakers. Speaker recognition is an everyday use of speech technology. Many research initiatives have been carried out in the past few years, but little progress has been achieved. But just as deep learning techniques are replacing previous state-of-the-art approaches in speech recognition, they are also developing in most machine learning fields. Deep learning seems to have evolved into the most advanced speaker verification and identification technique. Most novel efforts start with the common x-vectors in addition to i-vectors. The increasing volume of data gathered makes the area where deep learning is most effective more accessible.
引用
收藏
页码:481 / 499
页数:19
相关论文
共 50 条
  • [21] A review of speaker diarization: Recent advances with deep learning
    Park, Tae Jin
    Kanda, Naoyuki
    Dimitriadis, Dimitrios
    Han, Kyu J.
    Watanabe, Shinji
    Narayanan, Shrikanth
    COMPUTER SPEECH AND LANGUAGE, 2022, 72
  • [22] Deep Learning Approaches for Phantom Movement Recognition
    Akbulut, Akhan
    Asci, Guven
    Gungor, Feray
    Tarakci, Ela
    Aydin, Muhammed Ali
    Zaim, Abdul Halim
    2019 MEDICAL TECHNOLOGIES CONGRESS (TIPTEKNO), 2019, : 487 - 490
  • [23] Review Article on Deep Learning approaches
    Modi, Arpit Sunilkumar
    PROCEEDINGS OF THE 2018 SECOND INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTING AND CONTROL SYSTEMS (ICICCS), 2018, : 1635 - 1639
  • [24] A deep learning approach to integrate convolutional neural networks in speaker recognition
    Hourri, Soufiane
    Nikolov, Nikola S.
    Kharroubi, Jamal
    INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2020, 23 (03) : 615 - 623
  • [25] Sign Language Recognition: A Comprehensive Review of Traditional and Deep Learning Approaches, Datasets, and Challenges
    Tao, Tangfei
    Zhao, Yizhe
    Liu, Tianyu
    Zhu, Jieli
    IEEE ACCESS, 2024, 12 : 75034 - 75060
  • [26] A deep learning approach to integrate convolutional neural networks in speaker recognition
    Soufiane Hourri
    Nikola S. Nikolov
    Jamal Kharroubi
    International Journal of Speech Technology, 2020, 23 : 615 - 623
  • [27] Machine Learning and Deep Learning Approaches for Arabic Sign Language Recognition: A Decade Systematic Literature Review
    Alayed, Asmaa
    SENSORS, 2024, 24 (23)
  • [28] A tibetan-dependent speaker recognition method based on deep learning
    Gan, Zhen-ye
    Yu, Yue
    Luo, Min
    MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (21) : 30821 - 30840
  • [29] A tibetan-dependent speaker recognition method based on deep learning
    Zhen-ye Gan
    Yue Yu
    Min Luo
    Multimedia Tools and Applications, 2022, 81 : 30821 - 30840
  • [30] A Review of Deep Learning in Image Recognition
    Pak, Myeongsuk
    Kim, Sanghoon
    2017 4TH INTERNATIONAL CONFERENCE ON COMPUTER APPLICATIONS AND INFORMATION PROCESSING TECHNOLOGY (CAIPT), 2017, : 367 - 369