Speaker Recognition with Deep Learning Approaches: A Review

Cited by: 0
Authors
Alenizi, Abdulrahman S. [1]
Al-Karawi, Khamis A. [2]
Affiliations
[1] PAAET, Shuwaikh Ind, Kuwait
[2] Diyala Univ, Baqubah, Diyala, Iraq
Keywords
Deep learning text independence; Feature extraction; Statistical models; Discriminative models; Speaker identification; Speaker verification; MACHINES; NOISE
DOI
10.1007/978-981-97-3289-0_39
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
This article gives an overview of deep learning methods for speaker identification and verification. Speaker recognition is an everyday application of speech technology. Many research initiatives have been carried out in the past few years, but little progress has been achieved. However, just as deep learning techniques are replacing previous state-of-the-art approaches in speech recognition, they are also advancing in most machine learning fields. Deep learning appears to have become the most advanced technique for speaker verification and identification. Most novel efforts start with the common x-vectors in addition to i-vectors. The increasing volume of data gathered makes the area where deep learning is most effective more accessible.
Pages: 481-499
Number of pages: 19
Related papers
50 in total
  • [31] Machine Learning and Deep Learning Approaches for CyberSecurity: A Review
    Halbouni, Asmaa
    Gunawan, Teddy Surya
    Habaebi, Mohamed Hadi
    Halbouni, Murad
    Kartiwi, Mira
    Ahmad, Robiah
    IEEE ACCESS, 2022, 10 : 19572 - 19585
  • [32] Applications of Deep Learning Approaches in Speech Recognition: A Survey
    Al-Janabi, Sameer I. Ali
    Lateef, Ali Azawii Abdul
    PROCEEDINGS OF INTERNATIONAL CONFERENCE ON COMPUTING AND COMMUNICATION NETWORKS (ICCCN 2021), 2022, 394 : 189 - 196
  • [33] Learning to Fool the Speaker Recognition
    Li, Jiguo
    Zhang, Xinfeng
    Xu, Jizheng
    Ma, Siwei
    Gao, Wen
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2021, 17 (03)
  • [34] LEARNING TO FOOL THE SPEAKER RECOGNITION
    Li, Jiguo
    Zhang, Xinfeng
    Xu, Jizheng
    Zhang, Li
    Wang, Yue
    Ma, Siwei
    Gao, Wen
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 2937 - 2941
  • [35] A Review of Local Feature Algorithms and Deep Learning Approaches in Facial Expression Recognition with Tensorflow and Keras
    Chengeta, Kennedy
    PATTERN RECOGNITION, MCPR 2019, 2019, 11524 : 127 - 138
  • [36] Reliable Visualization for Deep Speaker Recognition
    Li, Pengqi
    Li, Lantian
    Hamdulla, Askar
    Wang, Dong
    INTERSPEECH 2022, 2022, : 331 - 335
  • [37] Deep Speaker Recognition: Modular or Monolithic?
    Bhattacharya, Gautam
    Alam, Jahangir
    Kenny, Patrick
    INTERSPEECH 2019, 2019, : 1143 - 1147
  • [38] A deep learning approach for text-independent speaker recognition with short utterances
    Chakroun, Rania
    Frikha, Mondher
    Multimedia Tools and Applications, 2023, 82 : 33111 - 33133
  • [39] Deep Learning Approaches on Image Captioning: A Review
    Ghandi, Taraneh
    Pourreza, Hamidreza
    Mahyar, Hamidreza
    ACM COMPUTING SURVEYS, 2024, 56 (03)
  • [40] DEEP LEARNING APPROACHES FOR CLASSIFYING DATA: A REVIEW
    Bikku, Thulasi
    Sree, K. P. N. V. Satya
    JOURNAL OF ENGINEERING SCIENCE AND TECHNOLOGY, 2020, 15 (04) : 2580 - 2594