Speaker Recognition with Deep Learning Approaches: A Review

被引:0
|
作者
Alenizi, Abdulrahman S. [1 ]
Al-Karawi, Khamis A. [2 ]
机构
[1] PAAET, Shuwaikh Ind, Kuwait
[2] Diyala Univ, Baqubah, Diyala, Iraq
关键词
Deep learning text independence; Feature extraction; Statistical models; Discriminative models; Speaker identification; And speaker verification; MACHINES; NOISE;
D O I
10.1007/978-981-97-3289-0_39
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This article gives an overview of the methods for using deep learning to identify and verify speakers. Speaker recognition is an everyday use of speech technology. Many research initiatives have been carried out in the past few years, but little progress has been achieved. But just as deep learning techniques are replacing previous state-of-the-art approaches in speech recognition, they are also developing in most machine learning fields. Deep learning seems to have evolved into the most advanced speaker verification and identification technique. Most novel efforts start with the common x-vectors in addition to i-vectors. The increasing volume of data gathered makes the area where deep learning is most effective more accessible.
引用
收藏
页码:481 / 499
页数:19
相关论文
共 50 条
  • [11] DOMAIN ROBUST DEEP EMBEDDING LEARNING FOR SPEAKER RECOGNITION
    Hu, Hang-Rui
    Song, Yan
    Liu, Ying
    Dai, Li-Rong
    McLoughlin, Ian
    Liu, Lin
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 7182 - 7186
  • [12] A Comparison of MFCC and LPCC with Deep Learning for Speaker Recognition
    Yang, Haiyan
    Deng, Yanrong
    Zhao, Hua-An
    ICBDC 2019: PROCEEDINGS OF 2019 4TH INTERNATIONAL CONFERENCE ON BIG DATA AND COMPUTING, 2019, : 160 - 164
  • [13] A Review of Local, Holistic and Deep Learning Approaches in Facial Expressions Recognition
    Chengeta, Kennedy
    Viriri, Serestina
    2019 CONFERENCE ON INFORMATION COMMUNICATIONS TECHNOLOGY AND SOCIETY (ICTAS), 2019,
  • [14] Deep Learning Approaches for Continuous Sign Language Recognition: A Comprehensive Review
    Khan, Asma
    Jin, Seyong
    Lee, Geon-Hee
    Arzu, Gul E.
    Dang, L. Minh
    Nguyen, Tan N.
    Choi, Woong
    Moon, Hyeonjoon
    IEEE ACCESS, 2025, 13 : 55524 - 55544
  • [15] Improved Deep Speaker Feature Learning for Text-Dependent Speaker Recognition
    Li, Lantian
    Lin, Yiye
    Zhang, Zhiyong
    Wang, Dong
    2015 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA), 2015, : 426 - 429
  • [16] Curriculum Learning Based Approaches for Noise Robust Speaker Recognition
    Ranjan, Shivesh
    Hansen, John H. L.
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2018, 26 (01) : 197 - 210
  • [17] CENTROID-BASED DEEP METRIC LEARNING FOR SPEAKER RECOGNITION
    Wang, Jixuan
    Wang, Kuan-Chieh
    Law, Marc T.
    Rudzicz, Frank
    Brudno, Michael
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 3652 - 3656
  • [18] A DISCRIMINATIVE UNSUPERVISED METHOD FOR SPEAKER RECOGNITION USING DEEP LEARNING
    Saleem, Muhammad Muneeb
    Hansen, John H. L.
    2016 IEEE 26TH INTERNATIONAL WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING (MLSP), 2016,
  • [19] A Review on Deep Learning-Based Approaches for Automatic Sonar Target Recognition
    Neupane, Dhiraj
    Seok, Jongwon
    ELECTRONICS, 2020, 9 (11) : 1 - 30
  • [20] Automatic Speaker Recognition using Transfer Learning Approach of Deep Learning Models
    Ganvir, Sonal
    Lal, Nidhi
    PROCEEDINGS OF THE 6TH INTERNATIONAL CONFERENCE ON INVENTIVE COMPUTATION TECHNOLOGIES (ICICT 2021), 2021, : 595 - 601