Speaker Recognition with Deep Learning Approaches: A Review

Cited by: 0
Authors
Alenizi, Abdulrahman S. [1]
Al-Karawi, Khamis A. [2]
Affiliations
[1] PAAET, Shuwaikh Ind, Kuwait
[2] Diyala Univ, Baqubah, Diyala, Iraq
Keywords
Deep learning; Text independence; Feature extraction; Statistical models; Discriminative models; Speaker identification; Speaker verification; MACHINES; NOISE
DOI
10.1007/978-981-97-3289-0_39
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
This article gives an overview of deep learning methods for speaker identification and verification. Speaker recognition is an everyday application of speech technology. Many research initiatives have been carried out in the past few years, but little progress has been achieved. However, just as deep learning techniques are replacing previous state-of-the-art approaches in speech recognition, they are also advancing in most machine learning fields. Deep learning appears to have become the most advanced technique for speaker verification and identification. Most novel efforts take the common x-vectors, alongside i-vectors, as their starting point. The increasing volume of collected data makes the settings in which deep learning is most effective more accessible.
Pages: 481-499
Number of pages: 19