Speaker Recognition with Deep Learning Approaches: A Review

被引：0

作者：

Alenizi, Abdulrahman S. ^{[1
]}

Al-Karawi, Khamis A. ^{[2
]}

机构：

[1] PAAET, Shuwaikh Ind, Kuwait

[2] Diyala Univ, Baqubah, Diyala, Iraq

来源：

PROCEEDINGS OF NINTH INTERNATIONAL CONGRESS ON INFORMATION AND COMMUNICATION TECHNOLOGY, VOL 5, ICICT 2024 | 2024年 / 1000卷

关键词：

Deep learning text independence; Feature extraction; Statistical models; Discriminative models; Speaker identification; And speaker verification; MACHINES; NOISE;

D O I：

10.1007/978-981-97-3289-0_39

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

This article gives an overview of the methods for using deep learning to identify and verify speakers. Speaker recognition is an everyday use of speech technology. Many research initiatives have been carried out in the past few years, but little progress has been achieved. But just as deep learning techniques are replacing previous state-of-the-art approaches in speech recognition, they are also developing in most machine learning fields. Deep learning seems to have evolved into the most advanced speaker verification and identification technique. Most novel efforts start with the common x-vectors in addition to i-vectors. The increasing volume of data gathered makes the area where deep learning is most effective more accessible.

引用

页码：481 / 499

页数：19

共 50 条

[21] A review of speaker diarization: Recent advances with deep learning
Park, Tae Jin
Kanda, Naoyuki
Dimitriadis, Dimitrios
Han, Kyu J.
Watanabe, Shinji
Narayanan, Shrikanth
COMPUTER SPEECH AND LANGUAGE, 2022, 72
[22] Deep Learning Approaches for Phantom Movement Recognition
Akbulut, Akhan
Asci, Guven
Gungor, Feray
Tarakci, Ela
Aydin, Muhammed Ali
Zaim, Abdul Halim
2019 MEDICAL TECHNOLOGIES CONGRESS (TIPTEKNO), 2019, : 487 - 490
[23] Review Article on Deep Learning approaches
Modi, Arpit Sunilkumar
PROCEEDINGS OF THE 2018 SECOND INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTING AND CONTROL SYSTEMS (ICICCS), 2018, : 1635 - 1639
[24] A deep learning approach to integrate convolutional neural networks in speaker recognition
Hourri, Soufiane
Nikolov, Nikola S.
Kharroubi, Jamal
INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2020, 23 (03) : 615 - 623
[25] Sign Language Recognition: A Comprehensive Review of Traditional and Deep Learning Approaches, Datasets, and Challenges
Tao, Tangfei
Zhao, Yizhe
Liu, Tianyu
Zhu, Jieli
IEEE ACCESS, 2024, 12 : 75034 - 75060
[26] A deep learning approach to integrate convolutional neural networks in speaker recognition
Soufiane Hourri
Nikola S. Nikolov
Jamal Kharroubi
International Journal of Speech Technology, 2020, 23 : 615 - 623
[27] Machine Learning and Deep Learning Approaches for Arabic Sign Language Recognition: A Decade Systematic Literature Review
Alayed, Asmaa
SENSORS, 2024, 24 (23)
[28] A tibetan-dependent speaker recognition method based on deep learning
Gan, Zhen-ye
Yu, Yue
Luo, Min
MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (21) : 30821 - 30840
[29] A tibetan-dependent speaker recognition method based on deep learning
Zhen-ye Gan
Yue Yu
Min Luo
Multimedia Tools and Applications, 2022, 81 : 30821 - 30840
[30] A Review of Deep Learning in Image Recognition
Pak, Myeongsuk
Kim, Sanghoon
2017 4TH INTERNATIONAL CONFERENCE ON COMPUTER APPLICATIONS AND INFORMATION PROCESSING TECHNOLOGY (CAIPT), 2017, : 367 - 369

← 1 2 3 4 5 →