Robust speaker detection using Neural Networks

被引：0

作者：

Shell, John R. ^{[1
]}

机构：

[1] So Illinois Univ, Dept Elect & Comp Engn, Carbondale, IL 62901 USA

来源：

PROCEEDINGS OF THE EIGHTH IASTED INTERNATIONAL CONFERENCE ON SIGNAL AND IMAGE PROCESSING | 2006年

关键词：

Neural Networks; speech recognition; modeling;

D O I：

暂无

中图分类号：

TP31 [计算机软件];

学科分类号：

081202 ; 0835 ;

摘要：

The work proposed in this paper utilizes Neural Networks to distinguish speech patterns. A feature extractor is used as a standard Linear Processing Coefficients (LPC) Cepstrum coder, converting the incoming speech signal captured by a Matlab interface into LPC Cepstrum feature space. A Neural Network makes each variable length LPC trajectory of an isolated word into a fixed length LPC trajectory and makes the fixed length feature vector that is fed into a recognizer. The design of the recognizer uses a Feed Forward (FF) and Back Propagation (BP) Network approach tested with variable hidden layers with Transfer functions of hyperbolic tangent and sigmoid to test the signal output for the recognition of the feature vectors of isolated words. The feature vector was normalized and decorrelated by pruning techniques. The training process uses momentum to find the global minima of the error surface avoiding the oscillations in local minima. The goal of the work is to consistently identify a randomly chosen speech pattern from the samples of four different speakers uttering the same phrase 100% of the time and to verify the effectiveness of neural networks as a valid method in pattern recognition.

引用

页码：414 / 419

页数：6

共 50 条

[21] Audio Replay Attack Detection for Speaker Verification System Using Convolutional Neural Networks
Kemanth, P. J.
Supanekar, Sujata
Koolagudi, Shashidhar G.
PATTERN RECOGNITION AND MACHINE INTELLIGENCE, PREMI 2019, PT II, 2019, 11942 : 445 - 453
[22] Robust PPG Peak Detection Using Dilated Convolutional Neural Networks
Kazemi, Kianoosh
Laitala, Juho
Azimi, Iman
Liljeberg, Pasi
Rahmani, Amir M.
SENSORS, 2022, 22 (16)
[23] Speaker Diarization Using Deep Recurrent Convolutional Neural Networks for Speaker Embeddings
Cyrta, Pawel
Trzcinski, Tomasz
Stokowiec, Wojciech
INFORMATION SYSTEMS ARCHITECTURE AND TECHNOLOGY, PT I, 2018, 655 : 107 - 117
[24] Speaker identification from voice using neural networks
Biswas, B
Konar, A
JOURNAL OF SCIENTIFIC & INDUSTRIAL RESEARCH, 2002, 61 (08): : 599 - 606
[25] Speaker Recognition Using Neural Networks and Conventional Classifiers
Farrell, Kevin R.
Mammone, Richard J.
Assaleh, Khaled T.
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1994, 2 (01): : 194 - 205
[26] SPEAKER ADAPTIVE TRAINING USING DEEP NEURAL NETWORKS
Ochiai, Tsubasa
Matsuda, Shigeki
Lu, Xugang
Hori, Chiori
Katagiri, Shigeru
2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
[27] Coevolutionary approach to speaker identification using neural networks
He, XM
Hu, GR
Tan, ZH
2000 5TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING PROCEEDINGS, VOLS I-III, 2000, : 1572 - 1575
[28] AN APPLICATION OF SPEAKER RECOGNITION USING ARTIFICIAL NEURAL NETWORKS
Caner, Murat
Ustun, Seydi Vakkas
PAMUKKALE UNIVERSITY JOURNAL OF ENGINEERING SCIENCES-PAMUKKALE UNIVERSITESI MUHENDISLIK BILIMLERI DERGISI, 2006, 12 (02): : 279 - 284
[29] SPEAKER IDENTIFICATION AND CLUSTERING USING CONVOLUTIONAL NEURAL NETWORKS
Lukic, Yanick
Vogt, Carlo
Durr, Oliver
Stadelmann, Thilo
2016 IEEE 26TH INTERNATIONAL WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING (MLSP), 2016,
[30] Speaker recognition using convolutional siamese neural networks
Jung H.
Yoon S.
Park N.
Transactions of the Korean Institute of Electrical Engineers, 2020, 69 (01): : 164 - 169

← 1 2 3 4 5 →