Robust speaker detection using Neural Networks

被引:0
|
作者
Shell, John R. [1 ]
机构
[1] So Illinois Univ, Dept Elect & Comp Engn, Carbondale, IL 62901 USA
关键词
Neural Networks; speech recognition; modeling;
D O I
暂无
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
The work proposed in this paper utilizes Neural Networks to distinguish speech patterns. A feature extractor is used as a standard Linear Processing Coefficients (LPC) Cepstrum coder, converting the incoming speech signal captured by a Matlab interface into LPC Cepstrum feature space. A Neural Network makes each variable length LPC trajectory of an isolated word into a fixed length LPC trajectory and makes the fixed length feature vector that is fed into a recognizer. The design of the recognizer uses a Feed Forward (FF) and Back Propagation (BP) Network approach tested with variable hidden layers with Transfer functions of hyperbolic tangent and sigmoid to test the signal output for the recognition of the feature vectors of isolated words. The feature vector was normalized and decorrelated by pruning techniques. The training process uses momentum to find the global minima of the error surface avoiding the oscillations in local minima. The goal of the work is to consistently identify a randomly chosen speech pattern from the samples of four different speakers uttering the same phrase 100% of the time and to verify the effectiveness of neural networks as a valid method in pattern recognition.
引用
收藏
页码:414 / 419
页数:6
相关论文
共 50 条
  • [21] Audio Replay Attack Detection for Speaker Verification System Using Convolutional Neural Networks
    Kemanth, P. J.
    Supanekar, Sujata
    Koolagudi, Shashidhar G.
    PATTERN RECOGNITION AND MACHINE INTELLIGENCE, PREMI 2019, PT II, 2019, 11942 : 445 - 453
  • [22] Robust PPG Peak Detection Using Dilated Convolutional Neural Networks
    Kazemi, Kianoosh
    Laitala, Juho
    Azimi, Iman
    Liljeberg, Pasi
    Rahmani, Amir M.
    SENSORS, 2022, 22 (16)
  • [23] Speaker Diarization Using Deep Recurrent Convolutional Neural Networks for Speaker Embeddings
    Cyrta, Pawel
    Trzcinski, Tomasz
    Stokowiec, Wojciech
    INFORMATION SYSTEMS ARCHITECTURE AND TECHNOLOGY, PT I, 2018, 655 : 107 - 117
  • [24] Speaker identification from voice using neural networks
    Biswas, B
    Konar, A
    JOURNAL OF SCIENTIFIC & INDUSTRIAL RESEARCH, 2002, 61 (08): : 599 - 606
  • [25] Speaker Recognition Using Neural Networks and Conventional Classifiers
    Farrell, Kevin R.
    Mammone, Richard J.
    Assaleh, Khaled T.
    IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1994, 2 (01): : 194 - 205
  • [26] SPEAKER ADAPTIVE TRAINING USING DEEP NEURAL NETWORKS
    Ochiai, Tsubasa
    Matsuda, Shigeki
    Lu, Xugang
    Hori, Chiori
    Katagiri, Shigeru
    2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
  • [27] Coevolutionary approach to speaker identification using neural networks
    He, XM
    Hu, GR
    Tan, ZH
    2000 5TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING PROCEEDINGS, VOLS I-III, 2000, : 1572 - 1575
  • [28] AN APPLICATION OF SPEAKER RECOGNITION USING ARTIFICIAL NEURAL NETWORKS
    Caner, Murat
    Ustun, Seydi Vakkas
    PAMUKKALE UNIVERSITY JOURNAL OF ENGINEERING SCIENCES-PAMUKKALE UNIVERSITESI MUHENDISLIK BILIMLERI DERGISI, 2006, 12 (02): : 279 - 284
  • [29] SPEAKER IDENTIFICATION AND CLUSTERING USING CONVOLUTIONAL NEURAL NETWORKS
    Lukic, Yanick
    Vogt, Carlo
    Durr, Oliver
    Stadelmann, Thilo
    2016 IEEE 26TH INTERNATIONAL WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING (MLSP), 2016,
  • [30] Speaker recognition using convolutional siamese neural networks
    Jung H.
    Yoon S.
    Park N.
    Transactions of the Korean Institute of Electrical Engineers, 2020, 69 (01): : 164 - 169