Hierarchical Speech Recognition System Using MFCC Feature Extraction and Dynamic Spiking RSOM

被引:0
|
作者
Tarek, Behi [1 ]
Najet, Arous [1 ]
Noureddine, Ellouze [1 ]
机构
[1] Enit Univ Tunis El Manar, Natl Engn Sch Tunis, Lab Signal Image & Informat Technol, Tunis, Tunisia
关键词
Kohonen map; Temporal self organizing map; hierarchical self-organizing model; Spiking neural network; speech recognition;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we propose new variants of unsupervised and competitive learning algorithms designed to deal with temporal sequences. These algorithms combine features from Spiking Neural Networks (SNNs) and the advantages of the hierarchical self organizing map (HSOM). The first variant named Hierarchical Dynamic recurrent spiking self-organizing map (HD-RSSOM) is characterized by the integration of a temporal controller component to regulate the firing activity of the spiking neurons. The second variant is a hierarchical model which represents a multi-layer extension of HD-RSSOM model. The case study of the proposed HSOM variants is phonemes and words recognition in continuous speech. The applied HSOM variants serve as tools for developing intelligent systems and pursuing artificial intelligence applications.
引用
收藏
页码:41 / 46
页数:6
相关论文
共 50 条
  • [31] Emotion Recognition in Speech Using MFCC and Classifiers
    Ajitha, G.
    Prashanth, Addagatla
    Radhika, Chelle
    Chaitanya, Kancharapu
    COMPUTATIONAL VISION AND BIO-INSPIRED COMPUTING ( ICCVBIC 2021), 2022, 1420 : 197 - 207
  • [32] Classification and Recognition of Underwater Target Based on MFCC Feature Extraction
    Tong, Yuze
    Zhang, Xin
    Ge, Yizhou
    2020 IEEE INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, COMMUNICATIONS AND COMPUTING (IEEE ICSPCC 2020), 2020,
  • [33] A Modified MFCC Feature Extraction Technique For Robust Speaker Recognition
    Sharma, Diksha
    Ali, Israj
    2015 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATIONS AND INFORMATICS (ICACCI), 2015, : 1052 - 1057
  • [34] Feature extraction for poultry vocalization recognition based on improved MFCC
    Key Laboratory of Agricultural Bioenvironmental Engineering, College of Water Conservancy and Civil Engineering, China Agricultural University, Beijing 100083, China
    Nongye Gongcheng Xuebao/Transactions of the Chinese Society of Agricultural Engineering, 2008, 24 (11): : 202 - 205
  • [35] Speech recognition as feature extraction for speaker recognition
    Stolcke, A.
    Shriberg, E.
    Ferrer, L.
    Kajarekar, S.
    Sonmez, K.
    Tur, G.
    2007 IEEE WORKSHOP ON SIGNAL PROCESSING APPLICATIONS FOR PUBLIC SECURITY AND FORENSICS, 2007, : 39 - +
  • [36] FEATURE EXTRACTION FOR A SPEECH RECOGNITION SYSTEM IN NOISY ENVIRONMENT: A STUDY\
    Shrawankar, Urmila
    Thakare, Vilas
    2010 SECOND INTERNATIONAL CONFERENCE ON COMPUTER ENGINEERING AND APPLICATIONS: ICCEA 2010, PROCEEDINGS, VOL 1, 2010, : 358 - 361
  • [37] Multiresolution Feature Extraction (MRFE) based speech recognition system
    Priyanka, M. Anbu Swarna
    Solomi, V. Sherlin
    Vijayalakshmi, P.
    Nagarajan, T.
    2013 INTERNATIONAL CONFERENCE ON RECENT TRENDS IN INFORMATION TECHNOLOGY (ICRTIT), 2013, : 152 - 156
  • [38] LPC AND LPCC METHOD OF FEATURE EXTRACTION IN SPEECH RECOGNITION SYSTEM
    Gupta, Harshita
    Gupta, Divya
    2016 6th International Conference - Cloud System and Big Data Engineering (Confluence), 2016, : 498 - 502
  • [39] Text-to-Speech Synthesis for Hindi Language Using MFCC and LPC Feature Extraction Techniques
    Sultana, Shaikh Naziya
    Deshmukh, Ratnadeep R.
    JOURNAL OF ADVANCED APPLIED SCIENTIFIC RESEARCH, 2024, 6 (03): : 148 - 161
  • [40] Study of Robust Feature Extraction Techniques for Speech Recognition System
    Sharma, Usha
    Maheshkar, Sushila
    Mishra, A. N.
    2015 1ST INTERNATIONAL CONFERENCE ON FUTURISTIC TRENDS ON COMPUTATIONAL ANALYSIS AND KNOWLEDGE MANAGEMENT (ABLAZE), 2015, : 666 - 670