Hierarchical Speech Recognition System Using MFCC Feature Extraction and Dynamic Spiking RSOM

被引:0
|
作者
Tarek, Behi [1 ]
Najet, Arous [1 ]
Noureddine, Ellouze [1 ]
机构
[1] Enit Univ Tunis El Manar, Natl Engn Sch Tunis, Lab Signal Image & Informat Technol, Tunis, Tunisia
关键词
Kohonen map; Temporal self organizing map; hierarchical self-organizing model; Spiking neural network; speech recognition;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we propose new variants of unsupervised and competitive learning algorithms designed to deal with temporal sequences. These algorithms combine features from Spiking Neural Networks (SNNs) and the advantages of the hierarchical self organizing map (HSOM). The first variant named Hierarchical Dynamic recurrent spiking self-organizing map (HD-RSSOM) is characterized by the integration of a temporal controller component to regulate the firing activity of the spiking neurons. The second variant is a hierarchical model which represents a multi-layer extension of HD-RSSOM model. The case study of the proposed HSOM variants is phonemes and words recognition in continuous speech. The applied HSOM variants serve as tools for developing intelligent systems and pursuing artificial intelligence applications.
引用
收藏
页码:41 / 46
页数:6
相关论文
共 50 条
  • [41] The speech recognition system based on bark wavelet MFCC
    Zhang, Xue-ying
    Bai, Jing
    Liang, Wu-zhou
    2006 8TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, VOLS 1-4, 2006, : 780 - +
  • [42] Hierarchical feature extraction for image recognition
    Partridge, M
    Jabri, M
    JOURNAL OF VLSI SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2002, 32 (1-2): : 157 - 167
  • [43] Hierarchical Feature Extraction for Image Recognition
    Matthew Partridge
    Marwan Jabri
    Journal of VLSI signal processing systems for signal, image and video technology, 2002, 32 : 157 - 167
  • [44] Non-Negative Subspace Projection During Conventional MFCC Feature Extraction for Noise Robust Speech Recognition
    Kumar, D. S. Pavan
    Bilgi, Raghavendra R.
    Umesh, S.
    2013 NATIONAL CONFERENCE ON COMMUNICATIONS (NCC), 2013,
  • [45] The speaker recognition system based on the dynamic MFCC
    Dong, Zhi-Feng
    Wang, Zeng-Fu
    Moshi Shibie yu Rengong Zhineng/Pattern Recognition and Artificial Intelligence, 2005, 18 (05): : 596 - 601
  • [46] Optimizing feature extraction for speech recognition
    Lee, CH
    Hyun, DH
    Choi, ES
    Go, JW
    Lee, CY
    IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2003, 11 (01): : 80 - 87
  • [47] Feature extraction for robust speech recognition
    Dharanipragada, S
    2002 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, VOL II, PROCEEDINGS, 2002, : 855 - 858
  • [48] Feature extraction for HMM speech recognition systems using DTW
    Go, J
    Hyun, D
    Lee, C
    6TH WORLD MULTICONFERENCE ON SYSTEMICS, CYBERNETICS AND INFORMATICS, VOL III, PROCEEDINGS: IMAGE, ACOUSTIC, SPEECH AND SIGNAL PROCESSING I, 2002, : 241 - 244
  • [49] Speech Gender Recognition Using a Multilayer Feature Extraction Method
    Abdulmohsin, Husam Ali
    Al-Khateeb, Belal
    Hasan, Samer Sami
    PROCEEDINGS OF INTERNATIONAL CONFERENCE ON COMPUTING AND COMMUNICATION NETWORKS (ICCCN 2021), 2022, 394 : 113 - 122
  • [50] Articulatory feature extraction for speech recognition using neural network
    Huda, Mohammad Nurul
    Hasan, Mohammad Mahedi
    Hassan, Foyzul
    Kotwal, Mohammed Rokibul Alam
    Muhammad, Ghulam
    Rahman, Chowdhury Mofizur
    International Review on Computers and Software, 2011, 6 (01) : 25 - 31