Speech Command Recognition Based on Convolutional Spiking Neural Networks

被引:5
|
作者
Sadovsky, Erik [1 ]
Jakubec, Maros [1 ]
Jarina, Roman [1 ]
机构
[1] Univ Zilina, Dept Multimedia & Informat Commun Technol, FEIT, Zilina, Slovakia
关键词
spiking neural network; spiking speech commands; command recognition; convolutional spiking neural network;
D O I
10.1109/RADIOELEKTRONIKA57919.2023.10109082
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
This article presents a new technique for speech recognition that combines Convolutional Neural Networks (CNNs) with Spiking Neural Networks (SNNs) to create an SNN-CNN model. The model is tested on the Google Speech Command Dataset and achieves an accuracy of 72.03%, which is similar to the current state-of-the-art speech recognition methods. The study also compares the performance of the SNN-CNN model with other SNN models that use Multi-Layer Perceptrons (MLPs) and traditional Artificial Neural Networks (ANNs). The results show that the CNN-based SNNs outperform both MLPs and ANNs, demonstrating the superiority of the proposed model. The approach presented in this study can potentially be applied to other speech recognition tasks and could lead to further improvements in the field.
引用
收藏
页数:5
相关论文
共 50 条
  • [21] The spiking neural network based on fMRI for speech recognition
    Song, Yihua
    Guo, Lei
    Man, Menghua
    Wu, Youxi
    PATTERN RECOGNITION, 2024, 155
  • [22] Speech emotion recognition with deep convolutional neural networks
    Issa, Dias
    Demirci, M. Fatih
    Yazici, Adnan
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2020, 59
  • [23] CONVOLUTIONAL NEURAL NETWORKS-BASED CONTINUOUS SPEECH RECOGNITION USING RAW SPEECH SIGNAL
    Palaz, Dimitri
    Magimai-Doss, Mathew
    Collobert, Ronan
    2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 4295 - 4299
  • [24] Hybrid convolutional neural networks for articulatory and acoustic information based speech recognition
    Mitra, Vikramjit
    Sivaraman, Ganesh
    Nam, Hosung
    Espy-Wilson, Carol
    Saltzman, Elliot
    Tiede, Mark
    SPEECH COMMUNICATION, 2017, 89 : 103 - 112
  • [25] An Experimental Study of Speech Emotion Recognition Based on Deep Convolutional Neural Networks
    Zheng, W. Q.
    Yu, J. S.
    Zou, Y. X.
    2015 INTERNATIONAL CONFERENCE ON AFFECTIVE COMPUTING AND INTELLIGENT INTERACTION (ACII), 2015, : 827 - 831
  • [26] Recognition of Electromagnetic Signals Based on the Spiking Convolutional Neural Network
    Tao S.
    Xiao S.
    Gong S.
    Wang H.
    Ding H.
    Wang H.
    Wireless Communications and Mobile Computing, 2022, 2022
  • [27] Deep Spiking Neural Networks for Large Vocabulary Automatic Speech Recognition
    Wu, Jibin
    Yilmaz, Emre
    Zhang, Malu
    Li, Haizhou
    Tan, Kay Chen
    FRONTIERS IN NEUROSCIENCE, 2020, 14
  • [28] Exploration of rank order coding with spiking neural networks for speech recognition
    Loiselle, S
    Rouat, J
    Pressnitzer, D
    Thorpe, S
    PROCEEDINGS OF THE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), VOLS 1-5, 2005, : 2076 - 2080
  • [29] IMPROVING CONVOLUTIONAL RECURRENT NEURAL NETWORKS FOR SPEECH EMOTION RECOGNITION
    Meyer, Patrick
    Xu, Ziyi
    Fingscheidt, Tim
    2021 IEEE SPOKEN LANGUAGE TECHNOLOGY WORKSHOP (SLT), 2021, : 365 - 372
  • [30] VERY DEEP CONVOLUTIONAL NEURAL NETWORKS FOR ROBUST SPEECH RECOGNITION
    Qian, Yanmin
    Woodland, Philip C.
    2016 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY (SLT 2016), 2016, : 481 - 488