Speech Command Recognition Based on Convolutional Spiking Neural Networks

被引:5
|
作者
Sadovsky, Erik [1 ]
Jakubec, Maros [1 ]
Jarina, Roman [1 ]
机构
[1] Univ Zilina, Dept Multimedia & Informat Commun Technol, FEIT, Zilina, Slovakia
关键词
spiking neural network; spiking speech commands; command recognition; convolutional spiking neural network;
D O I
10.1109/RADIOELEKTRONIKA57919.2023.10109082
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
This article presents a new technique for speech recognition that combines Convolutional Neural Networks (CNNs) with Spiking Neural Networks (SNNs) to create an SNN-CNN model. The model is tested on the Google Speech Command Dataset and achieves an accuracy of 72.03%, which is similar to the current state-of-the-art speech recognition methods. The study also compares the performance of the SNN-CNN model with other SNN models that use Multi-Layer Perceptrons (MLPs) and traditional Artificial Neural Networks (ANNs). The results show that the CNN-based SNNs outperform both MLPs and ANNs, demonstrating the superiority of the proposed model. The approach presented in this study can potentially be applied to other speech recognition tasks and could lead to further improvements in the field.
引用
收藏
页数:5
相关论文
共 50 条
  • [1] Neuromorphic Speech Recognition With Photonic Convolutional Spiking Neural Networks
    Xiang, Shuiying
    Zhang, Tianrui
    Han, Yanan
    Guo, Xingxing
    Zhang, Yahui
    Shi, Yuechun
    Hao, Yue
    IEEE JOURNAL OF SELECTED TOPICS IN QUANTUM ELECTRONICS, 2023, 29 (06)
  • [2] Temporal Feedback Convolutional Recurrent Neural Networks for Speech Command Recognition
    Kim, Taejun
    Nam, Juhan
    PROCEEDINGS OF 2022 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2022, : 437 - 441
  • [3] Speech Recognition Based on Convolutional Neural Networks
    Du Guiming
    Wang Xia
    Wang Guangyan
    Zhang Yan
    Li Dan
    2016 IEEE INTERNATIONAL CONFERENCE ON SIGNAL AND IMAGE PROCESSING (ICSIP), 2016, : 708 - 711
  • [4] Speech emotion recognition based on spiking neural network and convolutional neural network
    Du, Chengyan
    Liu, Fu
    Kang, Bing
    Hou, Tao
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2025, 147
  • [5] Convolutional Neural Networks for Speech Recognition
    Abdel-Hamid, Ossama
    Mohamed, Abdel-Rahman
    Jiang, Hui
    Deng, Li
    Penn, Gerald
    Yu, Dong
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2014, 22 (10) : 1533 - 1545
  • [6] A Biologically Plausible Speech Recognition Framework Based on Spiking Neural Networks
    Wu, Jibin
    Chua, Yansong
    Li, Haizhou
    2018 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2018,
  • [7] LOW-ACTIVITY SUPERVISED CONVOLUTIONAL SPIKING NEURAL NETWORKS APPLIED TO SPEECH COMMANDS RECOGNITION
    Pellegrini, Thomas
    Zimmer, Romain
    Masquelier, Timothee
    2021 IEEE SPOKEN LANGUAGE TECHNOLOGY WORKSHOP (SLT), 2021, : 97 - 103
  • [8] Speech emotion recognition using spiking neural networks
    Buscicchio, Cosimo A.
    Gorecki, Przemyslaw
    Caponetti, Laura
    FOUNDATIONS OF INTELLIGENT SYSTEMS, PROCEEDINGS, 2006, 4203 : 38 - 46
  • [9] Low Latency Based Convolutional Recurrent Neural Network Model for Speech Command Recognition
    Kinkar, Chhayarani Ram
    Jain, Yogendra Kumar
    INFORMATION TECHNOLOGY AND CONTROL, 2021, 50 (04): : 656 - 673
  • [10] STDP-based spiking deep convolutional neural networks for object recognition
    Kheradpisheh, Saeed Reza
    Ganjtabesh, Mohammad
    Thorpe, Simon J.
    Masquelier, Timothee
    NEURAL NETWORKS, 2018, 99 : 56 - 67