Speech Command Recognition Based on Convolutional Spiking Neural Networks

被引:5
|
作者
Sadovsky, Erik [1 ]
Jakubec, Maros [1 ]
Jarina, Roman [1 ]
机构
[1] Univ Zilina, Dept Multimedia & Informat Commun Technol, FEIT, Zilina, Slovakia
关键词
spiking neural network; spiking speech commands; command recognition; convolutional spiking neural network;
D O I
10.1109/RADIOELEKTRONIKA57919.2023.10109082
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
This article presents a new technique for speech recognition that combines Convolutional Neural Networks (CNNs) with Spiking Neural Networks (SNNs) to create an SNN-CNN model. The model is tested on the Google Speech Command Dataset and achieves an accuracy of 72.03%, which is similar to the current state-of-the-art speech recognition methods. The study also compares the performance of the SNN-CNN model with other SNN models that use Multi-Layer Perceptrons (MLPs) and traditional Artificial Neural Networks (ANNs). The results show that the CNN-based SNNs outperform both MLPs and ANNs, demonstrating the superiority of the proposed model. The approach presented in this study can potentially be applied to other speech recognition tasks and could lead to further improvements in the field.
引用
收藏
页数:5
相关论文
共 50 条
  • [31] Gender Differentiated Convolutional Neural Networks for Speech Emotion Recognition
    Mishra, Puneet
    Sharma, Ruchir
    2020 12TH INTERNATIONAL CONGRESS ON ULTRA MODERN TELECOMMUNICATIONS AND CONTROL SYSTEMS AND WORKSHOPS (ICUMT 2020), 2020, : 142 - 148
  • [32] FSER: Deep Convolutional Neural Networks for Speech Emotion Recognition
    Dossou, Bonaventure F. P.
    Gbenou, Yeno K. S.
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW 2021), 2021, : 3526 - 3531
  • [33] SPEECH EMOTION RECOGNITION USING QUATERNION CONVOLUTIONAL NEURAL NETWORKS
    Muppidi, Aneesh
    Radfar, Martin
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 6309 - 6313
  • [34] Efficient GPU implementation of convolutional neural networks for speech recognition
    van den Berg, Ewout
    Brand, Daniel
    Bordawekar, Rajesh
    Rachevsky, Leonid
    Ramabhadran, Bhuvana
    16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 1483 - 1487
  • [35] Speech Recognition of Punjabi Numerals Using Convolutional Neural Networks
    Aditi, Thakur
    Karun, Verma
    ADVANCES IN COMPUTER COMMUNICATION AND COMPUTATIONAL SCIENCES, VOL 1, 2019, 759 : 61 - 69
  • [36] Speech Emotion Recognition using Convolutional and Recurrent Neural Networks
    Lim, Wootaek
    Jang, Daeyoung
    Lee, Taejin
    2016 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA), 2016,
  • [37] Spiking Deep Convolutional Neural Networks for Energy-Efficient Object Recognition
    Yongqiang Cao
    Yang Chen
    Deepak Khosla
    International Journal of Computer Vision, 2015, 113 : 54 - 66
  • [38] Spiking Deep Convolutional Neural Networks for Energy-Efficient Object Recognition
    Cao, Yongqiang
    Chen, Yang
    Khosla, Deepak
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2015, 113 (01) : 54 - 66
  • [39] Radar Emitter Recognition Based on Spiking Neural Networks
    Luo, Zhenghao
    Wang, Xingdong
    Yuan, Shuo
    Liu, Zhangmeng
    REMOTE SENSING, 2024, 16 (14)
  • [40] Emotion Recognition System from Speech and Visual Information based on Convolutional Neural Networks
    Ristea, Nicolae-Catalin
    Dutu, Liviu Cristian
    Radoi, Anamaria
    2019 10TH INTERNATIONAL CONFERENCE ON SPEECH TECHNOLOGY AND HUMAN-COMPUTER DIALOGUE (SPED), 2019,