Speech Command Recognition Based on Convolutional Spiking Neural Networks

被引:5
|
作者
Sadovsky, Erik [1 ]
Jakubec, Maros [1 ]
Jarina, Roman [1 ]
机构
[1] Univ Zilina, Dept Multimedia & Informat Commun Technol, FEIT, Zilina, Slovakia
关键词
spiking neural network; spiking speech commands; command recognition; convolutional spiking neural network;
D O I
10.1109/RADIOELEKTRONIKA57919.2023.10109082
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
This article presents a new technique for speech recognition that combines Convolutional Neural Networks (CNNs) with Spiking Neural Networks (SNNs) to create an SNN-CNN model. The model is tested on the Google Speech Command Dataset and achieves an accuracy of 72.03%, which is similar to the current state-of-the-art speech recognition methods. The study also compares the performance of the SNN-CNN model with other SNN models that use Multi-Layer Perceptrons (MLPs) and traditional Artificial Neural Networks (ANNs). The results show that the CNN-based SNNs outperform both MLPs and ANNs, demonstrating the superiority of the proposed model. The approach presented in this study can potentially be applied to other speech recognition tasks and could lead to further improvements in the field.
引用
收藏
页数:5
相关论文
共 50 条
  • [41] Speech Emotion Recognition based on Multi-Level Residual Convolutional Neural Networks
    Zheng, Kai
    Xia, ZhiGuang
    Zhang, Yi
    Xu, Xuan
    Fu, Yaqin
    ENGINEERING LETTERS, 2020, 28 (02) : 559 - 565
  • [42] Continuous Speech Recognition based on Convolutional Neural Network
    Zhang, Qing-qing
    Liu, Yong
    Pan, Jie-lin
    Yan, Yong-hong
    SEVENTH INTERNATIONAL CONFERENCE ON DIGITAL IMAGE PROCESSING (ICDIP 2015), 2015, 9631
  • [43] Deep Convolutional Spiking Neural Network Based Hand Gesture Recognition
    Ke, Weijie
    Xing, Yannan
    Di Caterina, Gaetano
    Petropoulakis, Lykourgos
    Soraghan, John
    2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,
  • [44] GAIT RECOGNITION BASED ON CONVOLUTIONAL NEURAL NETWORKS
    Sokolova, A.
    Konushin, A.
    INTERNATIONAL WORKSHOP PHOTOGRAMMETRIC AND COMPUTER VISION TECHNIQUES FOR VIDEO SURVEILLANCE, BIOMETRICS AND BIOMEDICINE, 2017, 42-2 (W4): : 207 - 212
  • [45] Enabling On-Device Learning with Deep Spiking Neural Networks for Speech Recognition
    Soures, N. M.
    Kudithipudi, D.
    Jacobs-Gedrim, R. B.
    Agarwal, S.
    Marinella, M.
    SILICON COMPATIBLE MATERIALS, PROCESSES, AND TECHNOLOGIES FOR ADVANCED INTEGRATED CIRCUITS AND EMERGING APPLICATIONS 8, 2018, 85 (06): : 127 - 137
  • [46] Speech Emotion Recognition using Convolution Neural Networks and Deep Stride Convolutional Neural Networks
    Wani, Taiba Majid
    Gunawan, Teddy Surya
    Qadri, Syed Asif Ahmad
    Mansor, Hasmah
    Kartiwi, Mira
    Ismail, Nanang
    PROCEEDING OF 2020 6TH INTERNATIONAL CONFERENCE ON WIRELESS AND TELEMATICS (ICWT), 2020,
  • [47] Very Deep Convolutional Neural Networks for Noise Robust Speech Recognition
    Qian, Yanmin
    Bi, Mengxiao
    Tan, Tian
    Yu, Kai
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2016, 24 (12) : 2263 - 2276
  • [48] Deep Convolutional Neural Networks for Feature Extraction in Speech Emotion Recognition
    Heracleous, Panikos
    Mohammad, Yasser
    Yoneyama, Akio
    HUMAN-COMPUTER INTERACTION. RECOGNITION AND INTERACTION TECHNOLOGIES, HCI 2019, PT II, 2019, 11567 : 117 - 132
  • [49] Speech Emotion Recognition Using Convolutional Neural Networks with Attention Mechanism
    Mountzouris, Konstantinos
    Perikos, Isidoros
    Hatzilygeroudis, Ioannis
    Corchado, Juan M.
    Iglesias, Carlos A.
    Kim, Byung-Gyu
    Mehmood, Rashid
    Ren, Fuji
    Lee, In
    ELECTRONICS, 2023, 12 (20)
  • [50] Convolutional Maxout Neural Networks for Low-Resource Speech Recognition
    Cai, Meng
    Shi, Yongzhe
    Kang, Jian
    Liu, Jia
    Su, Tengrong
    2014 9TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2014, : 133 - +