Speech Command Recognition Based on Convolutional Spiking Neural Networks

Cited by: 5

Authors:

Sadovsky, Erik [1]

Jakubec, Maros [1]

Jarina, Roman [1]

Affiliations:

[1] Univ Zilina, Dept Multimedia & Informat Commun Technol, FEIT, Zilina, Slovakia

Source:

2023 33RD INTERNATIONAL CONFERENCE RADIOELEKTRONIKA, RADIOELEKTRONIKA | 2023

Keywords:

spiking neural network; spiking speech commands; command recognition; convolutional spiking neural network

DOI:

10.1109/RADIOELEKTRONIKA57919.2023.10109082

Chinese Library Classification (CLC):

TM [Electrical Engineering]; TN [Electronic Technology, Communication Technology]

Discipline classification codes:

0808; 0809

Abstract:

This article presents a new technique for speech recognition that combines Convolutional Neural Networks (CNNs) with Spiking Neural Networks (SNNs) to create an SNN-CNN model. The model is tested on the Google Speech Command Dataset and achieves an accuracy of 72.03%, which is similar to the current state-of-the-art speech recognition methods. The study also compares the performance of the SNN-CNN model with other SNN models that use Multi-Layer Perceptrons (MLPs) and traditional Artificial Neural Networks (ANNs). The results show that the CNN-based SNNs outperform both MLPs and ANNs, demonstrating the superiority of the proposed model. The approach presented in this study can potentially be applied to other speech recognition tasks and could lead to further improvements in the field.

Pages: 5
