LOW-ACTIVITY SUPERVISED CONVOLUTIONAL SPIKING NEURAL NETWORKS APPLIED TO SPEECH COMMANDS RECOGNITION

Cited by: 25
Authors
Pellegrini, Thomas [1]
Zimmer, Romain [1, 2]
Masquelier, Timothee [2]
Affiliations
[1] Univ Toulouse, IRIT, Toulouse, France
[2] Univ Toulouse 3, CNRS, CERCO UMR 5549, Toulouse, France
Keywords
Spiking neural networks; surrogate gradient; speech command recognition
DOI
10.1109/SLT48900.2021.9383587
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Deep Neural Networks (DNNs) are the current state-of-the-art models in many speech-related tasks. There is growing interest, however, in more biologically realistic, hardware-friendly and energy-efficient models known as Spiking Neural Networks (SNNs). It has recently been shown that SNNs can be trained efficiently, in a supervised manner, using backpropagation with a surrogate gradient trick. In this work, we report speech command (SC) recognition experiments using supervised SNNs. We explored the Leaky Integrate-and-Fire (LIF) neuron model for this task, and show that a model composed of stacked dilated convolution spiking layers can reach an error rate very close to that of standard DNNs on the Google SC v1 dataset: 5.5%, while keeping a very sparse spiking activity, below 5%, thanks to a new regularization term. We also show that modeling the leakage of the neuron membrane potential is useful, since the LIF model significantly outperformed its non-leaky counterpart.
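To make the leaky versus non-leaky distinction in the abstract concrete, here is a minimal sketch of a single discrete-time integrate-and-fire neuron. It is not the authors' implementation: the function names, the leak factor `beta`, the unit threshold, and the hard reset to zero are all assumptions chosen for illustration. Setting `beta=1.0` removes the leak and gives the non-leaky counterpart.

```python
def simulate_lif(inputs, beta=0.9, threshold=1.0):
    """Simulate one leaky integrate-and-fire neuron in discrete time.

    inputs    : sequence of input currents, one per time step
    beta      : membrane leak factor (assumed value); 1.0 means no leak
    threshold : firing threshold (assumed value)
    Returns a list of binary spikes, one per time step.
    """
    v = 0.0  # membrane potential
    spikes = []
    for x in inputs:
        v = beta * v + x  # decay the potential, then integrate the input
        if v >= threshold:
            spikes.append(1)
            v = 0.0  # hard reset after a spike (an assumption of this sketch)
        else:
            spikes.append(0)
    return spikes


def spiking_activity(spikes):
    """Fraction of time steps on which the neuron emitted a spike."""
    return sum(spikes) / len(spikes)


if __name__ == "__main__":
    current = [0.25] * 12  # constant input current, illustrative only
    leaky = simulate_lif(current, beta=0.9)
    non_leaky = simulate_lif(current, beta=1.0)
    print("leaky activity:    ", spiking_activity(leaky))
    print("non-leaky activity:", spiking_activity(non_leaky))
```

With the same constant input, the leaky neuron needs more time steps to reach threshold and therefore fires less often than the non-leaky one. The sparsity figure quoted in the abstract is a measure of this kind of activity across a whole network; the regularization term used to obtain it is not described in this record and is not reproduced here.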
Pages: 97-103
Page count: 7
Related papers
50 in total
  • [1] Bangla Short Speech Commands Recognition Using Convolutional Neural Networks
    Sumon, Shakil Ahmed
    Chowdhury, Joydip
    Debnath, Sujit
    Mohammed, Nabeel
    Momen, Sifat
    2018 INTERNATIONAL CONFERENCE ON BANGLA SPEECH AND LANGUAGE PROCESSING (ICBSLP), 2018,
  • [2] Speech Command Recognition Based on Convolutional Spiking Neural Networks
    Sadovsky, Erik
    Jakubec, Maros
    Jarina, Roman
    2023 33RD INTERNATIONAL CONFERENCE RADIOELEKTRONIKA, RADIOELEKTRONIKA, 2023,
  • [3] Neuromorphic Speech Recognition With Photonic Convolutional Spiking Neural Networks
    Xiang, Shuiying
    Zhang, Tianrui
    Han, Yanan
    Guo, Xingxing
    Zhang, Yahui
    Shi, Yuechun
    Hao, Yue
    IEEE JOURNAL OF SELECTED TOPICS IN QUANTUM ELECTRONICS, 2023, 29 (06)
  • [4] Convolutional Neural Networks for Speech Recognition
    Abdel-Hamid, Ossama
    Mohamed, Abdel-Rahman
    Jiang, Hui
    Deng, Li
    Penn, Gerald
    Yu, Dong
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2014, 22 (10) : 1533 - 1545
  • [5] Semi-Supervised Convolutional Neural Networks for Human Activity Recognition
    Zeng, Ming
    Yu, Tong
    Wang, Xiao
    Nguyen, Le T.
    Mengshoel, Ole J.
    Lane, Ian
    2017 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2017, : 522 - 529
  • [7] Convolutional Maxout Neural Networks for Low-Resource Speech Recognition
    Cai, Meng
    Shi, Yongzhe
    Kang, Jian
    Liu, Jia
    Su, Tengrong
    2014 9TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2014, : 133 - +
  • [8] Speech emotion recognition using spiking neural networks
    Buscicchio, Cosimo A.
    Gorecki, Przemyslaw
    Caponetti, Laura
    FOUNDATIONS OF INTELLIGENT SYSTEMS, PROCEEDINGS, 2006, 4203 : 38 - 46
  • [9] Speech emotion recognition based on spiking neural network and convolutional neural network
    Du, Chengyan
    Liu, Fu
    Kang, Bing
    Hou, Tao
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2025, 147
  • [10] Gated Convolutional LSTM for Speech Commands Recognition
    Wang, Dong
    Lv, Shaohe
    Wang, Xiaodong
    Lin, Xinye
    COMPUTATIONAL SCIENCE - ICCS 2018, PT II, 2018, 10861 : 669 - 681