LOW-ACTIVITY SUPERVISED CONVOLUTIONAL SPIKING NEURAL NETWORKS APPLIED TO SPEECH COMMANDS RECOGNITION

被引:25
|
作者
Pellegrini, Thomas [1 ]
Zimmer, Romain [1 ,2 ]
Masquelier, Timothee [2 ]
机构
[1] Univ Toulouse, IRIT, Toulouse, France
[2] Univ Toulouse 3, CNRS, CERCO UMR 5549, Toulouse, France
关键词
Spiking neural networks; surrogate gradient; speech command recognition;
D O I
10.1109/SLT48900.2021.9383587
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Deep Neural Networks (DNNs) are the current state-of-the-art models in many speech related tasks. There is a growing interest, though, for more biologically realistic, hardware friendly and energy efficient models, named Spiking Neural Networks (SNNs). Recently, it has been shown that SNNs can be trained efficiently, in a supervised manner, using backpropagation with a surrogate gradient trick. In this work, we report speech command (SC) recognition experiments using supervised SNNs. We explored the Leaky-Integrate-Fire (LIF) neuron model for this task, and show that a model comprised of stacked dilated convolution spiking layers can reach an error rate very close to standard DNNs on the Google SC v1 dataset: 5.5%, while keeping a very sparse spiking activity, below 5%, thank to a new regularization term. We also show that modeling the leakage of the neuron membrane potential is useful, since the LIF model outperformed its non-leaky model counterpart significantly.
引用
收藏
页码:97 / 103
页数:7
相关论文
共 50 条
  • [21] Recognizing Speech Commands Using Convolutional Neural Network
    Kubanek, Mariusz
    Bobulski, Janusz
    Kulawik, Joanna
    INTERNATIONAL CONFERENCE ON NUMERICAL ANALYSIS AND APPLIED MATHEMATICS ICNAAM 2019, 2020, 2293
  • [22] Supervised learning with spiking neural networks
    Xin, JG
    Embrechts, MJ
    IJCNN'01: INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1-4, PROCEEDINGS, 2001, : 1772 - 1777
  • [23] Human Activity Recognition with Convolutional Neural Networks
    Bevilacqua, Antonio
    MacDonald, Kyle
    Rangarej, Aamina
    Widjaya, Venessa
    Caulfield, Brian
    Kechadi, Tahar
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2018, PT III, 2019, 11053 : 541 - 552
  • [24] Hierarchical structure neural networks applied to speech recognition
    Zhou, Liqing
    Zhao, Chang
    Liu, Zemin
    Beijing Youdian Xueyuan Xuebao/Journal of Beijing University of Posts And Telecommunications, 1995, 18 (04): : 57 - 61
  • [25] Deep Spiking Neural Networks for Large Vocabulary Automatic Speech Recognition
    Wu, Jibin
    Yilmaz, Emre
    Zhang, Malu
    Li, Haizhou
    Tan, Kay Chen
    FRONTIERS IN NEUROSCIENCE, 2020, 14
  • [26] Exploration of rank order coding with spiking neural networks for speech recognition
    Loiselle, S
    Rouat, J
    Pressnitzer, D
    Thorpe, S
    PROCEEDINGS OF THE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), VOLS 1-5, 2005, : 2076 - 2080
  • [27] A Biologically Plausible Speech Recognition Framework Based on Spiking Neural Networks
    Wu, Jibin
    Chua, Yansong
    Li, Haizhou
    2018 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2018,
  • [28] Assessment of Semi-supervised Approaches Applied to Convolutional Neural Networks
    Bassani, Cristiano N. de O.
    Saito, Prisicla T. M.
    Bugatti, Pedro H.
    ARTIFICIAL INTELLIGENCE AND SOFT COMPUTING, ICAISC 2022, PT II, 2023, 13589 : 195 - 205
  • [29] Applied Spiking Neural Networks for Radar-based Gesture Recognition
    Kreutz, Felix
    Gerhards, Pascal
    Vogginger, Bernhard
    Knobloch, Klaus
    Mayr, Christian Georg
    2021 7TH INTERNATIONAL CONFERENCE ON EVENT BASED CONTROL, COMMUNICATION, AND SIGNAL PROCESSING (EBCCSP), 2021,
  • [30] Supervised Learning with Small Training Set for Gesture Recognition by Spiking Neural Networks
    Gyongyossy, Natabara Mate
    Domonkos, Mark
    Botzheim, Janos
    Korondi, Peter
    2019 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (IEEE SSCI 2019), 2019, : 2201 - 2206