A Convolutional Heterogeneous Spiking Neural Network for Real-time Music Classification

被引:0
|
作者
Liu, Yuguo [1 ]
Chen, Wenyu [1 ]
Qu, Hong [1 ]
机构
[1] Univ Elect Sci & Technol China, Sch Comp Sci & Engn, Chengdu, Peoples R China
基金
美国国家科学基金会;
关键词
Hybrid Neural Networks; Spiking Neural Networks; Real-time Processing; Music Classification;
D O I
10.1109/RAIIC61787.2024.10670762
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
While artificial Neural Networks(ANNs) have gradually become the dominating frameworks for music classification, single-structured ANNs have certain functional and computational limitations to meet real-time and low-latency requirement in this era of the Internet of Things. With Convolutional Neural Network(CNN) module to address spatial correlations and Recurrent Neural Networks(RNN) module to capture temporal dependencies, hybrid CNN-RNN architectures have attracted academic focus. However, the inference promptness and energy efficiency of these hybrid models are still confined by their computationally costly RNN modules. With sparse and binary activated data flows, the brain-inspired Spiking Neural Networks(SNNs) enable RNN-like sequential processing in a computationally efficient and economic way. Taking respective advantages of dynamically different spiking neurons, we proposed the Convolutional Heterogeneous Spiking Neural Network(CHSNN). With deliberate choices of input spectrograms and training strategy, the proposed CHSNN have outperformed most existing CNN-RNN classification models, demonstrated satisfactory real-time processing ability even with only 20.8% inference time steps, and shown the substitutability of SNN against RNN in terms of both classification performance and inference latency.
引用
收藏
页码:331 / 336
页数:6
相关论文
共 50 条
  • [31] Research on Real-time Video Action Classification Based on Three-Dimensional Convolutional Neural Network
    Yu, Mingli
    Zhang, Chuang
    Wu, Ming
    PROCEEDINGS OF 2018 INTERNATIONAL CONFERENCE ON NETWORK INFRASTRUCTURE AND DIGITAL CONTENT (IEEE IC-NIDC), 2018, : 339 - 343
  • [32] A real-time application-based convolutional neural network approach for tomato leaf disease classification
    Paul, Showmick Guha
    Biswas, Al Amin
    Saha, Arpa
    Zulfiker, Md. Sabab
    Ritu, Nadia Afrin
    Zahan, Ifrat
    Rahman, Mushfiqur
    Islam, Mohammad Ashraful
    ARRAY, 2023, 19
  • [33] NESIM-RT: A real-time distributed spiking neural network simulator
    Rosa-Gallardo, Daniel J.
    de la Torre, Juan Carlos
    Quintana, Fernando M.
    Dominguez-Morales, Juan P.
    Perez-Pena, Fernando
    SOFTWAREX, 2023, 22
  • [34] Real-time radionuclide identification in γ-emitter mixtures based on spiking neural network
    Bobin, C.
    Bichler, O.
    Lourenco, V.
    Thiam, C.
    Thevenin, M.
    APPLIED RADIATION AND ISOTOPES, 2016, 109 : 405 - 409
  • [35] Time-Multiplexed Spiking Convolutional Neural Network Based on VCSELs for Unsupervised Image Classification
    Skontranis, Menelaos
    Sarantoglou, George
    Deligiannidis, Stavros
    Bogris, Adonis
    Mesaritakis, Charis
    APPLIED SCIENCES-BASEL, 2021, 11 (04): : 1 - 18
  • [36] Real-Time Speech Enhancement Based on Convolutional Recurrent Neural Network
    Girirajan, S.
    Pandian, A.
    INTELLIGENT AUTOMATION AND SOFT COMPUTING, 2023, 35 (02): : 1987 - 2001
  • [37] Robust and Real-Time Visual Tracking with Triplet Convolutional Neural Network
    Kim, Jung Uk
    Kim, Hak Gu
    Ro, Yong Man
    PROCEEDINGS OF THE THEMATIC WORKSHOPS OF ACM MULTIMEDIA 2017 (THEMATIC WORKSHOPS'17), 2017, : 280 - 286
  • [38] A real-time and accurate convolutional neural network for fabric defect detection
    Xueshen Li
    Yong Zhu
    Complex & Intelligent Systems, 2024, 10 : 3371 - 3387
  • [39] Real-Time Fabric Defect Segmentation Based on Convolutional Neural Network
    Zhen Wang
    Jing Junfeng
    Zhang, Huanhuan
    Yan Zhao
    AATCC JOURNAL OF RESEARCH, 2021, 8 (1_SUPPL): : 92 - 97
  • [40] FDDWNET: A LIGHTWEIGHT CONVOLUTIONAL NEURAL NETWORK FOR REAL-TIME SEMANTIC SEGMENTATION
    Liu, Jia
    Zhou, Quan
    Qiang, Yong
    Kang, Bin
    Wu, Xiaofu
    Zheng, Baoyu
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 2373 - 2377