Few-Shot Learning in Spiking Neural Networks by Multi-Timescale Optimization

被引:11
|
作者
Jiang, Runhao [1 ]
Zhang, Jie [1 ]
Yan, Rui [2 ]
Tang, Huajin [3 ,4 ]
机构
[1] Sichuan Univ, Coll Comp Sci, Chengdu 610065, Peoples R China
[2] Zhejiang Univ Technol, Coll Comp Sci, Hangzhou 310014, Peoples R China
[3] Zhejiang Univ, Coll Comp Sci & Technol, Hangzhou 310027, Peoples R China
[4] Zhejiang Lab, Hangzhou 311121, Peoples R China
基金
中国国家自然科学基金;
关键词
ALGORITHM; NEURONS;
D O I
10.1162/neco_a_01423
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Learning new concepts rapidly from a few examples is an open issue in spike-based machine learning. This few-shot learning imposes substantial challenges to the current learning methodologies of spiking neuron networks (SNNs) due to the lack of task-related priori knowledge. The recent learning-to-learn (L2L) approach allows SNNs to acquire priori knowledge through example-level learning and task-level optimization. However, existing L2L-based frameworks do not target the neural dynamics (i.e., neuronal and synaptic parameter changes) on different timescales. This diversity of temporal dynamics is an important attribute in spike-based learning, which facilitates the networks to rapidly acquire knowledge from very few examples and gradually integrate this knowledge. In this work, we consider the neural dynamics on various timescales and provide a multi-timescale optimization (MTSO) framework for SNNs. This framework introduces an adaptive-gated LSTM to accommodate two different timescales of neural dynamics: short-term learning and long-term evolution. Short-term learning is a fast knowledge acquisition process achieved by a novel surrogate gradient online learning (SGOL) algorithm, where the LSTM guides gradient updating of SNN on a short timescale through an adaptive learning rate and weight decay gating. The long-term evolution aims to slowly integrate acquired knowledge and form a priori, which can be achieved by optimizing the LSTM guidance process to tune SNN parameters on a long timescale. Experimental results demonstrate that the collaborative optimization of multi-timescale neural dynamics can make SNNs achieve promising performance for the few-shot learning tasks.
引用
收藏
页码:2439 / 2472
页数:34
相关论文
共 50 条
  • [41] Critical direction projection networks for few-shot learning
    Sheng Bi
    Yongxing Wang
    Xiaoxiao Li
    Min Dong
    Jinhui Zhu
    Applied Intelligence, 2022, 52 : 5400 - 5413
  • [42] Incremental Few-Shot Learning with Attention Attractor Networks
    Ren, Mengye
    Liao, Renjie
    Fetaya, Ethan
    Zemel, Richard S.
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
  • [43] Attention Based Siamese Networks for Few-Shot Learning
    Wang, Junhua
    Zhu, Zijiang
    Li, Jianjun
    Li, Junshan
    PROCEEDINGS OF 2018 IEEE 9TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING AND SERVICE SCIENCE (ICSESS), 2018, : 551 - 554
  • [44] Critical direction projection networks for few-shot learning
    Bi, Sheng
    Wang, Yongxing
    Li, Xiaoxiao
    Dong, Min
    Zhu, Jinhui
    APPLIED INTELLIGENCE, 2022, 52 (05) : 5400 - 5413
  • [45] Feature Transformation and Metric Networks for Few-shot Learning
    Wang, Duo-Rui
    Du, Yang
    Dong, Lan-Fang
    Hu, Wei-Ming
    Li, Bing
    Zidonghua Xuebao/Acta Automatica Sinica, 2024, 50 (07): : 1305 - 1314
  • [46] Federated Learning and Optimization for Few-Shot Image Classification
    Zuo, Yi
    Chen, Zhenping
    Feng, Jing
    Fan, Yunhao
    CMC-COMPUTERS MATERIALS & CONTINUA, 2025, 82 (03): : 4649 - 4667
  • [47] Multi-Scale Metric Learning for Few-Shot Learning
    Jiang, Wen
    Huang, Kai
    Geng, Jie
    Deng, Xinyang
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2021, 31 (03) : 1091 - 1102
  • [48] Episode Adaptive Embedding Networks for Few-Shot Learning
    Liu, Fangbing
    Wang, Qing
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2021, PT III, 2021, 12714 : 3 - 15
  • [49] Few-Shot Learning with Siamese Networks and Label Tuning
    Mueller, Thomas
    Perez-Torro, Guillermo
    Franco-Salvador, Marc
    PROCEEDINGS OF THE 60TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022), VOL 1: (LONG PAPERS), 2022, : 8532 - 8545
  • [50] Multi-Branch Network for Few-shot Learning
    Ren, Kai
    Guo, Zijie
    Zhang, Zhimin
    Zhu, Rui
    Li, Xiaoxu
    PROCEEDINGS OF 2022 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2022, : 520 - 525