Few-Shot Class-Incremental Learning via Class-Aware Bilateral Distillation

Cited by: 30
Authors
Zhao, Linglan [1]
Lu, Jing [2]
Xu, Yunlu [2]
Cheng, Zhanzhan [2]
Guo, Dashan [1]
Niu, Yi [2]
Fang, Xiangzhong [1]
Affiliations
[1] Shanghai Jiao Tong Univ, Dept Elect Engn, Shanghai, Peoples R China
[2] Hikvis Res Inst, Hangzhou, Zhejiang, Peoples R China
DOI
10.1109/CVPR52729.2023.01139
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Few-Shot Class-Incremental Learning (FSCIL) aims to continually learn novel classes from only a few training samples, which makes it a more challenging task than the well-studied Class-Incremental Learning (CIL) because of data scarcity. While knowledge distillation, a prevailing technique in CIL, can alleviate the catastrophic forgetting of older classes by regularizing outputs between the current and previous models, it fails to consider the overfitting risk of novel classes in FSCIL. To adapt the powerful distillation technique for FSCIL, we propose a novel distillation structure that takes the unique challenge of overfitting into account. Concretely, we draw knowledge from two complementary teachers. One is the model trained on abundant data from base classes, which carries rich general knowledge that can be leveraged to ease the overfitting of current novel classes. The other is the updated model from the last incremental session, which contains the adapted knowledge of previous novel classes and is used to alleviate their forgetting. To combine the guidance from the two teachers, an adaptive strategy conditioned on class-wise semantic similarities is introduced. In addition, to better preserve base class knowledge when accommodating novel concepts, we adopt a two-branch network with an attention-based aggregation module that dynamically merges predictions from the two complementary branches. Extensive experiments on three popular FSCIL datasets (mini-ImageNet, CIFAR100 and CUB200) validate the effectiveness of our method, which surpasses existing works by a significant margin. Code is available at https://github.com/LinglanZhao/BiDistFSCIL.
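The two-teacher idea in the abstract can be illustrated with a small, dependency-free sketch. This is not the authors' implementation (see the repository linked above for that). It assumes, for simplicity, that the student and both teachers emit logits over the same set of classes, and it uses a hypothetical weighting rule (`base_teacher_weight`, based on cosine similarity to base-class prototypes) as a stand-in for the paper's adaptive strategy. All function names and the temperature value are illustrative assumptions.

```python
import math


def softmax(logits, temperature=1.0):
    """Temperature-scaled softmax over a list of logits."""
    scaled = [x / temperature for x in logits]
    peak = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(x - peak) for x in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def kl_divergence(p, q, eps=1e-12):
    """KL(p || q) for two discrete distributions of equal length."""
    return sum(pi * math.log((pi + eps) / (qi + eps)) for pi, qi in zip(p, q))


def cosine_similarity(a, b):
    """Cosine similarity of two vectors; 0.0 if either has zero norm."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def base_teacher_weight(class_prototype, base_prototypes):
    """Hypothetical rule (an assumption, not from the paper): trust the base
    teacher more when the class prototype is close to some base class."""
    best = max(cosine_similarity(class_prototype, p) for p in base_prototypes)
    return min(1.0, max(0.0, best))  # clamp to [0, 1]


def bilateral_distillation_loss(student_logits, base_teacher_logits,
                                prev_teacher_logits, weight, temperature=2.0):
    """Weighted sum of two distillation terms: one toward the base-session
    teacher, one toward the teacher from the previous incremental session."""
    student = softmax(student_logits, temperature)
    base_teacher = softmax(base_teacher_logits, temperature)
    prev_teacher = softmax(prev_teacher_logits, temperature)
    return (weight * kl_divergence(base_teacher, student)
            + (1.0 - weight) * kl_divergence(prev_teacher, student))


if __name__ == "__main__":
    w = base_teacher_weight([1.0, 0.2], [[1.0, 0.0], [0.0, 1.0]])
    loss = bilateral_distillation_loss(
        [0.5, 1.0, 0.1], [2.0, 0.5, 0.1], [0.3, 1.5, 0.2], w)
    print(f"weight={w:.3f} loss={loss:.4f}")
```

In this sketch a weight near 1 pulls the student toward the base teacher and a weight near 0 pulls it toward the previous-session teacher. How the published method actually computes the weight and handles teachers with different output dimensions is not stated in the abstract.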
Pages: 11838 - 11847
Number of pages: 10
Related papers
50 in total
  • [31] Few-Shot Class-Incremental Learning Based on Feature Distribution Learning
    Yao, Guangle
    Zhu, Juntao
    Zhou, Wenlong
    Zhang, Guiyu
    Zhang, Wei
    Zhang, Qian
    Computer Engineering and Applications, 2023, 59 (14) : 151 - 157
  • [32] Few-shot class incremental learning via prompt transfer and knowledge distillation
    Akmel, Feidu
    Meng, Fanman
    Liu, Mingyu
    Zhang, Runtong
    Teka, Asebe
    Lemuye, Elias
    IMAGE AND VISION COMPUTING, 2024, 151
  • [33] Few-Shot Class-Incremental SAR Target Recognition via Cosine Prototype Learning
    Zhao, Yan
    Zhao, Lingjun
    Ding, Ding
    Hu, Dewen
    Kuang, Gangyao
    Liu, Li
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61
  • [34] Rethinking Self-Supervision for Few-Shot Class-Incremental Learning
    Zhao, Linglan
    Lu, Jing
    Cheng, Zhanzhan
    Liu, Duo
    Fang, Xiangzhong
    2023 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, ICME, 2023 : 726 - 731
  • [35] Knowledge Representation by Generic Models for Few-Shot Class-Incremental Learning
    Chen, Xiaodong
    Jiang, Weijie
    Huang, Zhiyong
    Su, Jiangwen
    Yu, Yuanlong
    ADVANCES IN NATURAL COMPUTATION, FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY, ICNC-FSKD 2022, 2023, 153 : 1237 - 1247
  • [36] A Few-Shot Class-Incremental Learning Method for Network Intrusion Detection
    Du, Lei
    Gu, Zhaoquan
    Wang, Ye
    Wang, Le
    Jia, Yan
    IEEE TRANSACTIONS ON NETWORK AND SERVICE MANAGEMENT, 2024, 21 (02) : 2389 - 2401
  • [37] Few-Shot Class-Incremental Learning for Classification and Object Detection: A Survey
    Zhang, Jinghua
    Liu, Li
    Silven, Olli
    Pietikainen, Matti
    Hu, Dewen
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2025, 47 (04) : 2924 - 2945
  • [38] Few-Shot Class-Incremental Learning via Training-Free Prototype Calibration
    Wang, Qi-Wei
    Zhou, Da-Wei
    Zhang, Yi-Kai
    Zhan, De-Chuan
    Ye, Han-Jia
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023
  • [39] Few-Shot Class-Incremental Learning for Network Intrusion Detection Systems
    Di Monda, Davide
    Montieri, Antonio
    Persico, Valerio
    Voria, Pasquale
    De Ieso, Matteo
    Pescape, Antonio
    IEEE OPEN JOURNAL OF THE COMMUNICATIONS SOCIETY, 2024, 5 : 6736 - 6757
  • [40] Improved Continually Evolved Classifiers for Few-Shot Class-Incremental Learning
    Wang, Ye
    Zhao, Guoshuai
    Qian, Xueming
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (02) : 1123 - 1134