Evolving Ensemble Model based on Hilbert Schmidt Independence Criterion for task-free continual learning

被引:0
|
作者
Ye, Fei [1 ]
Bors, Adrian G. [2 ]
机构
[1] Univ Elect Sci & Technol China, Sch Informat & Software Engn, Chengdu, Peoples R China
[2] Univ York, Dept Comp Sci, York YO10 5GH, England
关键词
Lifelong learning; Variational Autoencoders (VAE); Hilbert Schmidt Independence Criterion; Representation learning;
D O I
10.1016/j.neucom.2025.129370
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Continual Learning (CL) aims to extend the abilities of deep learning models for continuously acquiring new knowledge without forgetting. However, most CL studies assume that task identities and boundaries are known, which is not a realistic assumption in areal scenario. In this work, we address amore challenging and realistic situation in CL, namely the Task-Free Continuous Learning (TFCL), where an ensemble of experts is trained on non-stationary data streams without having any task labels. To deal with TFCL, we introduce the Evolving Ensemble Model (EEM), which can dynamically build new experts into a mixture, thus adapting to the changing data distributions while continuously learning new data sets. To ensure a compact network architecture for EEM during training, we propose a novel expansion mechanism that considers the Hilbert- Schmidt Independence Criterion (HSIC) for evaluating the statistical consistency between the knowledge learned by each expert and that corresponding to the given data. This expansion mechanism does not require storing all previous samples and is more efficient as it performs statistical evaluations in a low-dimensional feature space inferred by a deep network. We also propose anew dropout mechanism for selectively removing unimportant stored samples from the memory buffer used for storing the continuously incoming data before they are used for training. The proposed dropout mechanism ensures the diversity of information being learnt by the experts of our model. We perform extensive TFCL tests which show that the proposed approach achieves the state of the art. The source code is available at https://github.com/dtuzi123/HSCI-DEM.
引用
收藏
页数:12
相关论文
共 50 条
  • [41] A Feature-Weighted Support Vector Regression Machine Based on Hilbert-Schmidt Independence Criterion Least Absolute Shrinkage and Selection Operator
    Zhang, Xin
    Wang, Tinghua
    Lai, Zhiyong
    INFORMATION, 2024, 15 (10)
  • [42] The meta-learning method for the ensemble model based on situational meta-task
    Zhang, Zhengchao
    Zhou, Lianke
    Wu, Yuyang
    Wang, Nianbin
    FRONTIERS IN NEUROROBOTICS, 2024, 18
  • [43] Ensemble Compressed Language Model Based on Knowledge Distillation and Multi-Task Learning
    Xiang, Kun
    Fujii, Akihiro
    2022 7TH INTERNATIONAL CONFERENCE ON BUSINESS AND INDUSTRIAL RESEARCH (ICBIR2022), 2022, : 72 - 77
  • [44] TFTL: A Task-Free Transfer Learning Strategy for EEG-Based Cross-Subject and Cross-Dataset Motor Imagery BCI
    Wang, Yihan
    Wang, Jiaxing
    Wang, Weiqun
    Su, Jianqiang
    Bunterngchit, Chayut
    Hou, Zeng-Guang
    IEEE TRANSACTIONS ON BIOMEDICAL ENGINEERING, 2025, 72 (02) : 810 - 821
  • [45] Extraversion differentiates between model-based and model-free strategies in a reinforcement learning task
    Skatova, Anya
    Chan, Patricia A.
    Daw, Nathaniel D.
    FRONTIERS IN HUMAN NEUROSCIENCE, 2013, 7
  • [46] A Deep Learning Based Multi-task Ensemble Model for Intent Detection and Slot Filling in Spoken Language Understanding
    Firdaus, Mauajama
    Bhatnagar, Shobhit
    Ekbal, Asif
    Bhattacharyya, Pushpak
    NEURAL INFORMATION PROCESSING (ICONIP 2018), PT IV, 2018, 11304 : 647 - 658
  • [47] Task complexity interacts with state-space uncertainty in the arbitration between model-based and model-free learning
    Dongjae Kim
    Geon Yeong Park
    John P. O′Doherty
    Sang Wan Lee
    Nature Communications, 10
  • [48] Task complexity interacts with state-space uncertainty in the arbitration between model-based and model-free learning
    Kim, Dongjae
    Park, Geon Yeong
    O'Doherty, John P.
    Lee, Sang Wan
    NATURE COMMUNICATIONS, 2019, 10 (1)
  • [49] A multi-energy load prediction model based on deep multi-task learning and ensemble approach for regional integrated energy systems
    Wang Xuan
    Wang Shouxiang
    Zhao Qianyu
    Wang Shaomin
    Fu Liwei
    INTERNATIONAL JOURNAL OF ELECTRICAL POWER & ENERGY SYSTEMS, 2021, 126
  • [50] Proselfs depend more on model-based than model-free learning in a non-social probabilistic state-transition task
    Oguchi, Mineki
    Li, Yang
    Matsumoto, Yoshie
    Kiyonari, Toko
    Yamamoto, Kazuhiko
    Sugiura, Shigeki
    Sakagami, Masamichi
    SCIENTIFIC REPORTS, 2023, 13 (01)