Evolving Ensemble Model based on Hilbert Schmidt Independence Criterion for task-free continual learning

被引：0

作者：

Ye, Fei ^{[1
]}

Bors, Adrian G. ^{[2
]}

机构：

[1] Univ Elect Sci & Technol China, Sch Informat & Software Engn, Chengdu, Peoples R China

[2] Univ York, Dept Comp Sci, York YO10 5GH, England

来源：

NEUROCOMPUTING | 2025年 / 624卷

关键词：

Lifelong learning; Variational Autoencoders (VAE); Hilbert Schmidt Independence Criterion; Representation learning;

D O I：

10.1016/j.neucom.2025.129370

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Continual Learning (CL) aims to extend the abilities of deep learning models for continuously acquiring new knowledge without forgetting. However, most CL studies assume that task identities and boundaries are known, which is not a realistic assumption in areal scenario. In this work, we address amore challenging and realistic situation in CL, namely the Task-Free Continuous Learning (TFCL), where an ensemble of experts is trained on non-stationary data streams without having any task labels. To deal with TFCL, we introduce the Evolving Ensemble Model (EEM), which can dynamically build new experts into a mixture, thus adapting to the changing data distributions while continuously learning new data sets. To ensure a compact network architecture for EEM during training, we propose a novel expansion mechanism that considers the Hilbert- Schmidt Independence Criterion (HSIC) for evaluating the statistical consistency between the knowledge learned by each expert and that corresponding to the given data. This expansion mechanism does not require storing all previous samples and is more efficient as it performs statistical evaluations in a low-dimensional feature space inferred by a deep network. We also propose anew dropout mechanism for selectively removing unimportant stored samples from the memory buffer used for storing the continuously incoming data before they are used for training. The proposed dropout mechanism ensures the diversity of information being learnt by the experts of our model. We perform extensive TFCL tests which show that the proposed approach achieves the state of the art. The source code is available at https://github.com/dtuzi123/HSCI-DEM.

引用

页数：12

共 50 条

[41] A Feature-Weighted Support Vector Regression Machine Based on Hilbert-Schmidt Independence Criterion Least Absolute Shrinkage and Selection Operator
Zhang, Xin
Wang, Tinghua
Lai, Zhiyong
INFORMATION, 2024, 15 (10)
[42] The meta-learning method for the ensemble model based on situational meta-task
Zhang, Zhengchao
Zhou, Lianke
Wu, Yuyang
Wang, Nianbin
FRONTIERS IN NEUROROBOTICS, 2024, 18
[43] Ensemble Compressed Language Model Based on Knowledge Distillation and Multi-Task Learning
Xiang, Kun
Fujii, Akihiro
2022 7TH INTERNATIONAL CONFERENCE ON BUSINESS AND INDUSTRIAL RESEARCH (ICBIR2022), 2022, : 72 - 77
[44] TFTL: A Task-Free Transfer Learning Strategy for EEG-Based Cross-Subject and Cross-Dataset Motor Imagery BCI
Wang, Yihan
Wang, Jiaxing
Wang, Weiqun
Su, Jianqiang
Bunterngchit, Chayut
Hou, Zeng-Guang
IEEE TRANSACTIONS ON BIOMEDICAL ENGINEERING, 2025, 72 (02) : 810 - 821
[45] Extraversion differentiates between model-based and model-free strategies in a reinforcement learning task
Skatova, Anya
Chan, Patricia A.
Daw, Nathaniel D.
FRONTIERS IN HUMAN NEUROSCIENCE, 2013, 7
[46] A Deep Learning Based Multi-task Ensemble Model for Intent Detection and Slot Filling in Spoken Language Understanding
Firdaus, Mauajama
Bhatnagar, Shobhit
Ekbal, Asif
Bhattacharyya, Pushpak
NEURAL INFORMATION PROCESSING (ICONIP 2018), PT IV, 2018, 11304 : 647 - 658
[47] Task complexity interacts with state-space uncertainty in the arbitration between model-based and model-free learning
Dongjae Kim
Geon Yeong Park
John P. O′Doherty
Sang Wan Lee
Nature Communications, 10
[48] Task complexity interacts with state-space uncertainty in the arbitration between model-based and model-free learning
Kim, Dongjae
Park, Geon Yeong
O'Doherty, John P.
Lee, Sang Wan
NATURE COMMUNICATIONS, 2019, 10 (1)
[49] A multi-energy load prediction model based on deep multi-task learning and ensemble approach for regional integrated energy systems
Wang Xuan
Wang Shouxiang
Zhao Qianyu
Wang Shaomin
Fu Liwei
INTERNATIONAL JOURNAL OF ELECTRICAL POWER & ENERGY SYSTEMS, 2021, 126
[50] Proselfs depend more on model-based than model-free learning in a non-social probabilistic state-transition task
Oguchi, Mineki
Li, Yang
Matsumoto, Yoshie
Kiyonari, Toko
Yamamoto, Kazuhiko
Sugiura, Shigeki
Sakagami, Masamichi
SCIENTIFIC REPORTS, 2023, 13 (01)

← 1 2 3 4 5 →