Trust-Region Adaptive Frequency for Online Continual Learning

Cited by: 1
Authors
Kong, Yajing [1]
Liu, Liu [1]
Qiao, Maoying [2]
Wang, Zhen [1]
Tao, Dacheng [1]
Affiliations
[1] Univ Sydney, Fac Engn, Sch Comp Sci, Sydney, NSW 2008, Australia
[2] Univ Technol Sydney, Sydney, NSW, Australia
Keywords
Online continual learning; Catastrophic forgetting; Trust-region; Deep learning
DOI
10.1007/s11263-023-01775-0
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
In online continual learning, a single neural network is exposed to a sequence of tasks whose data arrive in an online fashion, and previously seen data are no longer accessible. This online setting causes insufficient learning and severe forgetting of past tasks, which prevents a good stability-plasticity trade-off: ideally, the network should be plastic enough to adapt well to new tasks and, at the same time, stable enough not to forget old ones. To address these issues, we propose a trust-region adaptive frequency approach that alternates between standard-process and intra-process updates. Specifically, the standard-process replays data stored in a coreset and interleaves them with the current data, while the intra-process updates the network parameters based on the coreset alone. Furthermore, to improve the unsatisfactory performance caused by the online setting, the frequency of the intra-process is adjusted based on a trust region, which is measured by the confidence score of the current data. During the intra-process, we distill dark knowledge to retain useful learned knowledge. Moreover, to store more representative data in the coreset, we present a confidence-based coreset selection that operates in an online manner. Experimental results on standard benchmarks show that the proposed method significantly outperforms state-of-the-art continual learning algorithms.
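The alternation described in the abstract can be illustrated with a minimal sketch. The abstract does not give the actual trust-region rule, so everything below is an assumption made for illustration: the confidence score is taken to be the mean maximum softmax probability of the current batch, lower confidence is assumed to trigger more intra-process updates, and the names `batch_confidence`, `intra_process_steps`, `train_stream`, and `max_steps` are hypothetical. This is not the authors' implementation.

```python
import math
from typing import Any, Callable, Iterable, List, Sequence


def softmax(logits: Sequence[float]) -> List[float]:
    """Numerically stable softmax over the logits of one sample."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def batch_confidence(batch_logits: Sequence[Sequence[float]]) -> float:
    """ASSUMED score: mean of the maximum softmax probability over the batch."""
    return sum(max(softmax(row)) for row in batch_logits) / len(batch_logits)


def intra_process_steps(confidence: float, max_steps: int = 4) -> int:
    """HYPOTHETICAL rule: the lower the confidence on the current data,
    the more coreset-only (intra-process) updates are scheduled."""
    confidence = min(max(confidence, 0.0), 1.0)  # clamp to [0, 1]
    return math.ceil((1.0 - confidence) * max_steps)


def train_stream(
    stream: Iterable[Any],
    coreset: Any,
    standard_update: Callable[[Any, Any], None],
    intra_update: Callable[[Any], None],
    confidence_fn: Callable[[Any], float],
    max_steps: int = 4,
) -> List[int]:
    """Alternate one standard-process update with a variable number of
    intra-process updates; return the number of intra steps per batch."""
    schedule = []
    for batch in stream:
        # Standard-process: current data interleaved with replayed coreset data.
        standard_update(batch, coreset)
        # Intra-process: coreset-only updates, frequency set by confidence.
        k = intra_process_steps(confidence_fn(batch), max_steps)
        for _ in range(k):
            intra_update(coreset)
        schedule.append(k)
    return schedule
```

In this sketch the two update callbacks are left to the caller, so the scheduling logic can be examined without committing to a particular network, loss, distillation term, or coreset selection rule.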
Pages: 1825-1839
Number of pages: 15
Related papers
50 items in total
  • [21] Adaptive Shortcut Debiasing for Online Continual Learning
    Kim, Doyoung
    Park, Dongmin
    Shin, Yooju
    Bang, Jihwan
    Song, Hwanjun
    Lee, Jae-Gil
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 12, 2024, : 13122 - 13131
  • [22] STOCHASTIC TRUST-REGION METHODS WITH TRUST-REGION RADIUS DEPENDING ON PROBABILISTIC MODELS
    Wang, Xiaoyu
    Yuan, Ya-xiang
    JOURNAL OF COMPUTATIONAL MATHEMATICS, 2022, 40 (02): : 295 - 336
  • [23] AN ADAPTIVE TRUST-REGION METHOD FOR GENERALIZED EIGENVALUES OF SYMMETRIC TENSORS
    Chen, Yuting
    Cao, Mingyuan
    Yang, Yueting
    Huang, Qingdao
    JOURNAL OF COMPUTATIONAL MATHEMATICS, 2021, 39 (03): : 358 - 374
  • [24] Adaptive Online Domain Incremental Continual Learning
    Gunasekara, Nuwan
    Gomes, Heitor
    Bifet, Albert
    Pfahringer, Bernhard
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2022, PT I, 2022, 13529 : 491 - 502
  • [25] A New Restarting Adaptive Trust-Region Method for Unconstrained Optimization
    Kimiaei M.
    Ghaderi S.
    Kimiaei, Morteza (morteza.kimiaei@gmail.com), 1600, Springer Science and Business Media Deutschland GmbH (05): : 487 - 507
  • [26] An efficient adaptive trust-region method for systems of nonlinear equations
    Esmaeili, Hamid
    Kimiaei, Morteza
    INTERNATIONAL JOURNAL OF COMPUTER MATHEMATICS, 2015, 92 (01) : 151 - 166
  • [27] Trust-region reflective adaptive controller for time varying systems
    Moubarak, Paul M.
    IET CONTROL THEORY AND APPLICATIONS, 2015, 9 (02): : 240 - 247
  • [28] Nonmonotone adaptive trust-region method for unconstrained optimization problems
    Fu, JH
    Sun, WY
    APPLIED MATHEMATICS AND COMPUTATION, 2005, 163 (01) : 489 - 504
  • [29] A new adaptive trust-region method for system of nonlinear equations
    Esmaeili, Hamid
    Kimiaei, Morteza
    APPLIED MATHEMATICAL MODELLING, 2014, 38 (11-12) : 3003 - 3015
  • [30] Adaptive online continual multi-view learning
    Yu, Yang
    Du, Zhekai
    Meng, Lichao
    Li, Jingjing
    Hu, Jiang
    INFORMATION FUSION, 2024, 103