Trust-Region Adaptive Frequency for Online Continual Learning

被引:1
|
作者
Kong, Yajing [1 ]
Liu, Liu [1 ]
Qiao, Maoying [2 ]
Wang, Zhen [1 ]
Tao, Dacheng [1 ]
机构
[1] Univ Sydney, Fac Engn, Sch Comp Sci, Sydney, NSW 2008, Australia
[2] Univ Technol Sydney, Sydney, NSW, Australia
关键词
Online continual learning; Catastrophic forgetting; Trust-region; Deep learning;
D O I
10.1007/s11263-023-01775-0
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In the paradigm of online continual learning, one neural network is exposed to a sequence of tasks, where the data arrive in an online fashion and previously seen data are not accessible. Such online fashion causes insufficient learning and severe forgetting on past tasks issues, preventing a good stability-plasticity trade-off, where ideally the network is expected to have high plasticity to adapt to new tasks well and have the stability to prevent forgetting on old tasks simultaneously. To solve these issues, we propose a trust-region adaptive frequency approach, which alternates between standard-process and intra-process updates. Specifically, the standard-process replays data stored in a coreset and interleaves the data with current data, and the intra-process updates the network parameters based on the coreset. Furthermore, to improve the unsatisfactory performance stemming from online fashion, the frequency of the intra-process is adjusted based on a trust region, which is measured by the confidence score of current data. During the intra-process, we distill the dark knowledge to retain useful learned knowledge. Moreover, to store more representative data in the coreset, a confidence-based coreset selection is presented in an online manner. The experimental results on standard benchmarks show that the proposed method significantly outperforms state-of-art continual learning algorithms.
引用
收藏
页码:1825 / 1839
页数:15
相关论文
共 50 条
  • [31] On the use of the energy norm in trust-region and adaptive cubic regularization subproblems
    E. Bergou
    Y. Diouane
    S. Gratton
    Computational Optimization and Applications, 2017, 68 : 533 - 554
  • [32] An adaptive approach of conic trust-region method for unconstrained optimization problems
    Fu J.
    Sun W.
    De Sampaio R.J.B.
    Journal of Applied Mathematics and Computing, 2005, 19 (1-2) : 165 - 177
  • [33] A trust-region method with improved adaptive radius for systems of nonlinear equations
    Hamid Esmaeili
    Morteza Kimiaei
    Mathematical Methods of Operations Research, 2016, 83 : 109 - 125
  • [34] An adaptive nonmonotone trust-region method with curvilinear search for minimax problem
    Wang, Fu-Sheng
    Wang, Chuan-Long
    APPLIED MATHEMATICS AND COMPUTATION, 2013, 219 (15) : 8033 - 8041
  • [35] On the use of the energy norm in trust-region and adaptive cubic regularization subproblems
    Bergou, E.
    Diouane, Y.
    Gratton, S.
    COMPUTATIONAL OPTIMIZATION AND APPLICATIONS, 2017, 68 (03) : 533 - 554
  • [36] Adaptive instance similarity embedding for online continual learning
    Han, Ya-nan
    Liu, Jian-wei
    PATTERN RECOGNITION, 2024, 149
  • [37] Adaptive Orthogonal Projection for Batch and Online Continual Learning
    Guo, Yiduo
    Hu, Wenpeng
    Zhao, Dongyan
    Liu, Bing
    THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / THE TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 6783 - 6791
  • [38] A trust-region method with improved adaptive radius for systems of nonlinear equations
    Esmaeili, Hamid
    Kimiaei, Morteza
    MATHEMATICAL METHODS OF OPERATIONS RESEARCH, 2016, 83 (01) : 109 - 125
  • [39] Two adaptive nonmonotone trust-region algorithms for solving multiobjective optimization problems
    Ghalavand, Nasim
    Khorram, Esmaile
    Morovati, Vahid
    OPTIMIZATION, 2024, 73 (09) : 2953 - 2985
  • [40] A trust-region approach with novel filter adaptive radius for system of nonlinear equations
    Kimiaei, Morteza
    Esmaeili, Hamid
    NUMERICAL ALGORITHMS, 2016, 73 (04) : 999 - 1016