Trust-Region Adaptive Frequency for Online Continual Learning

被引:1
|
作者
Kong, Yajing [1 ]
Liu, Liu [1 ]
Qiao, Maoying [2 ]
Wang, Zhen [1 ]
Tao, Dacheng [1 ]
机构
[1] Univ Sydney, Fac Engn, Sch Comp Sci, Sydney, NSW 2008, Australia
[2] Univ Technol Sydney, Sydney, NSW, Australia
关键词
Online continual learning; Catastrophic forgetting; Trust-region; Deep learning;
D O I
10.1007/s11263-023-01775-0
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In the paradigm of online continual learning, one neural network is exposed to a sequence of tasks, where the data arrive in an online fashion and previously seen data are not accessible. Such online fashion causes insufficient learning and severe forgetting on past tasks issues, preventing a good stability-plasticity trade-off, where ideally the network is expected to have high plasticity to adapt to new tasks well and have the stability to prevent forgetting on old tasks simultaneously. To solve these issues, we propose a trust-region adaptive frequency approach, which alternates between standard-process and intra-process updates. Specifically, the standard-process replays data stored in a coreset and interleaves the data with current data, and the intra-process updates the network parameters based on the coreset. Furthermore, to improve the unsatisfactory performance stemming from online fashion, the frequency of the intra-process is adjusted based on a trust region, which is measured by the confidence score of current data. During the intra-process, we distill the dark knowledge to retain useful learned knowledge. Moreover, to store more representative data in the coreset, a confidence-based coreset selection is presented in an online manner. The experimental results on standard benchmarks show that the proposed method significantly outperforms state-of-art continual learning algorithms.
引用
收藏
页码:1825 / 1839
页数:15
相关论文
共 50 条
  • [1] Trust-Region Adaptive Frequency for Online Continual Learning
    Yajing Kong
    Liu Liu
    Maoying Qiao
    Zhen Wang
    Dacheng Tao
    International Journal of Computer Vision, 2023, 131 : 1825 - 1839
  • [2] Trust-region learning for ICA
    Choi, H
    Kim, S
    Choi, S
    2004 IEEE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1-4, PROCEEDINGS, 2004, : 41 - 46
  • [3] An improved adaptive trust-region algorithm
    Kamandi, Ahmad
    Amini, Keyvan
    Ahookhosh, Masoud
    OPTIMIZATION LETTERS, 2017, 11 (03) : 555 - 569
  • [4] A consistently adaptive trust-region method
    Hamad, Fadi
    Hinder, Oliver
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [5] An improved adaptive trust-region algorithm
    Ahmad Kamandi
    Keyvan Amini
    Masoud Ahookhosh
    Optimization Letters, 2017, 11 : 555 - 569
  • [6] Relative trust-region learning for ICA
    Choi, H
    Choi, S
    2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING, 2005, : 261 - 264
  • [7] Trust-Region Inverse Reinforcement Learning
    Cao, Kun
    Xie, Lihua
    IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2024, 69 (02) : 1037 - 1044
  • [8] Adaptive trust-region algorithms for unconstrained optimization
    Rezapour, Mostafa
    Asaki, Thomas J.
    OPTIMIZATION METHODS & SOFTWARE, 2021, 36 (05): : 1059 - 1081
  • [9] Adaptive Trust-Region Method on Riemannian Manifold
    Shimin Zhao
    Tao Yan
    Kai Wang
    Yuanguo Zhu
    Journal of Scientific Computing, 2023, 96
  • [10] Adaptive Trust-Region Method on Riemannian Manifold
    Zhao, Shimin
    Yan, Tao
    Wang, Kai
    Zhu, Yuanguo
    JOURNAL OF SCIENTIFIC COMPUTING, 2023, 96 (03)