Adaptive Orthogonal Projection for Batch and Online Continual Learning

被引:0
|
作者
Guo, Yiduo [1 ]
Hu, Wenpeng [2 ]
Zhao, Dongyan [1 ]
Liu, Bing [3 ]
机构
[1] Peking Univ, Wangxuan Inst Comp Technol, Beijing, Peoples R China
[2] Peking Univ, Sch Math Sci, Beijing, Peoples R China
[3] Univ Illinois, Dept Comp Sci, Chicago, IL 60607 USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Catastrophic forgetting is a key obstacle to continual learning. One of the state-of-the-art approaches is orthogonal projection. The idea of this approach is to learn each task by updating the network parameters or weights only in the direction orthogonal to the subspace spanned by all previous task inputs. This ensures no interference with tasks that have been learned. The system OWM that uses the idea performs very well against other state-of-the-art systems. In this paper, we first discuss an issue that we discovered in the mathematical derivation of this approach and then propose a novel method, called AOP (Adaptive Orthogonal Projection), to resolve it, which results in significant accuracy gains in empirical evaluations in both the batch and online continual learning settings without saving any previous training data as in replay-based methods.
引用
收藏
页码:6783 / 6791
页数:9
相关论文
共 50 条
  • [1] Restricted orthogonal gradient projection for continual learning
    Yang, Zeyuan
    Yang, Zonghan
    Liu, Yichen
    Li, Peng
    Liu, Yang
    AI OPEN, 2023, 4 : 98 - 110
  • [2] Adaptive Shortcut Debiasing for Online Continual Learning
    Kim, Doyoung
    Park, Dongmin
    Shin, Yooju
    Bang, Jihwan
    Song, Hwanjun
    Lee, Jae-Gil
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 12, 2024, : 13122 - 13131
  • [3] Adaptive Online Domain Incremental Continual Learning
    Gunasekara, Nuwan
    Gomes, Heitor
    Bifet, Albert
    Pfahringer, Bernhard
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2022, PT I, 2022, 13529 : 491 - 502
  • [4] GopGAN: Gradients Orthogonal Projection Generative Adversarial Network With Continual Learning
    Li, Xiaobin
    Wang, Weiqiang
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 34 (01) : 215 - 227
  • [5] Adaptive online continual multi-view learning
    Yu, Yang
    Du, Zhekai
    Meng, Lichao
    Li, Jingjing
    Hu, Jiang
    INFORMATION FUSION, 2024, 103
  • [6] Adaptive instance similarity embedding for online continual learning
    Han, Ya-nan
    Liu, Jian-wei
    PATTERN RECOGNITION, 2024, 149
  • [7] Adaptive Neural Networks for Online Domain Incremental Continual Learning
    Gunasekara, Nuwan
    Gomes, Heitor
    Bifet, Albert
    Pfahringer, Bernhard
    DISCOVERY SCIENCE (DS 2022), 2022, 13601 : 89 - 103
  • [8] Trust-Region Adaptive Frequency for Online Continual Learning
    Kong, Yajing
    Liu, Liu
    Qiao, Maoying
    Wang, Zhen
    Tao, Dacheng
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2023, 131 (07) : 1825 - 1839
  • [9] Trust-Region Adaptive Frequency for Online Continual Learning
    Yajing Kong
    Liu Liu
    Maoying Qiao
    Zhen Wang
    Dacheng Tao
    International Journal of Computer Vision, 2023, 131 : 1825 - 1839
  • [10] Online Prototype Learning for Online Continual Learning
    Wei, Yujie
    Ye, Jiaxin
    Huang, Zhizhong
    Zhang, Junping
    Shan, Hongming
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 18718 - 18728