Adaptive Orthogonal Projection for Batch and Online Continual Learning

被引:0
|
作者
Guo, Yiduo [1 ]
Hu, Wenpeng [2 ]
Zhao, Dongyan [1 ]
Liu, Bing [3 ]
机构
[1] Peking Univ, Wangxuan Inst Comp Technol, Beijing, Peoples R China
[2] Peking Univ, Sch Math Sci, Beijing, Peoples R China
[3] Univ Illinois, Dept Comp Sci, Chicago, IL 60607 USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Catastrophic forgetting is a key obstacle to continual learning. One of the state-of-the-art approaches is orthogonal projection. The idea of this approach is to learn each task by updating the network parameters or weights only in the direction orthogonal to the subspace spanned by all previous task inputs. This ensures no interference with tasks that have been learned. The system OWM that uses the idea performs very well against other state-of-the-art systems. In this paper, we first discuss an issue that we discovered in the mathematical derivation of this approach and then propose a novel method, called AOP (Adaptive Orthogonal Projection), to resolve it, which results in significant accuracy gains in empirical evaluations in both the batch and online continual learning settings without saving any previous training data as in replay-based methods.
引用
收藏
页码:6783 / 6791
页数:9
相关论文
共 50 条
  • [21] ONLINE CONTINUAL LEARNING FOR EMBEDDED DEVICES
    Hayes, Tyler L.
    Kanan, Christopher
    CONFERENCE ON LIFELONG LEARNING AGENTS, VOL 199, 2022, 199
  • [22] Sample Condensation in Online Continual Learning
    Sangermano, Mattia
    Carta, Antonio
    Cossu, Andrea
    Bacciu, Davide
    2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,
  • [23] Online Noisy Continual Relation Learning
    Li, Guozheng
    Wang, Peng
    Luo, Qiqing
    Liu, Yanhe
    Ke, Wenjun
    THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 11, 2023, : 13059 - 13066
  • [24] Online continual learning with declarative memory
    Xiao, Zhe
    Du, Zhekai
    Wang, Ruijin
    Gan, Ruimeng
    Li, Jingjing
    NEURAL NETWORKS, 2023, 163 : 146 - 155
  • [25] Scalable Adversarial Online Continual Learning
    Dam, Tanmoy
    Pratama, Mahardhika
    Ferdaus, Meftahul
    Anavatti, Sreenatha
    Abbas, Hussein
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2022, PT III, 2023, 13715 : 373 - 389
  • [26] Adaptive Exploration for Continual Reinforcement Learning
    Stulp, Freek
    2012 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2012, : 1631 - 1636
  • [27] Adaptive Plasticity Improvement for Continual Learning
    Liang, Yan-Shuo
    Li, Wu-Jun
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR, 2023, : 7816 - 7825
  • [28] CBA: Improving Online Continual Learning via Continual Bias Adaptor
    Wang, Quanziang
    Wang, Renzhen
    Wu, Yichen
    Jia, Xixi
    Meng, Deyu
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 19036 - 19046
  • [29] On the Convergence of Continual Learning with Adaptive Methods
    Han, Seungyub
    Kim, Yeongmo
    Cho, Taehyun
    Lee, Jungwoo
    UNCERTAINTY IN ARTIFICIAL INTELLIGENCE, 2023, 216 : 809 - 818
  • [30] Online Learned Continual Compression with Adaptive Quantization Modules
    Caccia, Lucas
    Belilovsky, Eugene
    Caccia, Massimo
    Pineau, Joelle
    25TH AMERICAS CONFERENCE ON INFORMATION SYSTEMS (AMCIS 2019), 2019,