Gradient Regularization with Multivariate Distribution of Previous Knowledge for Continual Learning

被引:3
|
作者
Kim, Tae-Heon [1 ]
Moon, Hyung-Jun [2 ]
Cho, Sung-Bae [1 ,2 ]
机构
[1] Yonsei Univ, Dept Comp Sci, Seoul 03722, South Korea
[2] Yonsei Univ, Dept Artificial Intelligence, Seoul 03722, South Korea
关键词
Continual learning; Memory replay; Sample generation; Multivariate gaussian distribution; Expectation-maximization;
D O I
10.1007/978-3-031-21753-1_35
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Continual learning is a novel learning setup for an environment where data are introduced sequentially, and a model continually learns new tasks. However, the model forgets the learned knowledge as it learns new classes. There is an approach that keeps a few previous data, but this causes other problems such as overfitting and class imbalance. In this paper, we propose a method that retrains a network with generated representations from an estimated multivariate Gaussian distribution. The representations are the vectors coming from CNN that is trained using a gradient regularization to prevent a distribution shift, allowing the stored means and covariances to create realistic representations. The generated vectors contain every class seen so far, which helps preventing the forgetting. Our 6-fold cross-validation experiment shows that the proposed method outperforms the existing continual learning methods by 1.14%p and 4.60%p in CIFAR10 and CIFAR100, respectively. Moreover, we visualize the generated vectors using t-SNE to confirm the validity of multivariate Gaussian mixture to estimate the distribution of the data representations.
引用
收藏
页码:359 / 368
页数:10
相关论文
共 50 条
  • [31] Adversarial Targeted Forgetting in Regularization and Generative Based Continual Learning Models
    Umer, Muhammad
    Polikar, Robi
    2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
  • [32] RTRA: Rapid Training of Regularization-based Approaches in Continual Learning
    Nokhwal, Sahil
    Kumar, Nirman
    2023 10TH INTERNATIONAL CONFERENCE ON SOFT COMPUTING & MACHINE INTELLIGENCE, ISCMI, 2023, : 188 - 192
  • [33] Frobenius Norm Regularization for the Multivariate Von Mises Distribution
    Rodriguez-Lujan, Luis
    Larranaga, Pedro
    Bielza, Concha
    INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS, 2017, 32 (02) : 153 - 176
  • [34] Using Hindsight to Anchor Past Knowledge in Continual Learning
    Chaudhry, Arslan
    Gordo, Albert
    Dokania, Puneet
    Torr, Philip
    Lopez-Paz, David
    THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 6993 - 7001
  • [35] Continual Learning for Robotic Grasping Detection With Knowledge Transferring
    Liu, Jin
    Xie, Jialong
    Huang, Saike
    Wang, Chaoqun
    Zhou, Fengyu
    IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 2024, 71 (09) : 11019 - 11027
  • [36] Continual Learning Objective for Analyzing Complex Knowledge Representations
    Khan, Asad Mansoor
    Hassan, Taimur
    Akram, Muhammad Usman
    Alghamdi, Norah Saleh
    Werghi, Naoufel
    SENSORS, 2022, 22 (04)
  • [37] Principal Gradient Direction and Confidence Reservoir Sampling for Continual Learning
    Chen, Zhiyi
    Lin, Tong
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2021, PT II, 2021, 12892 : 421 - 432
  • [38] Rectification-based Knowledge Retention for Continual Learning
    Singh, Pravendra
    Mazumder, Pratik
    Rai, Piyush
    Namboodiri, Vinay P.
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 15277 - 15286
  • [39] Towards Causal Replay for Knowledge Rehearsal in Continual Learning
    Churamani, Nikhil
    Cheong, Jiaee
    Kalkan, Sinan
    Gunes, Hatice
    AAAI BRIDGE PROGRAM ON CONTINUAL CAUSALITY, VOL 208, 2023, 208 : 63 - 70
  • [40] Federated Continual Learning via Knowledge Fusion: A Survey
    Yang, Xin
    Yu, Hao
    Gao, Xin
    Wang, Hao
    Zhang, Junbo
    Li, Tianrui
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2024, 36 (08) : 3832 - 3850