Bilevel Online Deep Learning in Non-stationary Environment

被引:2
|
作者
Han, Ya-nan [1 ]
Liu, Jian-wei [1 ]
Xiao, Bing-biao [1 ]
Wang, Xin-Tan [1 ]
Luo, Xiong-lin [1 ]
机构
[1] China Univ Petr, Coll Informat Sci & Engn, Dept Automat, Beijing Campus CUP, Beijing, Peoples R China
关键词
Online Deep Learning; Bilevel optimization; Concept drift;
D O I
10.1007/978-3-030-86340-1_28
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recent years have witnessed enormous progress of online learning. However, a major challenge on the road to artificial agents is concept drift, that is, the data probability distribution would change where the data instance arrives sequentially in a stream fashion, which would lead to catastrophic forgetting and degrade the performance of the model. In this paper, we proposed a new Bilevel Online Deep Learning (BODL) framework, which combine bilevel optimization strategy and online ensemble classifier. In BODL algorithm, we use an ensemble classifier, which use the output of different hidden layers in deep neural network to build multiple base classifiers, the important weights of the base classifiers are updated according to exponential gradient descent method in an online manner. Besides, we apply the similar constraint to overcome the convergence problem of online ensemble framework. Then an effective concept drift detection mechanism utilizing the error rate of classifier is designed to monitor the change of the data probability distribution. When the concept drift is detected, our BODL algorithm can adaptively update the model parameters via bilevel optimization and then circumvent the large drift and encourage positive transfer. Finally, the extensive experiments and ablation studies are conducted on various datasets and the competitive numerical results illustrate that our BODL algorithm is a promising approach.
引用
收藏
页码:347 / 358
页数:12
相关论文
共 50 条
  • [31] Online robust non-stationary estimation
    Sankararaman, Abishek
    Narayanaswamy, Balakrishnan
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [32] Learning and selection of dynamic Bayesian networks for online non-stationary process
    Hourbracq M.
    Wuillemin P.-H.
    Gonzales C.
    Baumard P.
    Revue d'Intelligence Artificielle, 2018, 32 (01) : 75 - 109
  • [33] Online Bayesian Learning for Rate Adaptation in Non-stationary Wireless Channels
    Lei, Xiaoying
    2022 19TH ANNUAL IEEE INTERNATIONAL CONFERENCE ON SENSING, COMMUNICATION, AND NETWORKING (SECON), 2022, : 55 - 63
  • [34] Incremental kernel spectral clustering for online learning of non-stationary data
    Langone, Rocco
    Agudelo, Oscar Mauricio
    De Moor, Bart
    Suykens, Johan A. K.
    NEUROCOMPUTING, 2014, 139 : 246 - 260
  • [35] Learning Non-stationary System Dynamics Online Using Gaussian Processes
    Rottmann, Axel
    Burgard, Wolfram
    PATTERN RECOGNITION, 2010, 6376 : 192 - 201
  • [36] Deep reinforcement learning control for non-stationary building energy management
    Naug, Avisek
    Quinones-Grueiro, Marcos
    Biswas, Gautam
    ENERGY AND BUILDINGS, 2022, 277
  • [37] A deep learning approximation of non-stationary solutions to wave kinetic equations
    Walton, Steven
    Tran, Minh-Binh
    Bensoussan, Alain
    APPLIED NUMERICAL MATHEMATICS, 2024, 199 : 213 - 226
  • [38] Towards Deep Robot Learning with Optimizer applicable to Non-stationary Problems
    Kobayashi, Taisuke
    2021 IEEE/SICE INTERNATIONAL SYMPOSIUM ON SYSTEM INTEGRATION (SII), 2021, : 190 - 194
  • [39] Deep Frequency Derivative Learning for Non-stationary Time Series Forecasting
    Fan, Wei
    Yi, Kun
    Ye, Hangting
    Ning, Zhiyuan
    Zhang, Qi
    An, Ning
    PROCEEDINGS OF THE THIRTY-THIRD INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2024, 2024, : 3944 - 3952
  • [40] Deep Reinforcement Learning for inventory optimization with non-stationary uncertain demand
    Dehaybe, Henri
    Catanzaro, Daniele
    Chevalier, Philippe
    EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2024, 314 (02) : 433 - 445