Adaptive Classification Method for Concept Drift Based on Online Ensemble

被引:0
|
作者
Guo H. [1 ,2 ]
Cong L. [1 ]
Gao S. [1 ]
Wang W. [1 ,2 ]
机构
[1] School of Computer and Information Technology, Shanxi University, Taiyuan
[2] Key Laboratory of Computational Intelligence and Chinese Information Processing, Shanxi University, Ministry of Education, Taiyuan
基金
中国国家自然科学基金;
关键词
adaptive model; concept drift; incremental learning; online ensemble; streaming data;
D O I
10.7544/issn1000-1239.202220245
中图分类号
学科分类号
摘要
In view of the problems that the online learning model cannot respond in time to the change of data distribution and it is difficult to extract the latest information of data distribution after concept drift occurs in streaming data, which leads to slow convergence of the learning model, an adaptive classification method for concept drift based on online ensemble (AC_OE) is presented. On the one hand, the online ensemble strategy is used to construct a local online learner, which can dynamically adjust the weight of base learner by local prediction of training samples in data blocks. It is helpful to not only extract the evolution information of streaming data in depth to make a more accurate response to the change of data distribution, but also improve the adaptability of the online learning model to the new data distribution after the occurrence of concept drift, and the real-time generalization performance of the learning model is improved too. On the other hand, the incremental learning strategy is used to construct a global incremental learner, and incremental training updates are carried out with the entry of new samples. The method extracts global distribution information of streaming data, and the model can maintain good robustness in the steady state of streaming data. Experimental results show that the proposed method can respond to concept drift and accelerate the convergence of online learning model, and improve the overall generalization performance of the learner effectively. © 2023 Science Press. All rights reserved.
引用
收藏
页码:1592 / 1602
页数:10
相关论文
共 28 条
  • [1] Georg K, Zliobaite I, Brzezinski D., Open challenges for data stream mining research[J], ACM SIGKDD Explorations Newsletter, 16, 1, pp. 1-10, (2014)
  • [2] Lughofer E, Pratama M., Online active learning in data stream regression using uncertainty sampling based on evolving generalized fuzzy models[J], IEEE Transactions on Fuzzy Systems, 26, 1, pp. 292-309, (2018)
  • [3] Tingting Zhai, Yang Gao, Junwu Zhu, Survey of online learning algorithms for streaming data classification[J], Journal of Software, 31, 4, (2020)
  • [4] Hangyuan Du, Wenjian Wang, Liang Bai, A novel evolving data stream clustering method based on optimization model[J], SCIENTIA SINICA:Informationis, 47, 11, pp. 1464-1482, (2017)
  • [5] Ma J, Saul L K, Savage S, Et al., Identifying suspicious URLs: An application of large-scale online learning [C], Proc of the 26th Annual Int Conf on Machine Learning, pp. 681-688, (2009)
  • [6] Jie Lu, Anjin Liu, Fan Dong, Et al., Learning under concept drift: A review[J], IEEE Transactions on Knowledge and Data Engineering, 31, 12, pp. 2346-2363, (2019)
  • [7] Tennant M, Stahl F T, Rana O F, Et al., Scalable real-time classification of data streams with concept drift[J], Future Generation Computer Systems, 75, pp. 187-199, (2017)
  • [8] Lei Du, Qinbao Song, Xiaolin Jia, Detecting concept drift: An information entropy based method using an adaptive sliding window[J], Intelligent Data Analysis, 18, 3, (2014)
  • [9] Bifet A, Gavalda R., Learning from time-changing data with adaptive windowing [C], Proc of the 7th SIAM Int Conf on Data Mining, pp. 443-448, (2007)
  • [10] Husheng Guo, Hai Li, Qiaoyan Ren, Et al., Concept drift type identification based on multi-sliding windows[J], Information Sciences, 585, pp. 1-23, (2022)