A novel semi-supervised classification approach for evolving data streams

被引:13
|
作者
Liao, Guobo [1 ]
Zhang, Peng [2 ]
Yin, Hongpeng [1 ]
Deng, Xuanhong [1 ]
Li, Yanxia [1 ]
Zhou, Han [1 ]
Zhao, Dandan [1 ]
机构
[1] Chongqing Univ, Sch Automat, Chongqing, Peoples R China
[2] Chongqing Univ, Coll Bioengn, Chongqing, Peoples R China
基金
中国国家自然科学基金;
关键词
Data stream; Semi-supervised classification; Concept drift; Concept evolution; NONSTATIONARY DATA STREAMS; ENSEMBLE ALGORITHM; CLASSIFIERS; SELECTION;
D O I
10.1016/j.eswa.2022.119273
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Classification plays a crucial role in mining the evolving data streams. The concept drift and concept evolution are the major issues of data streams classification, which greatly affect the classification performance. Most existing works of concept drift and evolution are supervised in nature, where labeling the data is time and resource consuming. In this paper, for the evolving data streams, a semi-supervised classification approach using partially labeled data is proposed. Firstly, an ensemble model dynamically maintains a series of micro -clusters to capture the concept drift. The ensemble model processes the instances in an online fashion rather than chunk-based. Secondly, the concept evolution detection module is constructed to detect the outliers by the local density. The module examines the current buffer to capture the class emergence in data streams with complex class distribution. For improving the execution efficiency of emerging class detection without compromising performance, several constructive strategies are adopted, including removing part of the buffer and selectively executing sample generation. The extensive experiments are constructed about the popular data streams sets and the processed industry data streams, whose results indicate the practicality and effectiveness of the proposed approach for the classification of evolving data streams.
引用
收藏
页数:11
相关论文
共 50 条
  • [1] Semi-supervised federated learning on evolving data streams
    Mawuli, Cobbinah B.
    Kumar, Jay
    Nanor, Ebenezer
    Fu, Shangxuan
    Pan, Liangxu
    Yang, Qinli
    Zhang, Wei
    Shao, Junming
    INFORMATION SCIENCES, 2023, 643
  • [2] Semi-Supervised Evolving Approach for Data Streams Classification Based on Online Gustafson-Kessel Algorithm
    Gorbunov, I. V.
    Kalmykov, M. O.
    Rasskazov, E. V.
    Yankovskaya, A. E.
    2017 11TH IEEE INTERNATIONAL CONFERENCE ON APPLICATION OF INFORMATION AND COMMUNICATION TECHNOLOGIES (AICT 2017), 2017, : 206 - 209
  • [3] Online semi-supervised active learning ensemble classification for evolving imbalanced data streams
    Guo, Yinan
    Pu, Jiayang
    Jiao, Botao
    Peng, Yanyan
    Wang, Dini
    Yang, Shengxiang
    APPLIED SOFT COMPUTING, 2024, 155
  • [4] Online reliable semi-supervised learning on evolving data streams
    Din, Salah Ud
    Shao, Junming
    Kumar, Jay
    Ali, Waqar
    Liu, Jiaming
    Ye, Yu
    INFORMATION SCIENCES, 2020, 525 (525) : 153 - 171
  • [5] A Semi-supervised Ensemble Approach for Mining Data Streams
    Liu, Jing
    Xu, Guo-Sheng
    Xiao, Da
    Gu, Li-Ze
    Niu, Xin-Xin
    JOURNAL OF COMPUTERS, 2013, 8 (11) : 2873 - 2879
  • [6] Classification of Data Streams by Incremental Semi-supervised Fuzzy Clustering
    Castellano, G.
    Fanelli, A. M.
    FUZZY LOGIC AND SOFT COMPUTING APPLICATIONS, WILF 2016, 2017, 10147 : 185 - 194
  • [7] CPSSDS: Conformal prediction for semi-supervised classification on data streams
    Tanha, Jafar
    Samadi, Negin
    Abdi, Yousef
    Razzaghi-Asl, Nazila
    INFORMATION SCIENCES, 2022, 584 : 212 - 234
  • [8] Review of ensemble classification over data streams based on supervised and semi-supervised
    Han, Meng
    Li, Xiaojuan
    Wang, Le
    Zhang, Ni
    Cheng, Haodong
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2022, 43 (03) : 3859 - 3878
  • [9] Improving semi-supervised co-forest algorithm in evolving data streams
    Wang, Yi
    Li, Tao
    APPLIED INTELLIGENCE, 2018, 48 (10) : 3248 - 3262
  • [10] Improving semi-supervised co-forest algorithm in evolving data streams
    Yi Wang
    Tao Li
    Applied Intelligence, 2018, 48 : 3248 - 3262