Concept drift detection and accelerated convergence of online learning

被引:6
|
作者
Guo, Husheng [1 ,2 ]
Li, Hai [1 ]
Sun, Ni [1 ]
Ren, Qiaoyan [1 ]
Zhang, Aijuan [1 ]
Wang, Wenjian [1 ,2 ]
机构
[1] Shanxi Univ, Sch Comp & Informat Technol, Taiyuan 030006, Shanxi, Peoples R China
[2] Shanxi Univ, Key Lab Computat Intelligence & Chinese Informat, Minist Educ, Taiyuan 030006, Shanxi, Peoples R China
基金
中国国家自然科学基金;
关键词
Streaming data; Concept drift; Authenticity; Model convergence; NEURAL-NETWORKS; DATA STREAMS; ENSEMBLE; CLASSIFICATION; MODELS;
D O I
10.1007/s10115-022-01790-6
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Streaming data has become an important form in the era of big data, and the concept drift, as one of the most important problem of it, is often studied deeply. However, similar to true concept drift, noise and too small training samples will also lead to the classification performance fluctuation, which is easy to confuse with true concept drift. To solve this problem, an improved concept drift detection method is proposed, and the accelerated convergence of the model after concept drift is also studied. Firstly, the effective fluctuation sites can be obtained by group detection method. Secondly, the authenticity of concept drift can be determined by tracking the testing accuracy of reference sites near the effective fluctuation site. Lastly, in the convergence acceleration stage, the time sequential distance is designed to measure the similarity of these sequential data blocks during different time periods, and the noncritical disturbance data with the largest time sequential distance are removed sequentially to improve the convergence speed of the model after concept drift occurs. The experimental results demonstrate that the proposed method not only produces better identification results in distinguishing true and false concept drift but also improves the convergence speed of the model.
引用
收藏
页码:1005 / 1043
页数:39
相关论文
共 50 条
  • [41] Online Anomaly Detection with Concept Drift Adaptation using Recurrent Neural Networks
    Saurav, Sakti
    Malhotra, Pankaj
    Tv, Vishnu
    Gugulothu, Narendhar
    Vig, Lovekesh
    Agarwal, Puneet
    Shroff, Gautam
    PROCEEDINGS OF THE ACM INDIA JOINT INTERNATIONAL CONFERENCE ON DATA SCIENCE AND MANAGEMENT OF DATA (CODS-COMAD'18), 2018, : 78 - 87
  • [42] Online eigenvector transformation reflecting concept drift for improving network intrusion detection
    Park, Seongchul
    Seo, Sanghyun
    Jeong, Changhoon
    Kim, Juntae
    EXPERT SYSTEMS, 2020, 37 (05)
  • [43] Towards Online Concept Drift Detection with Feature Selection for Data Stream Classification
    Hammoodi, Mahmood
    Stahl, Frederic
    Tennant, Mark
    ECAI 2016: 22ND EUROPEAN CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2016, 285 : 1549 - 1550
  • [44] Network Intrusion Detection through Online Transformation of Eigenvector Reflecting Concept Drift
    Park, Seongchul
    Seo, Sanghyun
    Jeong, Changhoon
    Kim, Juntae
    PROCEEDINGS OF THE FIRST INTERNATIONAL CONFERENCE ON DATA SCIENCE, E-LEARNING AND INFORMATION SYSTEMS 2018 (DATA'18), 2018,
  • [45] Online concept evolution detection based on active learning
    Guo, Husheng
    Li, Hai
    Cong, Lu
    Wang, Wenjian
    DATA MINING AND KNOWLEDGE DISCOVERY, 2024, 38 (04) : 1589 - 1633
  • [46] A Multiscale Concept Drift Detection Method for Learning from Data Streams
    Wang, XueSong
    Kang, Qi
    Zhou, MengChu
    Yao, SiYa
    2018 IEEE 14TH INTERNATIONAL CONFERENCE ON AUTOMATION SCIENCE AND ENGINEERING (CASE), 2018, : 786 - 790
  • [47] Detection & management of concept drift
    Mak, Lee-Onn
    Krause, Paul
    PROCEEDINGS OF 2006 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2006, : 3486 - +
  • [48] Machine learning-based detection of concept drift in business processes
    Alexander Kraus
    Han van der Aa
    Process Science, 2 (1):
  • [49] Concept Drift Detection for Deep Learning Aided Receivers in Dynamic Channels
    Uzlaner, Nicole
    Raviv, Tomer
    Shlezinger, Nir
    Todros, Koby
    2024 IEEE 25TH INTERNATIONAL WORKSHOP ON SIGNAL PROCESSING ADVANCES IN WIRELESS COMMUNICATIONS, SPAWC 2024, 2024, : 371 - 375
  • [50] Machine Learning & Concept Drift based Approach for Malicious Website Detection
    Singhal, Siddharth
    Chawla, Utkarsh
    Shorey, Rajeev
    2020 INTERNATIONAL CONFERENCE ON COMMUNICATION SYSTEMS & NETWORKS (COMSNETS), 2020,