A reliable adaptive prototype-based learning for evolving data streams with limited labels

被引:5
|
作者
Din, Salah Ud [1 ,2 ,3 ]
Ullah, Aman [1 ,2 ]
Mawuli, Cobbinah B. [1 ,2 ]
Yang, Qinli [1 ,2 ]
Shao, Junming [1 ,2 ]
机构
[1] Univ Elect Sci & Technol China, Yangtze Delta Reg Inst Huzhou, Huzhou 313001, Peoples R China
[2] Univ Elect Sci & Technol China, Sch Comp Sci & Engn, Chengdu 611731, Peoples R China
[3] COMSATS Univ Islamabad, Dept Comp Sci, Abbottabad Campus, Abbottabad 22020, Pakistan
基金
中国国家自然科学基金;
关键词
Data streams; Data-driven prototypes; Concept drift; Concept evolution; Semi-supervised classification; NONSTATIONARY DATA; CONCEPT DRIFT; CLASSIFICATION; ENSEMBLE; MODEL;
D O I
10.1016/j.ipm.2023.103532
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Data stream mining presents notable challenges in the form of concept drift and evolution. Existing learning algorithms, typically designed within a supervised learning framework, require class labels for all data points. However, this is an impractical requirement given the rapid pace of data streams, which often results in label scarcity. Recognizing the realistic necessity of learning from data streams with limited labels, we propose an adaptive, data-driven, prototype-based semi-supervised learning framework specifically tailored to handle evolving data streams. Our method employs a prototype-based data representation, summarizing the continuous flow of streaming data using dynamic prototypes at varying levels of granularity. This technique enables improved data abstraction, capturing the underlying local data distributions more accurately. The model also incorporates reliability modeling and efficient emerging class discovery, dynamically updating the significance of prototypes over time and swiftly adapting to local concept drift. We further leverage these adaptive prototypes to intuitively detect concept evolution, i.e., identifying novel classes from a local density perspective. To minimize the need for manual labeling while optimizing performance, we incorporate active learning into our method. This method employs a dual-criteria approach for data point selection, considering both uncertainty and local density. These manually labeled data points, together with unlabeled data, serve to update the model efficiently and robustly. Empirical validation using several bench-mark datasets demonstrates promising performance in comparison to existing state-of-the-art techniques.
引用
收藏
页数:22
相关论文
共 50 条
  • [21] Handling Delayed Labels in Temporally Evolving Data Streams
    Plasse, Joshua
    Adams, Niall
    2016 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2016, : 2416 - 2424
  • [22] Data warehouse testing: A prototype-based methodology
    Golfarelli, Matteo
    Rizzi, Stefano
    INFORMATION AND SOFTWARE TECHNOLOGY, 2011, 53 (11) : 1183 - 1198
  • [23] Adaptive XGBoost for Evolving Data Streams
    Montiel, Jacob
    Mitchell, Rory
    Frank, Eibe
    Pfahringer, Bernhard
    Abdessalem, Talel
    Bifet, Albert
    2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,
  • [24] Learning Prototype-based Classifiers by Margin Maximization
    Wakou, Chiharu
    Kusunoki, Yoshifumi
    Tatsumi, Keiji
    2017 JOINT 17TH WORLD CONGRESS OF INTERNATIONAL FUZZY SYSTEMS ASSOCIATION AND 9TH INTERNATIONAL CONFERENCE ON SOFT COMPUTING AND INTELLIGENT SYSTEMS (IFSA-SCIS), 2017,
  • [25] Hyperparameter learning in probabilistic prototype-based models
    Schneider, Petra
    Biehl, Michael
    Hammer, Barbara
    NEUROCOMPUTING, 2010, 73 (7-9) : 1117 - 1124
  • [26] Clustering Based Active Learning for Evolving Data Streams
    Ienco, Dino
    Bifet, Albert
    Zliobaite, Indre
    Pfahringer, Bernhard
    DISCOVERY SCIENCE, 2013, 8140 : 79 - 93
  • [27] Computational Advantages of Deep Prototype-Based Learning
    Hecht, Thomas
    Gepperth, Alexander
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2016, PT II, 2016, 9887 : 121 - 127
  • [28] A novel kernel prototype-based learning algorithm
    Qin, AK
    Suganthan, PN
    PROCEEDINGS OF THE 17TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL 4, 2004, : 621 - 624
  • [29] Prototype-based category learning in autism: A review
    Vanpaemel, Wolf
    Bayer, Janine
    NEUROSCIENCE AND BIOBEHAVIORAL REVIEWS, 2021, 127 : 607 - 618
  • [30] Learning interpretable kernelized prototype-based models
    Hofmann, Daniela
    Schleif, Frank-Michael
    Paassen, Benjamin
    Hammer, Barbara
    NEUROCOMPUTING, 2014, 141 : 84 - 96