Hoeffding adaptive trees for multi-label classification on data streams

被引:0
|
作者
Esteban, Aurora [1 ]
Cano, Alberto [2 ]
Zafra, Amelia [1 ]
Ventura, Sebastian [1 ]
机构
[1] Univ Cordoba, Andalusian Res Inst Data Sci & Computat Intelligen, Dept Comp Sci & Numer Anal, Cordoba 14071, Spain
[2] Virginia Commonwealth Univ, Dept Comp Sci, Richmond, VA 23284 USA
关键词
Multi-label classification; Data streams; Incremental decision trees;
D O I
10.1016/j.knosys.2024.112561
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Data stream learning is a very relevant paradigm because of the increasing real-world scenarios generating data at high velocities and in unbounded sequences. Stream learning aims at developing models that can process instances as they arrive, so models constantly adapt to new concepts and the temporal evolution in the stream. In multi-label data stream environments where instances have the peculiarity of belonging simultaneously to more than one class, the problem becomes even more complex and poses unique challenges such as different concept drifts impacting different labels at simultaneous or distinct times, higher class imbalance, or new labels emerging in the stream. This paper proposes a novel approach to multi-label data stream classification called Multi-Label Hoeffding Adaptive Tree (MLHAT). MLHAT leverages the Hoeffding adaptive tree to address these challenges by considering possible relations and label co-occurrences in the partitioning process of the decision tree, dynamically adapting the learner in each leaf node of the tree, and implementing a concept drift detector that can quickly detect and replace tree branches that are no longer performing well. The proposed approach is compared with other 18 online multi-label classifiers on 41 datasets. The results, validated with statistical analysis, show that MLHAT outperforms other state-of-the-art approaches in 12 well-known multi-label metrics.
引用
收藏
页数:21
相关论文
共 50 条
  • [1] Incremental multi-label classification of evolving data streams
    Yin, Zhiwu
    Huang, Shangteng
    Journal of Computational Information Systems, 2007, 3 (06): : 2189 - 2193
  • [2] Weighted Ensemble Classification of Multi-label Data Streams
    Wang, Lulu
    Shen, Hong
    Tian, Hui
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2017, PT II, 2017, 10235 : 551 - 562
  • [3] Scalable and efficient multi-label classification for evolving data streams
    Jesse Read
    Albert Bifet
    Geoff Holmes
    Bernhard Pfahringer
    Machine Learning, 2012, 88 : 243 - 272
  • [4] Scalable and efficient multi-label classification for evolving data streams
    Read, Jesse
    Bifet, Albert
    Holmes, Geoff
    Pfahringer, Bernhard
    MACHINE LEARNING, 2012, 88 (1-2) : 243 - 272
  • [5] Multi-label classification via multi-target regression on data streams
    Osojnik, Aljaz
    Panov, Pance
    Dzeroski, Saso
    MACHINE LEARNING, 2017, 106 (06) : 745 - 770
  • [6] Multi-label classification via multi-target regression on data streams
    Aljaž Osojnik
    Panče Panov
    Sašo Džeroski
    Machine Learning, 2017, 106 : 745 - 770
  • [7] Multi-label Classification via Multi-target Regression on Data Streams
    Osojnik, Aljaz
    Panov, Pance
    Dzeroski, Saso
    DISCOVERY SCIENCE, DS 2015, 2015, 9356 : 170 - 185
  • [8] Efficient Ensemble Classification for Multi-Label Data Streams with Concept Drift
    Sun, Yange
    Shao, Han
    Wang, Shasha
    INFORMATION, 2019, 10 (05)
  • [9] Learning Hierarchical Multi-label Classification Trees from Network Data
    Stojanova, Daniela
    Ceci, Michelangelo
    Malerba, Donato
    Dzeroski, Saso
    DISCOVERY SCIENCE, 2013, 8140 : 233 - 248
  • [10] Decision trees for hierarchical multi-label classification
    Vens, Celine
    Struyf, Jan
    Schietgat, Leander
    Dzeroski, Saso
    Blockeel, Hendrik
    MACHINE LEARNING, 2008, 73 (02) : 185 - 214