Hoeffding adaptive trees for multi-label classification on data streams

被引:0
|
作者
Esteban, Aurora [1 ]
Cano, Alberto [2 ]
Zafra, Amelia [1 ]
Ventura, Sebastian [1 ]
机构
[1] Univ Cordoba, Andalusian Res Inst Data Sci & Computat Intelligen, Dept Comp Sci & Numer Anal, Cordoba 14071, Spain
[2] Virginia Commonwealth Univ, Dept Comp Sci, Richmond, VA 23284 USA
关键词
Multi-label classification; Data streams; Incremental decision trees;
D O I
10.1016/j.knosys.2024.112561
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Data stream learning is a very relevant paradigm because of the increasing real-world scenarios generating data at high velocities and in unbounded sequences. Stream learning aims at developing models that can process instances as they arrive, so models constantly adapt to new concepts and the temporal evolution in the stream. In multi-label data stream environments where instances have the peculiarity of belonging simultaneously to more than one class, the problem becomes even more complex and poses unique challenges such as different concept drifts impacting different labels at simultaneous or distinct times, higher class imbalance, or new labels emerging in the stream. This paper proposes a novel approach to multi-label data stream classification called Multi-Label Hoeffding Adaptive Tree (MLHAT). MLHAT leverages the Hoeffding adaptive tree to address these challenges by considering possible relations and label co-occurrences in the partitioning process of the decision tree, dynamically adapting the learner in each leaf node of the tree, and implementing a concept drift detector that can quickly detect and replace tree branches that are no longer performing well. The proposed approach is compared with other 18 online multi-label classifiers on 41 datasets. The results, validated with statistical analysis, show that MLHAT outperforms other state-of-the-art approaches in 12 well-known multi-label metrics.
引用
收藏
页数:21
相关论文
共 50 条
  • [21] Fuzzy Rough Decision Trees for Multi-label Classification
    Wang, Xiaoxue
    An, Shuang
    Shi, Hong
    Hu, Qinghua
    ROUGH SETS, FUZZY SETS, DATA MINING, AND GRANULAR COMPUTING, RSFDGRC 2015, 2015, 9437 : 207 - 217
  • [22] Learning Regularized Hoeffding Trees from Data Streams
    Barddal, Jean Paul
    Enembreck, Fabricio
    SAC '19: PROCEEDINGS OF THE 34TH ACM/SIGAPP SYMPOSIUM ON APPLIED COMPUTING, 2019, : 574 - 581
  • [23] A Survey on Multi-Label Data Stream Classification
    Zheng, Xiulin
    Li, Peipei
    Chu, Zhe
    Hu, Xuegang
    IEEE ACCESS, 2020, 8 : 1249 - 1275
  • [24] A Multi-label and Adaptive Genre Classification of Web Pages
    Jebari, Chaker
    Wani, M. Arif
    2012 11TH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA 2012), VOL 1, 2012, : 578 - 581
  • [25] Bonsai: diverse and shallow trees for extreme multi-label classification
    Sujay Khandagale
    Han Xiao
    Rohit Babbar
    Machine Learning, 2020, 109 : 2099 - 2119
  • [26] Online Multi-label Classification with Adaptive Model Rules
    Sousa, Ricardo
    Gama, Joao
    ADVANCES IN ARTIFICIAL INTELLIGENCE, CAEPIA 2016, 2016, 9868 : 58 - 67
  • [27] Multi-label Classification Based on Adaptive Resonance Theory
    Masuyama, Naoki
    Nojima, Yusuke
    Loo, Chu Kiong
    Ishibuchi, Hisao
    2020 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (SSCI), 2020, : 1913 - 1920
  • [28] Multi-label Collective Classification using Adaptive Neighborhoods
    Saha, Tanwistha
    Rangwala, Huzefa
    Domeniconi, Carlotta
    2012 11TH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA 2012), VOL 1, 2012, : 427 - 432
  • [29] Adaptive knowledge graph for multi-label image classification
    Lin, Zhihong
    Tang, Xue-song
    Hao, Kuangrong
    Zhao, Mingbo
    Li, Yubing
    APPLIED INTELLIGENCE, 2025, 55 (01)
  • [30] Bonsai: diverse and shallow trees for extreme multi-label classification
    Khandagale, Sujay
    Xiao, Han
    Babbar, Rohit
    MACHINE LEARNING, 2020, 109 (11) : 2099 - 2119