Hoeffding adaptive trees for multi-label classification on data streams

Cited by: 0
Authors
Esteban, Aurora [1]
Cano, Alberto [2]
Zafra, Amelia [1]
Ventura, Sebastian [1]
Affiliations
[1] Univ Cordoba, Andalusian Res Inst Data Sci & Computat Intelligen, Dept Comp Sci & Numer Anal, Cordoba 14071, Spain
[2] Virginia Commonwealth Univ, Dept Comp Sci, Richmond, VA 23284 USA
Keywords
Multi-label classification; Data streams; Incremental decision trees;
DOI
10.1016/j.knosys.2024.112561
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Data stream learning is a highly relevant paradigm because of the growing number of real-world scenarios that generate data at high velocity and in unbounded sequences. Stream learning aims to develop models that can process instances as they arrive, so that models constantly adapt to new concepts and to the temporal evolution of the stream. In multi-label data stream environments, where instances can belong simultaneously to more than one class, the problem becomes even more complex and poses unique challenges, such as different concept drifts impacting different labels at simultaneous or distinct times, higher class imbalance, or new labels emerging in the stream. This paper proposes a novel approach to multi-label data stream classification called Multi-Label Hoeffding Adaptive Tree (MLHAT). MLHAT leverages the Hoeffding adaptive tree to address these challenges by considering possible relations and label co-occurrences in the partitioning process of the decision tree, dynamically adapting the learner in each leaf node of the tree, and implementing a concept drift detector that can quickly detect and replace tree branches that are no longer performing well. The proposed approach is compared with 18 other online multi-label classifiers on 41 datasets. The results, validated with statistical analysis, show that MLHAT outperforms other state-of-the-art approaches in 12 well-known multi-label metrics.
Pages: 21
Related papers
50 in total
  • [31] Option Predictive Clustering Trees for Hierarchical Multi-label Classification
    Perdih, Tomaz Stepisnik
    Osojnik, Aljaz
    Dzeroski, Saso
    Kocev, Dragi
    DISCOVERY SCIENCE, DS 2017, 2017, 10558 : 116 - 123
  • [32] Unsupervised concept drift detection for multi-label data streams
    Ege Berkay Gulcan
    Fazli Can
    Artificial Intelligence Review, 2023, 56 : 2401 - 2434
  • [33] Incremental deep forest for multi-label data streams learning
    Liang, Shunpan
    Pan, Weiwei
    You, Dianlong
    Liu, Ze
    Yin, Ling
    APPLIED INTELLIGENCE, 2022, 52 (12) : 13398 - 13414
  • [34] Unsupervised concept drift detection for multi-label data streams
    Gulcan, Ege Berkay
    Can, Fazli
    ARTIFICIAL INTELLIGENCE REVIEW, 2023, 56 (03) : 2401 - 2434
  • [35] Drift Detection for Multi-label Data Streams Based on Label Grouping and Entropy
    Shi, Zhongwei
    Wen, Yimin
    Feng, Chao
    Zhao, Hai
    2014 IEEE INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOP (ICDMW), 2014, : 724 - 731
  • [36] Multi-label classification with label clusters
    Gatto, Elaine Cecilia
    Ferrandin, Mauri
    Cerri, Ricardo
    KNOWLEDGE AND INFORMATION SYSTEMS, 2025, 67 (02) : 1741 - 1785
  • [37] Label Expansion for Multi-Label Classification
    Rivolli, Adriano
    Soares, Carlos
    de Carvalho, Andre C. P. L. F.
    2018 7TH BRAZILIAN CONFERENCE ON INTELLIGENT SYSTEMS (BRACIS), 2018, : 414 - 419
  • [38] Parallelization of Multi-label classification for large data sets
    Biswas, Shinjini
    Devi, V. Susheela
    2018 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (IEEE SSCI), 2018, : 2005 - 2010
  • [39] A Combined Approach for Multi-Label Text Data Classification
    Strimaitis, Rokas
    Stefanovic, Pavel
    Ramanauskaite, Simona
    Slotkiene, Asta
    COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2022, 2022
  • [40] Active Learning in Multi-label Classification of Bioacoustic Data
    Kath, Hannes
    Gouvea, Thiago S.
    Sonntag, Daniel
    KI 2024: ADVANCES IN ARTIFICIAL INTELLIGENCE, KI 2024, 2024, 14992 : 114 - 127