iSOUP-SymRF: Symbolic feature ranking with random forests in online multi-target regression and multi-label classification

被引:0
|
作者
Osojnik, Aljaz [1 ]
Panov, Pance [1 ,2 ]
Dzeroski, Saso [1 ,2 ]
机构
[1] Jozef Stefan Inst, Dept Knowledge Technol, Jamova 39, Ljubljana, Slovenia
[2] Jozef Stefan Int Postgrad Sch, Jamova 39, Ljubljana, Slovenia
基金
欧盟地平线“2020”;
关键词
Online learning; Feature ranking; Multi-target regression; Multi-label classification; FEATURE-SELECTION;
D O I
10.1007/s10994-024-06718-5
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The task of feature ranking has received considerable attention across various predictive modelling tasks in the batch learning scenario, but not in the online learning setting. Available methods that estimate feature importances on data streams have so far predominantly focused on ranking the features for the tasks of classification and occasionally multi-label classification. We propose a novel online feature ranking method for online multi-target regression iSOUP-SymRF, which estimates feature importance scores based on the positions at which a feature appears in the trees of a random forest of iSOUP-Trees, and additionally extend it to task of online feature ranking for multi-label classification. By utilizing iSOUP-Trees, which can address multiple structured output prediction tasks on data streams, iSOUP-SymRF promises feature ranking across a variety of online structured output prediction tasks. We examine the ranking convergence of iSOUP-SymRF in terms of the methods' parameters, the size of the ensemble and the number of selected features, as well as their stability under different random seeds. Furthermore, to show the utility of iSOUP-SymRF and its rankings we use them in conjunction with two state-of-the-art online multi-target regression and multi-label classification methods, iSOUP-Tree and AMRules, and analyze the impact of adding features according to the rankings obtained from iSOUP-SymRF.
引用
收藏
页数:24
相关论文
共 50 条
  • [1] Multi-label Classification via Multi-target Regression on Data Streams
    Osojnik, Aljaz
    Panov, Pance
    Dzeroski, Saso
    DISCOVERY SCIENCE, DS 2015, 2015, 9356 : 170 - 185
  • [2] Multi-label classification via multi-target regression on data streams
    Osojnik, Aljaz
    Panov, Pance
    Dzeroski, Saso
    MACHINE LEARNING, 2017, 106 (06) : 745 - 770
  • [3] Multi-label classification via multi-target regression on data streams
    Aljaž Osojnik
    Panče Panov
    Sašo Džeroski
    Machine Learning, 2017, 106 : 745 - 770
  • [4] Feature ranking for multi-target regression
    Petkovic, Matej
    Kocev, Dragi
    Dzeroski, Saso
    MACHINE LEARNING, 2020, 109 (06) : 1179 - 1204
  • [5] Feature ranking for multi-target regression
    Matej Petković
    Dragi Kocev
    Sašo Džeroski
    Machine Learning, 2020, 109 : 1179 - 1204
  • [6] ELM: a novel ensemble learning method for multi-target regression and multi-label classification problems
    Wu, Yuxuan
    Guo, Guikai
    Gao, Huanhuan
    APPLIED INTELLIGENCE, 2024, 54 (17-18) : 7674 - 7695
  • [7] Localized Feature Ranking approach for Multi-Target Regression
    Bertrand, Hugo
    Elghazel, Haytham
    Masmoudi, Sahar
    Coquery, Emmanuel
    Hacid, Mohand-Said
    2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,
  • [8] Feature ranking for multi-label classification using Markov networks
    Teisseyre, Pawek
    NEUROCOMPUTING, 2016, 205 : 439 - 454
  • [9] Feature Ranking for Multi-target Regression with Tree Ensemble Methods
    Petkovic, Matej
    Dzeroski, Sao
    Kocev, Dragi
    DISCOVERY SCIENCE, DS 2017, 2017, 10558 : 171 - 185
  • [10] Multi-Target Core Network-based Networked Multi-label Classification
    Li, Lei
    Zhang, Fang
    Ma, Di
    Zhou, Chuan
    Hu, Xuegang
    2018 9TH IEEE INTERNATIONAL CONFERENCE ON BIG KNOWLEDGE (ICBK), 2018, : 274 - 281