iSOUP-SymRF: Symbolic feature ranking with random forests in online multi-target regression and multi-label classification

被引:0
|
作者
Osojnik, Aljaz [1 ]
Panov, Pance [1 ,2 ]
Dzeroski, Saso [1 ,2 ]
机构
[1] Jozef Stefan Inst, Dept Knowledge Technol, Jamova 39, Ljubljana, Slovenia
[2] Jozef Stefan Int Postgrad Sch, Jamova 39, Ljubljana, Slovenia
基金
欧盟地平线“2020”;
关键词
Online learning; Feature ranking; Multi-target regression; Multi-label classification; FEATURE-SELECTION;
D O I
10.1007/s10994-024-06718-5
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The task of feature ranking has received considerable attention across various predictive modelling tasks in the batch learning scenario, but not in the online learning setting. Available methods that estimate feature importances on data streams have so far predominantly focused on ranking the features for the tasks of classification and occasionally multi-label classification. We propose a novel online feature ranking method for online multi-target regression iSOUP-SymRF, which estimates feature importance scores based on the positions at which a feature appears in the trees of a random forest of iSOUP-Trees, and additionally extend it to task of online feature ranking for multi-label classification. By utilizing iSOUP-Trees, which can address multiple structured output prediction tasks on data streams, iSOUP-SymRF promises feature ranking across a variety of online structured output prediction tasks. We examine the ranking convergence of iSOUP-SymRF in terms of the methods' parameters, the size of the ensemble and the number of selected features, as well as their stability under different random seeds. Furthermore, to show the utility of iSOUP-SymRF and its rankings we use them in conjunction with two state-of-the-art online multi-target regression and multi-label classification methods, iSOUP-Tree and AMRules, and analyze the impact of adding features according to the rankings obtained from iSOUP-SymRF.
引用
收藏
页数:24
相关论文
共 50 条
  • [41] Multi-Label Attribute Reduction Based on Neighborhood Multi-Target Rough Sets
    Zheng, Wenbin
    Li, Jinjin
    Liao, Shujiao
    Lin, Yidong
    SYMMETRY-BASEL, 2022, 14 (08):
  • [42] Combining Dimensionality Reduction with Random Forests for Multi-label Classification Under Interactivity Constraints
    Nair-Benrekia, Noureddine-Yassine
    Kuntz, Pascale
    Meyer, Frank
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2017, PT II, 2017, 10235 : 828 - 839
  • [43] Online Biomedical Publication Classification Using Multi-Instance Multi-Label Algorithms with Feature Reduction
    Ren, Dong
    Ma, Long
    Zhang, Yanqing
    Sunderraman, Raj
    Laird, Angela R.
    Turner, Jessica A.
    Fox, Peter T.
    Turner, Matthew D.
    PROCEEDINGS OF 2015 IEEE 14TH INTERNATIONAL CONFERENCE ON COGNITIVE INFORMATICS & COGNITIVE COMPUTING (ICCI*CC), 2015, : 234 - 241
  • [44] Multi-label Classification Using Random Label Subset Selections
    Breskvar, Martin
    Kocev, Dragi
    Dzeroski, Saso
    DISCOVERY SCIENCE, DS 2017, 2017, 10558 : 108 - 115
  • [45] Multi-label symbolic value partitioning through random walks
    Wen, Liu-Ying
    Luo, Chao-Guang
    Wu, Wei-Zhi
    Min, Fan
    NEUROCOMPUTING, 2020, 387 : 195 - 209
  • [46] Multi-task Joint Feature Selection for Multi-label Classification
    HE Zhifen
    YANG Ming
    LIU Huidong
    Chinese Journal of Electronics, 2015, 24 (02) : 281 - 287
  • [47] Multi-task Joint Feature Selection for Multi-label Classification
    He Zhifen
    Yang Ming
    Liu Huidong
    CHINESE JOURNAL OF ELECTRONICS, 2015, 24 (02) : 281 - 287
  • [48] LSTM2: Multi-Label Ranking for Document Classification
    Yan, Yan
    Wang, Ying
    Gao, Wen-Chao
    Zhang, Bo-Wen
    Yang, Chun
    Yin, Xu-Cheng
    NEURAL PROCESSING LETTERS, 2018, 47 (01) : 117 - 138
  • [49] Feature selection for multi-label naive Bayes classification
    Zhang, Min-Ling
    Pena, Jose M.
    Robles, Victor
    INFORMATION SCIENCES, 2009, 179 (19) : 3218 - 3229
  • [50] Ranking-Based Autoencoder for Extreme Multi-label Classification
    Wang, Bingyu
    Chen, Li
    Sun, Wei
    Qin, Kechen
    Li, Kefeng
    Zhou, Hui
    2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, 2019, : 2820 - 2830