iSOUP-SymRF: Symbolic feature ranking with random forests in online multi-target regression and multi-label classification

被引:0
|
作者
Osojnik, Aljaz [1 ]
Panov, Pance [1 ,2 ]
Dzeroski, Saso [1 ,2 ]
机构
[1] Jozef Stefan Inst, Dept Knowledge Technol, Jamova 39, Ljubljana, Slovenia
[2] Jozef Stefan Int Postgrad Sch, Jamova 39, Ljubljana, Slovenia
基金
欧盟地平线“2020”;
关键词
Online learning; Feature ranking; Multi-target regression; Multi-label classification; FEATURE-SELECTION;
D O I
10.1007/s10994-024-06718-5
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The task of feature ranking has received considerable attention across various predictive modelling tasks in the batch learning scenario, but not in the online learning setting. Available methods that estimate feature importances on data streams have so far predominantly focused on ranking the features for the tasks of classification and occasionally multi-label classification. We propose a novel online feature ranking method for online multi-target regression iSOUP-SymRF, which estimates feature importance scores based on the positions at which a feature appears in the trees of a random forest of iSOUP-Trees, and additionally extend it to task of online feature ranking for multi-label classification. By utilizing iSOUP-Trees, which can address multiple structured output prediction tasks on data streams, iSOUP-SymRF promises feature ranking across a variety of online structured output prediction tasks. We examine the ranking convergence of iSOUP-SymRF in terms of the methods' parameters, the size of the ensemble and the number of selected features, as well as their stability under different random seeds. Furthermore, to show the utility of iSOUP-SymRF and its rankings we use them in conjunction with two state-of-the-art online multi-target regression and multi-label classification methods, iSOUP-Tree and AMRules, and analyze the impact of adding features according to the rankings obtained from iSOUP-SymRF.
引用
收藏
页数:24
相关论文
共 50 条
  • [31] A model for multi-label classification and ranking of learning objects
    Lopez, Vivian F.
    de la Prieta, Fernando
    Ogihara, Mitsunori
    Wong, Ding Ding
    EXPERT SYSTEMS WITH APPLICATIONS, 2012, 39 (10) : 8878 - 8884
  • [32] Improving Pairwise Ranking for Multi-label Image Classification
    Li, Yuncheng
    Song, Yale
    Luo, Jiebo
    30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 1837 - 1845
  • [33] A ranking-based feature selection for multi-label classification with fuzzy relative discernibility
    Qian, Wenbin
    Xiong, Chuanzhen
    Wang, Yinglong
    APPLIED SOFT COMPUTING, 2021, 102
  • [34] Multi-label Random Subspace Ensemble Classification
    Bi, Fan
    Zhu, Jianan
    Feng, Yang
    JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 2024,
  • [35] Online Metric Learning for Multi-Label Classification
    Gong, Xiuwen
    Yuan, Dong
    Bao, Wei
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 4012 - 4019
  • [36] Multi-Label Random Forest Model for Tuberculosis Drug Resistance Classification and Mutation Ranking
    Kouchaki, Samaneh
    Yang, Yang
    Lachapelle, Alexander
    Walker, Timothy M.
    Walker, A. Sarah
    Peto, Timothy E. A.
    Crook, Derrick W.
    Clifton, David A.
    FRONTIERS IN MICROBIOLOGY, 2020, 11
  • [37] Online Multi-Label Streaming Feature Selection With Label Correlation
    You, Dianlong
    Wang, Yang
    Xiao, Jiawei
    Lin, Yaojin
    Pan, Maosheng
    Chen, Zhen
    Shen, Limin
    Wu, Xindong
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2023, 35 (03) : 2901 - 2915
  • [38] Online Semi-supervised Multi-label Classification with Label Compression and Local Smooth Regression
    Li, Peiyan
    Wang, Honglian
    Boehm, Christian
    Shao, Junming
    PROCEEDINGS OF THE TWENTY-NINTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2020, : 1359 - 1365
  • [39] Exploiting Label Dependency and Feature Similarity for Multi-Label Classification
    Nedungadi, Prema
    Haripriya, H.
    2014 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATIONS AND INFORMATICS (ICACCI), 2014, : 2196 - 2200
  • [40] Multi-label and Multi-target Sampling of Machine Annotation for Computational Stance Detection
    Liu, Zhengyuan
    Chieu, Hai Leong
    Chen, Nancy F.
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS - EMNLP 2023, 2023, : 2641 - 2649