iSOUP-SymRF: Symbolic feature ranking with random forests in online multi-target regression and multi-label classification

被引：0

作者：

Osojnik, Aljaz ^{[1
]}

Panov, Pance ^{[1
,2
]}

Dzeroski, Saso ^{[1
,2
]}

机构：

[1] Jozef Stefan Inst, Dept Knowledge Technol, Jamova 39, Ljubljana, Slovenia

[2] Jozef Stefan Int Postgrad Sch, Jamova 39, Ljubljana, Slovenia

来源：

MACHINE LEARNING | 2025年 / 114卷 / 02期

基金：

欧盟地平线“2020”;

关键词：

Online learning; Feature ranking; Multi-target regression; Multi-label classification; FEATURE-SELECTION;

D O I：

10.1007/s10994-024-06718-5

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

The task of feature ranking has received considerable attention across various predictive modelling tasks in the batch learning scenario, but not in the online learning setting. Available methods that estimate feature importances on data streams have so far predominantly focused on ranking the features for the tasks of classification and occasionally multi-label classification. We propose a novel online feature ranking method for online multi-target regression iSOUP-SymRF, which estimates feature importance scores based on the positions at which a feature appears in the trees of a random forest of iSOUP-Trees, and additionally extend it to task of online feature ranking for multi-label classification. By utilizing iSOUP-Trees, which can address multiple structured output prediction tasks on data streams, iSOUP-SymRF promises feature ranking across a variety of online structured output prediction tasks. We examine the ranking convergence of iSOUP-SymRF in terms of the methods' parameters, the size of the ensemble and the number of selected features, as well as their stability under different random seeds. Furthermore, to show the utility of iSOUP-SymRF and its rankings we use them in conjunction with two state-of-the-art online multi-target regression and multi-label classification methods, iSOUP-Tree and AMRules, and analyze the impact of adding features according to the rankings obtained from iSOUP-SymRF.

引用

页数：24

共 50 条

[41] Multi-Label Attribute Reduction Based on Neighborhood Multi-Target Rough Sets
Zheng, Wenbin
Li, Jinjin
Liao, Shujiao
Lin, Yidong
SYMMETRY-BASEL, 2022, 14 (08):
[42] Combining Dimensionality Reduction with Random Forests for Multi-label Classification Under Interactivity Constraints
Nair-Benrekia, Noureddine-Yassine
Kuntz, Pascale
Meyer, Frank
ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2017, PT II, 2017, 10235 : 828 - 839
[43] Online Biomedical Publication Classification Using Multi-Instance Multi-Label Algorithms with Feature Reduction
Ren, Dong
Ma, Long
Zhang, Yanqing
Sunderraman, Raj
Laird, Angela R.
Turner, Jessica A.
Fox, Peter T.
Turner, Matthew D.
PROCEEDINGS OF 2015 IEEE 14TH INTERNATIONAL CONFERENCE ON COGNITIVE INFORMATICS & COGNITIVE COMPUTING (ICCI*CC), 2015, : 234 - 241
[44] Multi-label Classification Using Random Label Subset Selections
Breskvar, Martin
Kocev, Dragi
Dzeroski, Saso
DISCOVERY SCIENCE, DS 2017, 2017, 10558 : 108 - 115
[45] Multi-label symbolic value partitioning through random walks
Wen, Liu-Ying
Luo, Chao-Guang
Wu, Wei-Zhi
Min, Fan
NEUROCOMPUTING, 2020, 387 : 195 - 209
[46] Multi-task Joint Feature Selection for Multi-label Classification
HE Zhifen
YANG Ming
LIU Huidong
Chinese Journal of Electronics, 2015, 24 (02) : 281 - 287
[47] Multi-task Joint Feature Selection for Multi-label Classification
He Zhifen
Yang Ming
Liu Huidong
CHINESE JOURNAL OF ELECTRONICS, 2015, 24 (02) : 281 - 287
[48] LSTM2: Multi-Label Ranking for Document Classification
Yan, Yan
Wang, Ying
Gao, Wen-Chao
Zhang, Bo-Wen
Yang, Chun
Yin, Xu-Cheng
NEURAL PROCESSING LETTERS, 2018, 47 (01) : 117 - 138
[49] Feature selection for multi-label naive Bayes classification
Zhang, Min-Ling
Pena, Jose M.
Robles, Victor
INFORMATION SCIENCES, 2009, 179 (19) : 3218 - 3229
[50] Ranking-Based Autoencoder for Extreme Multi-label Classification
Wang, Bingyu
Chen, Li
Sun, Wei
Qin, Kechen
Li, Kefeng
Zhou, Hui
2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, 2019, : 2820 - 2830

← 1 2 3 4 5 →