Diversity improvement in homogeneous ensemble feature selection: a case study of its impact on classification performance

被引:1
|
作者
Nosrati, Vahid [1 ]
Rahmani, Mohsen [1 ]
机构
[1] Arak Univ, Fac Engn, Dept Comp Engn, 3815688349, Arak, Iran
来源
NEURAL COMPUTING & APPLICATIONS | 2023年 / 35卷 / 21期
关键词
Aggregation criterion; Data partitioning; Diversity measurement; Ensemble diversity; Feature selection; Homogeneous ensemble learning;
D O I
10.1007/s00521-023-08547-4
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Despite ensemble learning has recently been usefully applied in feature selection (FS) models, there are some issues such as the diversity and its effects on model performance that need to be more investigated. Diversity is a crucial property in the success of ensemble FS models, so that ignoring it and trying to improve only accuracy makes the model suffer from "diminishing returns". This led us in this paper to focus on enhancing the diversity paradigm in the homogeneous ensemble feature selection problem via applying a partitioning approach named recursive balanced partitioning (RBP) that deliberately divides the instances into several different partitions. Besides, a new diversity measurement in ensemble FS and a new aggregation criterion named min-mean by taking "minimum" and "mean" criteria is proposed. Experimental results on twelve datasets illustrated that the proposed RBP efficaciously outperforms the traditional random partitioning as a baseline in terms of diversity achievement. Furthermore, examining the impact of diversity on classification accuracy through a case study revealed that the proposed partitioning approach provides more classification accuracy than the baselines; moreover, it was demonstrated that there is an almost positive relationship between diversity and accuracy. These findings can lead to further understanding of the effectiveness of diversity in an ensemble learning pattern.
引用
收藏
页码:15647 / 15665
页数:19
相关论文
共 50 条
  • [31] Ensemble Learning Based Feature Selection with an Application to Text Classification
    Onan, Aytug
    2018 26TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2018,
  • [32] Sentiment classification using hybrid feature selection and ensemble classifier
    Jain, Achin
    Jain, Vanita
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2022, 42 (02) : 659 - 668
  • [33] Ensemble Classification Based on Feature Selection for Environmental Sound Recognition
    Zhao, Shuai
    Zhang, Yan
    Xu, Haifeng
    Han, Te
    MATHEMATICAL PROBLEMS IN ENGINEERING, 2019, 2019
  • [34] An Ensemble Hybrid Feature Selection Method for Neuropsychiatric Disorder Classification
    Liu, Liangliang
    Tang, Shaojie
    Wu, Fang-Xiang
    Wang, Yu-Ping
    Wang, Jianxin
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2022, 19 (03) : 1459 - 1471
  • [35] Imbalanced Network Traffic Classification based on Ensemble Feature Selection
    Ding, Yaojun
    2016 IEEE INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, COMMUNICATIONS AND COMPUTING (ICSPCC), 2016,
  • [36] Ensemble feature selection with the simple Bayesian classification in medical diagnostics
    Tsymbal, A
    Puuronen, S
    PROCEEDINGS OF THE 15TH IEEE SYMPOSIUM ON COMPUTER-BASED MEDICAL SYSTEMS, 2002, : 225 - 230
  • [37] An improved tree model based on ensemble feature selection for classification
    Mohan, Chandralekha
    Nagarajan, Shenbagavadivu
    TURKISH JOURNAL OF ELECTRICAL ENGINEERING AND COMPUTER SCIENCES, 2019, 27 (02) : 1290 - 1307
  • [38] Hybrid Feature Selection and Ensemble Learning Methods for Gene Selection and Cancer Classification
    Qasem, Sultan Noman
    Saeed, Faisal
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2021, 12 (02) : 193 - 200
  • [39] Classification Performance Improvement Using Random Subset Feature Selection Algorithm for Data Mining
    Lakshmipadmaja, D.
    Vishnuvardhan, B.
    BIG DATA RESEARCH, 2018, 12 : 1 - 12
  • [40] A Comprehensive Study of Eleven Feature Selection Algorithms and their Impact on Text Classification
    Vora, Suchi
    Yang, Hui
    2017 COMPUTING CONFERENCE, 2017, : 440 - 449