Diversity improvement in homogeneous ensemble feature selection: a case study of its impact on classification performance

被引:1
|
作者
Nosrati, Vahid [1 ]
Rahmani, Mohsen [1 ]
机构
[1] Arak Univ, Fac Engn, Dept Comp Engn, 3815688349, Arak, Iran
来源
NEURAL COMPUTING & APPLICATIONS | 2023年 / 35卷 / 21期
关键词
Aggregation criterion; Data partitioning; Diversity measurement; Ensemble diversity; Feature selection; Homogeneous ensemble learning;
D O I
10.1007/s00521-023-08547-4
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Despite ensemble learning has recently been usefully applied in feature selection (FS) models, there are some issues such as the diversity and its effects on model performance that need to be more investigated. Diversity is a crucial property in the success of ensemble FS models, so that ignoring it and trying to improve only accuracy makes the model suffer from "diminishing returns". This led us in this paper to focus on enhancing the diversity paradigm in the homogeneous ensemble feature selection problem via applying a partitioning approach named recursive balanced partitioning (RBP) that deliberately divides the instances into several different partitions. Besides, a new diversity measurement in ensemble FS and a new aggregation criterion named min-mean by taking "minimum" and "mean" criteria is proposed. Experimental results on twelve datasets illustrated that the proposed RBP efficaciously outperforms the traditional random partitioning as a baseline in terms of diversity achievement. Furthermore, examining the impact of diversity on classification accuracy through a case study revealed that the proposed partitioning approach provides more classification accuracy than the baselines; moreover, it was demonstrated that there is an almost positive relationship between diversity and accuracy. These findings can lead to further understanding of the effectiveness of diversity in an ensemble learning pattern.
引用
收藏
页码:15647 / 15665
页数:19
相关论文
共 50 条
  • [1] Diversity improvement in homogeneous ensemble feature selection: a case study of its impact on classification performance
    Vahid Nosrati
    Mohsen Rahmani
    Neural Computing and Applications, 2023, 35 : 15647 - 15665
  • [2] Robustness and Predictive Performance of Homogeneous Ensemble Feature Selection in Text Classification
    Mehta, Poornima
    Chandra, Satish
    INTERNATIONAL JOURNAL OF INFORMATION RETRIEVAL RESEARCH, 2021, 11 (01) : 75 - 89
  • [3] A new ensemble feature selection and its application to pattern classification
    Dongbo ZHANG 1
    2.College of Electrical and Information Engineering
    Journal of Control Theory and Applications, 2009, 7 (04) : 419 - 426
  • [4] A new ensemble feature selection and its application to pattern classification
    Zhang D.
    Wang Y.
    Journal of Control Theory and Applications, 2009, 7 (04): : 419 - 426
  • [5] A Study on the Improvement of Life Insurance Underwriting using the Feature Selection Method and Ensemble Classification Model
    Choi, Jung-Moon
    Kim, Yeong-Jin
    Lee, Je-Dong
    INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2022, 22 (01): : 367 - 374
  • [6] Ensemble feature selection: Homogeneous and heterogeneous approaches
    Seijo-Pardo, B.
    Porto-Diaz, I.
    Bolon-Canedo, V.
    Alonso-Betanzos, A.
    KNOWLEDGE-BASED SYSTEMS, 2017, 118 : 124 - 139
  • [7] EFS-MI: an ensemble feature selection method for classification An ensemble feature selection method
    Hoque, Nazrul
    Singh, Mihir
    Bhattacharyya, Dhruba K.
    COMPLEX & INTELLIGENT SYSTEMS, 2018, 4 (02) : 105 - 118
  • [8] Ensemble Feature Selection for Heart Disease Classification
    Benhar, Houda
    Idri, Ali
    Hosni, Mohamed
    HEALTHINF: PROCEEDINGS OF THE 15TH INTERNATIONAL JOINT CONFERENCE ON BIOMEDICAL ENGINEERING SYSTEMS AND TECHNOLOGIES - VOL 5: HEALTHINF, 2021, : 369 - 376
  • [9] Explainable feature selection and ensemble classification via feature polarity
    Zhou, Peng
    Liang, Ji
    Yan, Yuanting
    Zhao, Shu
    Wu, Xindong
    INFORMATION SCIENCES, 2024, 676
  • [10] A Hybrid Feature Selection with Ensemble Classification for Imbalanced Healthcare Data: A Case Study for Brain Tumor Diagnosis
    Huda S.
    Yearwood J.
    Jelinek H.F.
    Hassan M.M.
    Fortino G.
    Buckland M.
    IEEE Access, 2016, 4 : 9145 - 9154