Diversity improvement in homogeneous ensemble feature selection: a case study of its impact on classification performance

被引:1
|
作者
Nosrati, Vahid [1 ]
Rahmani, Mohsen [1 ]
机构
[1] Arak Univ, Fac Engn, Dept Comp Engn, 3815688349, Arak, Iran
来源
NEURAL COMPUTING & APPLICATIONS | 2023年 / 35卷 / 21期
关键词
Aggregation criterion; Data partitioning; Diversity measurement; Ensemble diversity; Feature selection; Homogeneous ensemble learning;
D O I
10.1007/s00521-023-08547-4
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Despite ensemble learning has recently been usefully applied in feature selection (FS) models, there are some issues such as the diversity and its effects on model performance that need to be more investigated. Diversity is a crucial property in the success of ensemble FS models, so that ignoring it and trying to improve only accuracy makes the model suffer from "diminishing returns". This led us in this paper to focus on enhancing the diversity paradigm in the homogeneous ensemble feature selection problem via applying a partitioning approach named recursive balanced partitioning (RBP) that deliberately divides the instances into several different partitions. Besides, a new diversity measurement in ensemble FS and a new aggregation criterion named min-mean by taking "minimum" and "mean" criteria is proposed. Experimental results on twelve datasets illustrated that the proposed RBP efficaciously outperforms the traditional random partitioning as a baseline in terms of diversity achievement. Furthermore, examining the impact of diversity on classification accuracy through a case study revealed that the proposed partitioning approach provides more classification accuracy than the baselines; moreover, it was demonstrated that there is an almost positive relationship between diversity and accuracy. These findings can lead to further understanding of the effectiveness of diversity in an ensemble learning pattern.
引用
收藏
页码:15647 / 15665
页数:19
相关论文
共 50 条
  • [41] Combining diversity and classification accuracy for ensemble selection in random subspaces
    Ko, Albert Hung-Ren
    Sabourin, Robert
    Britto, Alceu de Souza, Jr.
    2006 IEEE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORK PROCEEDINGS, VOLS 1-10, 2006, : 2144 - +
  • [42] Homogeneous Ensemble Feature Selection for Mass Spectrometry Data Prediction in Cancer Studies
    Liang, Yulan
    Gharipour, Amin
    Kelemen, Erik
    Kelemen, Arpad
    MATHEMATICS, 2024, 12 (13)
  • [43] The impact of feature selection on medical document classification
    Parlak, Bekir
    Uysal, Alper Kursat
    2016 11TH IBERIAN CONFERENCE ON INFORMATION SYSTEMS AND TECHNOLOGIES (CISTI), 2016,
  • [44] An Empirical Study on the Performance of Rule-Based Classification by Feature Selection
    Balakrishnan, Sarojini
    Babu, M. R.
    Krishna, P. V.
    2014 WORLD CONGRESS ON COMPUTING AND COMMUNICATION TECHNOLOGIES (WCCCT 2014), 2014, : 147 - +
  • [45] Deep feature fusion and optimized feature selection based ensemble classification of liver lesions
    Anisha, A.
    Jiji, G.
    Raj, T. Ajith Bosco
    IMAGING SCIENCE JOURNAL, 2023, 71 (06): : 518 - 536
  • [46] ENSEMBLE FEATURE SELECTION APPROACH BASED ON FEATURE RANKING FOR RICE SEED IMAGES CLASSIFICATION
    Dzi Lam Tran Tuan
    Surinwarangkoon, Thongchai
    Meethongjan, Kittikhun
    Vinh Truong Hoang
    ADVANCES IN ELECTRICAL AND ELECTRONIC ENGINEERING, 2020, 18 (03) : 198 - 206
  • [47] Utilizing Ensemble, Data Sampling and Feature Selection Techniques for Improving Classification Performance on Tweet Sentiment Data
    Prusa, Joseph
    Khoshgoftaar, Taghi M.
    Napolitano, Amri
    2015 IEEE 14TH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA), 2015, : 535 - 542
  • [48] Improvement classification performance by the support vector machine ensemble
    Research Inst. of Intelligent Information Processing, Xidian Univ., Xi'an 710071, China
    Xi'an Dianzi Keji Daxue Xuebao, 2007, 1 (68-70+105):
  • [49] Toward improving the performance of learning by joining feature selection and ensemble classification techniques: an application for cancer diagnosis
    Dan Wang
    Journal of Cancer Research and Clinical Oncology, 2023, 149 : 16993 - 17006
  • [50] Significant Improvement in Classification Performance Metrics by Ensemble Approach
    Thalor, Meenakshi A.
    Bhende, Veena
    Chavan, Amrapali
    ICCCE 2019: PROCEEDINGS OF THE 2ND INTERNATIONAL CONFERENCE ON COMMUNICATIONS AND CYBER-PHYSICAL ENGINEERING, 2020, 570 : 243 - 249