A New Approach for Wrapper Feature Selection Using Genetic Algorithm for Big Data

Cited by: 11
Authors
Bouaguel, Waad [1]
Affiliations
[1] Univ Tunis, LARODEC, ISG, Tunis, Tunisia
Keywords
Wrapper; Feature selection; Big data; Classification; Prediction
DOI
10.1007/978-3-319-27000-5_6
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
The increased dimensionality of genomic and proteomic data produced by microarray and mass spectrometry technology makes testing and training of general classification methods difficult. Such data demand special analysis, and one common way to handle high dimensionality is to identify the most relevant features in the data. Wrapper feature selection is one of the most common and effective techniques for feature selection. Although efficient, wrapper methods are limited by the fact that their results depend on the search strategy. In theory, when a complex search is used, choosing the best subset of features may take much longer and may be impractical in some cases. Hence we propose a new wrapper feature selection method for big data based on a random search using a genetic algorithm and prior information. The new approach was tested on two biological datasets and compared with two well-known wrapper feature selection approaches; the results show that our approach gives the best performance.
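The abstract does not specify the algorithm's details, so the following is only a minimal sketch of the general idea of a genetic-algorithm wrapper seeded with prior information, not the author's method. Everything in it is an assumption made for illustration: the synthetic data from `make_data`, the nearest-centroid classifier used as the wrapped learner, the per-feature `prior` probabilities that bias the initial population, and all parameter values.

```python
import random


def make_data(n=120, n_features=12, informative=(0, 1, 2), seed=0):
    """Synthetic two-class data (hypothetical): only `informative` columns carry signal."""
    rng = random.Random(seed)
    X, y = [], []
    for i in range(n):
        label = i % 2
        row = [rng.gauss(0.0, 1.0) for _ in range(n_features)]
        for j in informative:
            row[j] += 2.0 * label  # shift class 1 on the informative features
        X.append(row)
        y.append(label)
    return X, y


def accuracy(mask, X_tr, y_tr, X_te, y_te):
    """Wrapper criterion: hold-out accuracy of a nearest-centroid classifier."""
    cols = [j for j, bit in enumerate(mask) if bit]
    if not cols:
        return 0.0  # an empty subset cannot classify anything
    centroids = {}
    for c in (0, 1):
        rows = [x for x, t in zip(X_tr, y_tr) if t == c]
        centroids[c] = [sum(r[j] for r in rows) / len(rows) for j in cols]
    hits = 0
    for x, t in zip(X_te, y_te):
        dist = {c: sum((x[j] - m) ** 2 for j, m in zip(cols, centroids[c]))
                for c in (0, 1)}
        hits += int(min(dist, key=dist.get) == t)
    return hits / len(y_te)


def ga_select(X, y, prior, pop_size=20, generations=15,
              p_mut=0.05, penalty=0.01, seed=1):
    """Search feature masks with a simple GA; `prior[j]` is the assumed
    probability that feature j is switched on in the initial population."""
    rng = random.Random(seed)
    n = len(X[0])
    split = len(X) // 2
    X_tr, y_tr, X_te, y_te = X[:split], y[:split], X[split:], y[split:]

    def fitness(mask):
        # Accuracy minus a small penalty per selected feature.
        return accuracy(mask, X_tr, y_tr, X_te, y_te) - penalty * sum(mask)

    def pick(pop):
        # Binary tournament selection.
        a, b = rng.sample(pop, 2)
        return a if fitness(a) >= fitness(b) else b

    pop = [[int(rng.random() < prior[j]) for j in range(n)]
           for _ in range(pop_size)]
    best = max(pop, key=fitness)
    for _ in range(generations):
        children = [best[:]]  # elitism: keep the best mask found so far
        while len(children) < pop_size:
            p1, p2 = pick(pop), pick(pop)
            cut = rng.randrange(1, n)  # one-point crossover
            child = p1[:cut] + p2[cut:]
            child = [bit ^ int(rng.random() < p_mut) for bit in child]  # bit-flip mutation
            children.append(child)
        pop = children
        best = max(pop, key=fitness)
    return best, fitness(best)


X, y = make_data()
prior = [0.8] * 3 + [0.2] * 9  # hypothetical prior relevance scores
mask, score = ga_select(X, y, prior)
print(mask, round(score, 3))
```

In this sketch the prior only shapes the starting population; after that, selection, crossover, and mutation are driven entirely by the wrapped classifier's hold-out accuracy. A real study would likely replace the single hold-out split with cross-validation.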
Pages: 75-83
Number of pages: 9