A New Approach for Wrapper Feature Selection Using Genetic Algorithm for Big Data

Cited by: 11
Authors
Bouaguel, Waad [1]
Affiliation
[1] Univ Tunis, LARODEC, ISG, Tunis, Tunisia
Keywords
Wrapper; Feature selection; Big data; Classification; Prediction
DOI
10.1007/978-3-319-27000-5_6
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
The increased dimensionality of genomic and proteomic data produced by microarray and mass spectrometry technology makes training and testing general classification methods difficult. Such data demand special analysis, and one common way to handle high dimensionality is to identify the most relevant features in the data. Wrapper feature selection is one of the most common and effective techniques for feature selection. Although efficient, wrapper methods are limited by the fact that their results depend on the search strategy. In theory, when a complex search is used, choosing the best subset of features may take much longer and may be impractical in some cases. Hence we propose a new wrapper feature selection method for big data based on a random search using a genetic algorithm and prior information. The new approach was tested on two biological datasets and compared with two well-known wrapper feature selection approaches; the results show that our approach gives the best performance.
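The abstract does not specify the chromosome encoding, the fitness function, the classifier, or how the prior information enters the search. As a rough illustration of the general idea only, and not the author's method, the sketch below runs a genetic algorithm over boolean feature masks and scores each mask by the hold-out accuracy of a simple classifier, which is what makes it a wrapper. Everything named here is an assumption made for illustration: the nearest-centroid classifier, the `prior` argument (treated as per-feature inclusion probabilities used to seed the initial population), the size penalty in `fitness`, and the synthetic data.

```python
import random

def nearest_centroid_accuracy(X_train, y_train, X_test, y_test, mask):
    """Wrapper evaluation: hold-out accuracy of a nearest-centroid
    classifier that only sees the features selected by `mask`."""
    cols = [j for j, keep in enumerate(mask) if keep]
    if not cols:
        return 0.0  # an empty subset cannot classify anything
    centroids = {}
    for label in set(y_train):
        rows = [x for x, y in zip(X_train, y_train) if y == label]
        centroids[label] = [sum(r[j] for r in rows) / len(rows) for j in cols]

    def squared_distance(x, centroid):
        return sum((x[j] - m) ** 2 for j, m in zip(cols, centroid))

    correct = sum(
        min(centroids, key=lambda c: squared_distance(x, centroids[c])) == y
        for x, y in zip(X_test, y_test)
    )
    return correct / len(y_test)

def ga_feature_selection(fitness, n_features, prior=None, pop_size=20,
                         generations=30, mutation_rate=0.05, seed=0):
    """Search boolean feature masks with a basic genetic algorithm.
    `prior` (hypothetical) gives each feature's probability of being
    switched on in the initial population; the default is 0.5 for all."""
    rng = random.Random(seed)
    prior = prior or [0.5] * n_features
    population = [[rng.random() < p for p in prior] for _ in range(pop_size)]
    best = max(population, key=fitness)
    for _ in range(generations):
        scored = [(fitness(ind), ind) for ind in population]

        def tournament():
            # Binary tournament: the fitter of two random individuals wins.
            a, b = rng.sample(scored, 2)
            return a[1] if a[0] >= b[0] else b[1]

        children = [best[:]]  # elitism: carry the best mask forward
        while len(children) < pop_size:
            p1, p2 = tournament(), tournament()
            cut = rng.randrange(1, n_features)  # one-point crossover
            child = p1[:cut] + p2[cut:]
            # Bit-flip mutation.
            child = [(not g) if rng.random() < mutation_rate else g for g in child]
            children.append(child)
        population = children
        best = max(population, key=fitness)
    return best

def make_toy_data(n_samples, n_features, rng):
    """Synthetic data: feature 0 separates the two classes, the rest are noise."""
    X, y = [], []
    for i in range(n_samples):
        label = i % 2
        row = [rng.gauss(5.0 * label, 1.0)]
        row += [rng.gauss(0.0, 1.0) for _ in range(n_features - 1)]
        X.append(row)
        y.append(label)
    return X, y

data_rng = random.Random(1)
X_train, y_train = make_toy_data(60, 10, data_rng)
X_test, y_test = make_toy_data(40, 10, data_rng)

def fitness(mask):
    # Accuracy minus a small penalty that favours smaller subsets (an assumption).
    acc = nearest_centroid_accuracy(X_train, y_train, X_test, y_test, mask)
    return acc - 0.01 * sum(mask) / len(mask)

best_mask = ga_feature_selection(fitness, n_features=10)
print("selected features:", [j for j, keep in enumerate(best_mask) if keep])
```

Because every candidate mask requires training and evaluating the classifier, the cost of a wrapper grows with the number of masks visited; a randomized search such as this one bounds that cost by `pop_size * generations` evaluations per pass rather than exploring the subset space exhaustively.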
Pages: 75-83
Number of pages: 9
Related papers
50 in total
  • [31] A novel hybrid wrapper–filter approach based on genetic algorithm, particle swarm optimization for feature subset selection
    Fateme Moslehi
    Abdorrahman Haeri
    Journal of Ambient Intelligence and Humanized Computing, 2020, 11 : 1105 - 1127
  • [32] Wrapper feature selection with partially labeled data
    Feofanov, Vasilii
    Devijver, Emilie
    Amini, Massih-Reza
    APPLIED INTELLIGENCE, 2022, 52 (11) : 12316 - 12329
  • [33] Wrapper feature selection with partially labeled data
    Vasilii Feofanov
    Emilie Devijver
    Massih-Reza Amini
    Applied Intelligence, 2022, 52 : 12316 - 12329
  • [34] A Wrapper-based feature selection approach using Bees Algorithm for a wood defect classification system
    Packianather, Michael S.
    Kapoor, Bharat
    2015 10th System of Systems Engineering Conference (SoSE), 2015 : 498 - 503
  • [35] Incomplete Big Data Clustering Algorithm Using Feature Selection and Partial Distance
    Bu, Fanyu
    Chen, Zhikui
    Zhang, Qingchen
    Wang, Xin
    2014 5TH INTERNATIONAL CONFERENCE ON DIGITAL HOME (ICDH), 2014 : 263 - 266
  • [36] A hybrid filter/wrapper approach of feature selection using information theory
    Sebban, M
    Nock, R
    PATTERN RECOGNITION, 2002, 35 (04) : 835 - 846
  • [37] Wrapper Feature Selection based on Genetic Algorithm for Recognizing Objects from Satellite Imagery
    Hewahi, Nabil M.
    Alashqar, Eyad A.
    JOURNAL OF INFORMATION TECHNOLOGY RESEARCH, 2015, 8 (03) : 1 - 20
  • [38] Hybrid Approach of SVM and Feature Selection Based Optimization Algorithm for Big Data Security
    Duhan, Bharti
    Dhankhar, Neetu
    PROCEEDINGS OF ICETIT 2019: EMERGING TRENDS IN INFORMATION TECHNOLOGY, 2020, 605 : 694 - 706
  • [39] A novel ensemble-based wrapper method for feature selection using extreme learning machine and genetic algorithm
    Xiaowei Xue
    Min Yao
    Zhaohui Wu
    Knowledge and Information Systems, 2018, 57 : 389 - 412
  • [40] Feature selection based on rough set approach, wrapper approach, and binary whale optimization algorithm
    Tawhid, Mohamed A.
    Ibrahim, Abdelmonem M.
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2020, 11 (03) : 573 - 602