Framework for efficient feature selection in genetic algorithm based data mining

被引:78
|
作者
Sikora, Riyaz
Piramuthu, Selwyn
机构
[1] Univ Texas, Dept Informat Syst, Arlington, TX 76019 USA
[2] Univ Florida, Dept Informat & Decis Sci, Gainesville, FL 32611 USA
关键词
genetic algorithms; rule learning; knowledge discover; data mining; evolutionary algorithms;
D O I
10.1016/j.ejor.2006.02.040
中图分类号
C93 [管理学];
学科分类号
12 ; 1201 ; 1202 ; 120202 ;
摘要
We present the design of more effective and efficient genetic algorithm based data mining techniques that use the concepts of feature selection. Explicit feature selection is traditionally done as a wrapper approach where every candidate feature subset is evaluated by executing the data mining algorithm on that subset. In this article we present a GA for doing both the tasks of mining and feature selection simultaneously by evolving a binary code along side the chromosome structure used for evolving the rules. We then present a wrapper approach to feature selection based on Hausdorff distance measure. Results from applying the above techniques to a real world data mining problem show that combining both the feature selection methods provides the best performance in terms of prediction accuracy and computational efficiency. (c) 2006 Elsevier B.V. All rights reserved.
引用
收藏
页码:723 / 737
页数:15
相关论文
共 50 条
  • [21] Data mining algorithm based on feature weighting
    Qian, Zheng
    Xia, Hongxia
    JOURNAL OF COMPUTATIONAL METHODS IN SCIENCES AND ENGINEERING, 2019, 19 (S1) : S269 - S276
  • [22] Deluge based Genetic Algorithm for feature selection
    Guha, Ritam
    Ghosh, Manosij
    Kapri, Souvik
    Shaw, Sushant
    Mutsuddi, Shyok
    Bhateja, Vikrant
    Sarkar, Ram
    EVOLUTIONARY INTELLIGENCE, 2021, 14 (02) : 357 - 367
  • [23] Feature subset selection based on the genetic algorithm
    Yang, Jingwei
    Wang, Sile
    Chen, Yingyi
    Lu, Sukui
    Yang, Wenzhu
    ADVANCED TECHNOLOGIES IN MANUFACTURING, ENGINEERING AND MATERIALS, PTS 1-3, 2013, 774-776 : 1532 - +
  • [24] Deluge based Genetic Algorithm for feature selection
    Ritam Guha
    Manosij Ghosh
    Souvik Kapri
    Sushant Shaw
    Shyok Mutsuddi
    Vikrant Bhateja
    Ram Sarkar
    Evolutionary Intelligence, 2021, 14 : 357 - 367
  • [25] A Clustering Based Genetic Algorithm for Feature Selection
    Rostami, Mehrdad
    Moradi, Parham
    2014 6TH CONFERENCE ON INFORMATION AND KNOWLEDGE TECHNOLOGY (IKT), 2014, : 112 - 116
  • [26] Performance Analysis of Feature Selection Algorithm for Educational Data Mining
    Zaffar, Maryam
    Hashmani, Manzoor Ahmed
    Savita, K. S.
    2017 IEEE CONFERENCE ON BIG DATA AND ANALYTICS (ICBDA), 2017, : 7 - 12
  • [27] Stable Feature Selection with Privacy Preserving Data Mining Algorithm
    Chelvan, Mohana P.
    Perumal, K.
    ADVANCED INFORMATICS FOR COMPUTING RESEARCH, ICAICR 2017, 2017, 712 : 227 - 237
  • [28] Genetic algorithm for feature selection of EEG heterogeneous data
    Saibene, Aurora
    Gasparini, Francesca
    EXPERT SYSTEMS WITH APPLICATIONS, 2023, 217
  • [29] Feature Selection Using Genetic Algorithm for Big Data
    Saidi, Rania
    Ncir, Waad Bouaguel
    Essoussi, Nadia
    INTERNATIONAL CONFERENCE ON ADVANCED MACHINE LEARNING TECHNOLOGIES AND APPLICATIONS (AMLTA2018), 2018, 723 : 352 - 361
  • [30] A Robust and Efficient Feature Selection Algorithm for Microarray Data
    Bari, Mehrab Ghanat
    Salekin, Sirajul
    Zhang, Jianqiu
    MOLECULAR INFORMATICS, 2017, 36 (04)