Framework for efficient feature selection in genetic algorithm based data mining

被引:78
|
作者
Sikora, Riyaz
Piramuthu, Selwyn
机构
[1] Univ Texas, Dept Informat Syst, Arlington, TX 76019 USA
[2] Univ Florida, Dept Informat & Decis Sci, Gainesville, FL 32611 USA
关键词
genetic algorithms; rule learning; knowledge discover; data mining; evolutionary algorithms;
D O I
10.1016/j.ejor.2006.02.040
中图分类号
C93 [管理学];
学科分类号
12 ; 1201 ; 1202 ; 120202 ;
摘要
We present the design of more effective and efficient genetic algorithm based data mining techniques that use the concepts of feature selection. Explicit feature selection is traditionally done as a wrapper approach where every candidate feature subset is evaluated by executing the data mining algorithm on that subset. In this article we present a GA for doing both the tasks of mining and feature selection simultaneously by evolving a binary code along side the chromosome structure used for evolving the rules. We then present a wrapper approach to feature selection based on Hausdorff distance measure. Results from applying the above techniques to a real world data mining problem show that combining both the feature selection methods provides the best performance in terms of prediction accuracy and computational efficiency. (c) 2006 Elsevier B.V. All rights reserved.
引用
收藏
页码:723 / 737
页数:15
相关论文
共 50 条
  • [41] Feature Subset Selection within a Simulated Annealing Data Mining Algorithm
    Debuse J.C.W.
    Rayward-Smith V.J.
    Journal of Intelligent Information Systems, 1997, 9 (1) : 57 - 81
  • [42] Design and Implementation of ACO Feature Selection Algorithm for Data Stream Mining
    Harde, Shivani
    Sahare, Vaishali
    2016 INTERNATIONAL CONFERENCE ON AUTOMATIC CONTROL AND DYNAMIC OPTIMIZATION TECHNIQUES (ICACDOT), 2016, : 1047 - 1051
  • [43] An Efficient Heap Based Optimizer Algorithm for Feature Selection
    Ali, Mona A. S.
    Rajeena, Fathimathul P. P.
    Abd Elminaam, Diaa Salama
    MATHEMATICS, 2022, 10 (14)
  • [44] An Efficient Feature Selection Algorithm Based on Kernel Function
    Zhang, Shanwen
    Liu, Meiling
    Li, Jiaye
    Zhang, Leyuan
    2019 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (IEEE SSCI 2019), 2019, : 2653 - 2659
  • [45] Research on the Application of Random Forest-based Feature Selection Algorithm in Data Mining Experiments
    Wang, Huan
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2023, 14 (10) : 505 - 518
  • [46] Feature selection based-on genetic algorithm for CBIR
    Zhao, Tianzhong
    Lu, Jianjiang
    Zhang, Yafei
    Xiao, Qi
    CISP 2008: FIRST INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING, VOL 2, PROCEEDINGS, 2008, : 495 - 499
  • [47] Feature Selection for Image Retrieval based on Genetic Algorithm
    Kushwaha, Preeti
    Welekar, R. R.
    INTERNATIONAL JOURNAL OF INTERACTIVE MULTIMEDIA AND ARTIFICIAL INTELLIGENCE, 2016, 4 (02): : 16 - 21
  • [48] GENETIC ALGORITHM BASED FEATURE SELECTION FOR PARAPHRASE RECOGNITION
    Chitra, A.
    Rajkumar, Anupriya
    INTERNATIONAL JOURNAL ON ARTIFICIAL INTELLIGENCE TOOLS, 2013, 22 (02)
  • [49] A Novel Scalable and Data Efficient Feature Subset Selection Algorithm
    de Morais, Sergio Rodrigues
    Aussem, Alex
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, PART II, PROCEEDINGS, 2008, 5212 : 298 - +
  • [50] Hybrid feature selection based on improved genetic algorithm
    Hu, B. (hubin@njau.edu.cn), 1725, Universitas Ahmad Dahlan (11):