Framework for efficient feature selection in genetic algorithm based data mining

被引:78
|
作者
Sikora, Riyaz
Piramuthu, Selwyn
机构
[1] Univ Texas, Dept Informat Syst, Arlington, TX 76019 USA
[2] Univ Florida, Dept Informat & Decis Sci, Gainesville, FL 32611 USA
关键词
genetic algorithms; rule learning; knowledge discover; data mining; evolutionary algorithms;
D O I
10.1016/j.ejor.2006.02.040
中图分类号
C93 [管理学];
学科分类号
12 ; 1201 ; 1202 ; 120202 ;
摘要
We present the design of more effective and efficient genetic algorithm based data mining techniques that use the concepts of feature selection. Explicit feature selection is traditionally done as a wrapper approach where every candidate feature subset is evaluated by executing the data mining algorithm on that subset. In this article we present a GA for doing both the tasks of mining and feature selection simultaneously by evolving a binary code along side the chromosome structure used for evolving the rules. We then present a wrapper approach to feature selection based on Hausdorff distance measure. Results from applying the above techniques to a real world data mining problem show that combining both the feature selection methods provides the best performance in terms of prediction accuracy and computational efficiency. (c) 2006 Elsevier B.V. All rights reserved.
引用
收藏
页码:723 / 737
页数:15
相关论文
共 50 条
  • [1] Framework for Efficient Letter Selection in Genetic Algorithm Based Data Mining
    Chen, Xiaoyan
    Zheng, Shijue
    Tao, Tao
    DCABES 2008 PROCEEDINGS, VOLS I AND II, 2008, : 334 - +
  • [2] Efficient genetic algorithm based data mining using feature selection with Hausdorff distance
    Sikora R.
    Piramuthu S.
    Information Technology and Management, 2005, 6 (4) : 315 - 331
  • [3] Efficient Genetic-Wrapper Algorithm Based Data Mining for Feature Subset Selection in a Power Quality Pattern Recognition Application
    Krishna, Brahmadesam
    Kaliaperumal, Baskaran
    INTERNATIONAL ARAB JOURNAL OF INFORMATION TECHNOLOGY, 2011, 8 (04) : 397 - 405
  • [4] Hybrid Efficient Genetic Algorithm for Big Data Feature Selection Problems
    Mohammed, Tareq Abed
    Bayat, Oguz
    Ucan, Osman N.
    Alhayali, Shaymaa
    FOUNDATIONS OF SCIENCE, 2020, 25 (04) : 1009 - 1025
  • [5] Hybrid Efficient Genetic Algorithm for Big Data Feature Selection Problems
    Tareq Abed Mohammed
    Oguz Bayat
    Osman N. Uçan
    Shaymaa Alhayali
    Foundations of Science, 2020, 25 : 1009 - 1025
  • [6] Data mining and genetic algorithm based gene/SNP selection
    Shah, SC
    Kusiak, A
    ARTIFICIAL INTELLIGENCE IN MEDICINE, 2004, 31 (03) : 183 - 196
  • [7] Genetic Algorithm Based Feature Selection Technique for Electroencephalography Data
    Ali, Tariq
    Nawaz, Asif
    Sadia, Hafiza Ayesha
    APPLIED COMPUTER SYSTEMS, 2019, 24 (02) : 119 - 127
  • [8] Genetic Algorithm Based Feature Selection for Mass Spectrometry Data
    Li, Yifeng
    Liu, Yihui
    Bai, Li
    8TH IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOENGINEERING, VOLS 1 AND 2, 2008, : 85 - 90
  • [9] An Efficient Framework for Heart Disease Classification using Feature Extraction and Feature Selection Technique in Data Mining
    Kavitha, R.
    Kannan, E.
    FIRST INTERNATIONAL CONFERENCE ON EMERGING TRENDS IN ENGINEERING, TECHNOLOGY AND SCIENCE - ICETETS 2016, 2016,
  • [10] An efficient feature selection algorithm for hybrid data
    Wang, Feng
    Liang, Jiye
    NEUROCOMPUTING, 2016, 193 : 33 - 41