Hybrid feature selection using micro genetic algorithm on microarray gene expression data

被引:11
|
作者
Pragadeesh, C. [1 ]
Jeyaraj, Rohana [1 ]
Siranjeevi, K. [1 ]
Abishek, R. [1 ]
Jeyakumar, G. [1 ]
机构
[1] Amrita Vishwa Vidyapeetham, Amrita Sch Engn, Dept Comp Sci & Engn, Coimbatore, Tamil Nadu, India
关键词
Genetic algorithm; feature selection; microarray; hybrid methods; classification;
D O I
10.3233/JIFS-169935
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Research has proved that DNA Microarray data containing gene expression profiles are potentially excellent diagnostic tools in the medical industry. A persistent problem with regard to accessible microarray datasets is that the number of samples are much lesser than the number of features that are present. Thus, in order to extract accurate information from the dataset, one must use a robust technique. Feature selection (FS) has proved to be an effective way by which irrelevant and noisy data can be discarded. In FS, relevant features are picked, and result in commendable classification accuracy. This paper proposes a model that employs a compounded / hybrid feature selection technique (Filter + Wrapper) to classify microarray cancer data. Initially, a filter method called Information Gain (IG) to eliminate redundant features that will not contribute significantly to the final classification is used. Following to that, an evolutionary computing technique (micro Genetic Algorithm (mGA)) to find the best minimal subset of required features is employed. Then the features are classified using a traditional Support Vector Classifier and also cross validated to obtain high classification accuracy, using a minimal number of features. The complexity of the model is reduced significantly by adding mGA, as opposed to already existing models that use various other feature selection algorithms.
引用
收藏
页码:2241 / 2246
页数:6
相关论文
共 50 条
  • [41] An Integrated Feature Selection Algorithm for Cancer Classification using Gene Expression Data
    Ahmed, Saeed
    Kabir, Muhammad
    Ali, Zakir
    Arif, Muhammad
    Ali, Farman
    Yu, Dong-Jun
    COMBINATORIAL CHEMISTRY & HIGH THROUGHPUT SCREENING, 2018, 21 (09) : 631 - 645
  • [42] Hybrid Binary Imperialist Competition Algorithm and Tabu Search Approach for Feature Selection Using Gene Expression Data
    Wang, Shuaiqun
    Aorigele
    Kong, Wei
    Zeng, Weiming
    Hong, Xiaomin
    BIOMED RESEARCH INTERNATIONAL, 2016, 2016
  • [43] A Hybrid Discrete Imperialist Competition Algorithm for Gene Selection for Microarray Data
    Aorigele
    Tang, Zheng
    Todo, Yuki
    Gao, Shangce
    CURRENT PROTEOMICS, 2018, 15 (02) : 99 - 110
  • [44] Feature selection and ranking of key genes for tumor classification: Using microarray gene expression data
    Mukkamala, Srinivas
    Liu, Qingzhong
    Veeraghattam, Rajeev
    Sung, Andrew H.
    ARTIFICIAL INTELLIGENCE AND SOFT COMPUTING - ICAISC 2006, PROCEEDINGS, 2006, 4029 : 951 - 961
  • [45] Hybrid GA-IBPSO for feature selection using microarray data
    Yang, Cheng-San
    Chuang, Li-Yeh
    Ho, Chang-Hsuan
    Yang, Cheng-Hong
    IMECS 2007: INTERNATIONAL MULTICONFERENCE OF ENGINEERS AND COMPUTER SCIENTISTS, VOLS I AND II, 2007, : 284 - +
  • [46] Hybridization of Genetic and Quantum Algorithm for Gene Selection and Classification of Microarray Data
    Abderrahim, Allani
    Talbi, El-Ghazali
    Khaled, Mellouli
    2009 IEEE INTERNATIONAL SYMPOSIUM ON PARALLEL & DISTRIBUTED PROCESSING, VOLS 1-5, 2009, : 2226 - +
  • [47] Gene Selection for Microarray Data by a LDA-Based Genetic Algorithm
    Huerta, Edmundo Bonilla
    Duval, Beatrice
    Hao, Jin-Kao
    PATTERN RECOGNITION IN BIOINFORMATICS, PROCEEDINGS, 2008, 5265 : 250 - 261
  • [48] HYBRIDIZATION OF GENETIC AND QUANTUM ALGORITHM FOR GENE SELECTION AND CLASSIFICATION OF MICROARRAY DATA
    Abderrahim, Allani
    Talbi, El-Ghazali
    Khaled, Mellouli
    INTERNATIONAL JOURNAL OF FOUNDATIONS OF COMPUTER SCIENCE, 2012, 23 (02) : 431 - 444
  • [49] Feature Selection Using Genetic Algorithm for Big Data
    Saidi, Rania
    Ncir, Waad Bouaguel
    Essoussi, Nadia
    INTERNATIONAL CONFERENCE ON ADVANCED MACHINE LEARNING TECHNOLOGIES AND APPLICATIONS (AMLTA2018), 2018, 723 : 352 - 361
  • [50] Feature selection of intrusion detection data using a hybrid genetic algorithm/KNN approach
    Middlemiss, M
    Dick, G
    DESIGN AND APPLICATION OF HYBRID INTELLIGENT SYSTEMS, 2003, 104 : 519 - 527