Gene selection for enhanced classification on microarray data using a weighted k-NN based algorithm

Cited by: 6
Authors
Ventura-Molina, Elias [1]
Alarcon-Paredes, Antonio [2]
Aldape-Perez, Mario [3]
Yanez-Marquez, Cornelio [1]
Adolfo Alonso, Gustavo [2]
Affiliations
[1] Inst Politecn Nacl, Ctr Invest Computac, Av Juan de Dios Batiz, Ciudad De Mexico 07738, Mexico
[2] Univ Autonoma Guerrero, Fac Ingn, Av Lazaro Cardenas S-N,Ciudad Univ Zona Sur, Chilpancingo Guerrero 39087, Mexico
[3] Inst Politecn Nacl, Ctr Innovac & Desarrollo Tecnol Computo, Av Juan de Dios Batiz, Ciudad De Mexico 07700, Mexico
Keywords
Computational genomics; microarray data analysis; feature selection; feature ranking; feature weighting; k-nearest neighbors; NEAREST-NEIGHBOR; HARMONY SEARCH; CANCER; FEATURES; RANKING; KNN;
DOI
10.3233/IDA-173720
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Feature selection is a common approach in microarray analysis. Previous approaches select features either with classical statistical tests that can be tuned with a classifier, or with regularization penalties incorporated in the cost function. Here we propose a feature ranking and weighting scheme instead, which combines statistical techniques with a weighted k-NN classifier using a modified forward selection procedure. We demonstrate that the classification accuracy of our proposal outperforms existing methods on a range of public microarray gene expression datasets. The proposed method is also compared to state-of-the-art feature selection algorithms by means of the Friedman test. Although many feature selection techniques have been used for genomic data, the experimental results show the classification superiority of our method on most of the gene expression datasets considered.
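The general idea in the abstract (rank features with a statistic, reuse the scores as distance weights in k-NN, and grow the gene subset by forward selection) can be illustrated with a minimal sketch. This is not the authors' algorithm: the t-like ranking statistic, the leave-one-out criterion, the rule of keeping a feature only if accuracy improves, the synthetic data, and all function names are assumptions made for illustration.

```python
import numpy as np

def rank_features(X, y):
    """Score each feature with a two-class t-like statistic (assumed ranking criterion)."""
    a, b = X[y == 0], X[y == 1]
    num = np.abs(a.mean(axis=0) - b.mean(axis=0))
    den = np.sqrt(a.var(axis=0, ddof=1) / len(a) + b.var(axis=0, ddof=1) / len(b)) + 1e-12
    return num / den

def weighted_knn_loo_accuracy(X, y, weights, k=3):
    """Leave-one-out accuracy of binary k-NN under a feature-weighted Euclidean distance."""
    Xw = X * np.sqrt(weights)                      # scaling columns weights the squared distance
    d = ((Xw[:, None, :] - Xw[None, :, :]) ** 2).sum(axis=2)
    np.fill_diagonal(d, np.inf)                    # a sample may not vote for itself
    nn = np.argsort(d, axis=1)[:, :k]              # indices of the k nearest neighbours
    pred = (y[nn].mean(axis=1) > 0.5).astype(int)  # majority vote (k odd, labels 0/1)
    return float((pred == y).mean())

def forward_select(X, y, max_features=10, k=3):
    """Visit features in rank order; keep one only if it improves LOO accuracy (assumed rule)."""
    scores = rank_features(X, y)
    order = np.argsort(scores)[::-1]
    selected, best = [], 0.0
    for j in order[:max_features]:
        trial = selected + [int(j)]
        acc = weighted_knn_loo_accuracy(X[:, trial], y, scores[trial], k)
        if acc > best:
            selected, best = trial, acc
    return selected, best

# Synthetic stand-in for a microarray matrix: 40 samples, 200 "genes",
# of which only the first three are informative (hypothetical data).
rng = np.random.default_rng(0)
y = np.repeat([0, 1], 20)
X = rng.normal(size=(40, 200))
X[y == 1, :3] += 4.0

selected, best = forward_select(X, y)
print("selected features:", selected, "LOO accuracy:", best)
```

Because the ranking and the selection both use the full dataset here, the reported accuracy is optimistic; an unbiased estimate would repeat the whole procedure inside each cross-validation fold.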
Pages: 241-253
Page count: 13
Related papers
50 in total
  • [1] Active Learning Using Fuzzy k-NN for Cancer Classification from Microarray Gene Expression Data
    Halder, Anindya
    Dey, Samrat
    Kumar, Ansuman
    ADVANCES IN COMMUNICATION AND COMPUTING, 2015, 347 : 103 - 113
  • [2] Sentiment Classification with PSO Based Weighted K-NN
    Aydin, Ilhan
    Baskaya, Fatma
    Salur, Mehmet Umut
    2017 INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND ENGINEERING (UBMK), 2017, : 739 - 744
  • [3] Evaluation of normalization methods for cDNA microarray data by k-NN classification
    Wei Wu
    Eric P Xing
    Connie Myers
    I Saira Mian
    Mina J Bissell
    BMC Bioinformatics, 6
  • [4] A New Method For Selection Optimum k Value In k-NN Classification Algorithm
    Maleki, Masoud
    Eroglu, Kubra
    Aydemir, Onder
    Manshoori, Negin
    Kayikcioglu, Temel
    2013 21ST SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2013,
  • [5] An optimally weighted fuzzy k-NN algorithm
    Pham, TD
    PATTERN RECOGNITION AND DATA MINING, PT 1, PROCEEDINGS, 2005, 3686 : 239 - 247
  • [6] Semi-supervised fuzzy K-NN for cancer classification from microarray gene expression data
    Halder, Anindya
    Misra, Subhashis
    2014 FIRST INTERNATIONAL CONFERENCE ON AUTOMATION, CONTROL, ENERGY & SYSTEMS (ACES-14), 2014, : 266 - 270
  • [7] Feature Selection by Using DE Algorithm and k-NN Classifier
    Senel, Fatih Ahmet
    Yuksel, Asim Sinan
    Yigit, Tuncay
    ARTIFICIAL INTELLIGENCE AND APPLIED MATHEMATICS IN ENGINEERING PROBLEMS, 2020, 43 : 886 - 893
  • [8] Medical Dataset Classification Using k-NN and Genetic Algorithm
    Kumar, Santosh
    Sahoo, G.
    COMPUTATIONAL INTELLIGENCE IN DATA MINING, CIDM 2016, 2017, 556 : 813 - 823
  • [9] Semantic-k-NN algorithm: An enhanced version of traditional k-NN algorithm
    Ali, Munwar
    Jung, Low Tang
    Abdel-Aty, Abdel-Haleem
    Abubakar, Mustapha Y.
    Elhoseny, Mohamed
    Ali, Irfan
    EXPERT SYSTEMS WITH APPLICATIONS, 2020, 151
  • [10] Two-phase EA/k-NN for feature selection and classification in cancer microarray datasets
    Juliusdottir, T
    Keedwell, E
    Corne, D
    Narayanan, A
    Proceedings of the 2005 IEEE Symposium on Computational Intelligence in Bioinformatics and Computational Biology, 2005, : 1 - 8