Gene selection for enhanced classification on microarray data using a weighted k-NN based algorithm

被引:6
|
作者
Ventura-Molina, Elias [1 ]
Alarcon-Paredes, Antonio [2 ]
Aldape-Perez, Mario [3 ]
Yanez-Marquez, Cornelio [1 ]
Adolfo Alonso, Gustavo [2 ]
机构
[1] Inst Politecn Nacl, Ctr Invest Computac, Av Juan de Dios Batiz, Ciudad De Mexico 07738, Mexico
[2] Univ Autonoma Guerrero, Fac Ingn, Av Lazaro Cardenas S-N,Ciudad Univ Zona Sur, Chilpancingo Guerrero 39087, Mexico
[3] Inst Politecn Nacl, Ctr Innovac & Desarrollo Tecnol Computo, Av Juan de Dios Batiz, Ciudad De Mexico 07700, Mexico
关键词
Computational genomics; microarray data analysis; feature selection; feature ranking; feature weighting; k-nearest neighbors; NEAREST-NEIGHBOR; HARMONY SEARCH; CANCER; FEATURES; RANKING; KNN;
D O I
10.3233/IDA-173720
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Feature selection is a common solution to microarray analysis. Previous approaches either select features based on classical statistical tests that can be tuned up with a classifier, or using regularization penalties incorporated in the cost function. Here we propose to use a feature ranking and weighting scheme instead, which combines statistical techniques with a weighted k-NN classifier using a modified forward selection procedure. We demonstrate that classification accuracy of our proposal outperforms existing methods on a range of public microarray gene expression datasets. The proposed method is also compared to state-of-the-art feature selection algorithms by means of the Friedman test. Although a bunch of feature selection techniques has been used for genomic data, the experimental results show the classification superiority of our method on most of the present gene expression datasets.
引用
收藏
页码:241 / 253
页数:13
相关论文
共 50 条
  • [31] Protein subcellular location prediction using optimally weighted fuzzy k-NN algorithm
    Nasibov, Efendi
    Kandemir-Cavas, Cagin
    COMPUTATIONAL BIOLOGY AND CHEMISTRY, 2008, 32 (06) : 448 - 451
  • [32] A Modified K-NN Algorithm for Holter Waveform Classification Based on Kernel Function
    Zheng, Gang
    Cao, Guochao
    FIFTH INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY, VOL 2, PROCEEDINGS, 2008, : 343 - 346
  • [33] Adaptive K-NN metric classification based on improved Kepler optimization algorithm
    Cai, Liang
    Zhao, Shijie
    Meng, Fanshuai
    Zhang, Tianran
    JOURNAL OF SUPERCOMPUTING, 2025, 81 (01):
  • [34] Fault detection and classification in smart grids using augmented K-NN algorithm
    Hosseinzadeh, Javad
    Masoodzadeh, Farokh
    Roshandel, Emad
    SN APPLIED SCIENCES, 2019, 1 (12):
  • [35] Efficient Selection Algorithm for Fast k-NN Search on GPUs
    Tang, Xiaoxin
    Huang, Zhiyi
    Eyers, David
    Mills, Steven
    Guo, Minyi
    2015 IEEE 29TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM (IPDPS), 2015, : 397 - 406
  • [36] k-NN for the Classification of Human Cancer Samples Using the Gene Expression Profiles
    Martin-Merino, Manuel
    ADVANCES IN COMPUTATIONAL BIOLOGY, 2010, 680 : 157 - 164
  • [37] Travel time estimation method using scats traffic data based on k-NN algorithm
    Jiang, G. (jianggy@jlu.edu.cn), 1600, Science Press (48):
  • [38] Classification of stock index movement using k-nearest neighbours (k-NN) algorithm
    Subha, M.V.
    Nambi, S. Thirupparkadal
    WSEAS Transactions on Information Science and Applications, 2012, 9 (09): : 261 - 270
  • [39] Modified K-NN algorithm for classification problems with improved accuracy
    Sahu S.K.
    Kumar P.
    Singh A.P.
    International Journal of Information Technology, 2018, 10 (1) : 65 - 70
  • [40] Predicting the number of nearest neighbors for the k-NN classification algorithm
    Zhang, Xueying
    Song, Qinbao
    INTELLIGENT DATA ANALYSIS, 2014, 18 (03) : 449 - 464