A kernel-based clustering method for gene selection with gene expression data

被引:48
|
作者
Chen, Huihui [1 ]
Zhang, Yusen [1 ]
Gutman, Ivan [2 ]
机构
[1] Shandong Univ Weihai, Sch Math & Stat, Weihai 264209, Peoples R China
[2] Univ Kragujevac, Fac Sci, POB 60, Kragujevac 34000, Serbia
关键词
Gene expression data; Kernel-based clustering; Adaptive distance; Gene selection; Cancer classification; CANCER CLASSIFICATION; PREDICTION; ALGORITHM; DISCOVERY;
D O I
10.1016/j.jbi.2016.05.007
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Gene selection is important for cancer classification based on gene expression data, because of high dimensionality and small sample size. In this paper, we present a new gene selection method based on clustering, in which dissimilarity measures are obtained through kernel functions. It searches for best weights of genes iteratively at the same time to optimize the clustering objective function. Adaptive distance is used in the process, which is suitable to learn the weights of genes during the clustering process, improving the performance of the algorithm. The proposed algorithm is simple and does not require any modification or parameter optimization for each dataset. We tested it on eight publicly available datasets, using two classifiers (support vector machine, k-nearest neighbor), compared with other six competitive feature selectors. The results show that the proposed algorithm is capable of achieving better accuracies and may be an efficient tool for finding possible biomarkers from gene expression data. (C) 2016 Elsevier Inc. All rights reserved.
引用
收藏
页码:12 / 20
页数:9
相关论文
共 50 条
  • [21] Kernel-based deterministic annealing algorithm for data clustering
    Yang, X. L.
    Song, Q.
    Zhang, W. B.
    IEE PROCEEDINGS-VISION IMAGE AND SIGNAL PROCESSING, 2006, 153 (05): : 557 - 568
  • [22] Scuba: scalable kernel-based gene prioritization
    Guido Zampieri
    Dinh Van Tran
    Michele Donini
    Nicolò Navarin
    Fabio Aiolli
    Alessandro Sperduti
    Giorgio Valle
    BMC Bioinformatics, 19
  • [23] Gene-Ontology-based clustering of gene expression data
    Adryan, B
    Schuh, R
    BIOINFORMATICS, 2004, 20 (16) : 2851 - 2852
  • [24] Scuba: scalable kernel-based gene prioritization
    Zampieri, Guido
    Dinh Van Tran
    Donini, Michele
    Navarin, Nicolo
    Aiolli, Fabio
    Sperduti, Alessandro
    Valle, Giorgio
    BMC BIOINFORMATICS, 2018, 19
  • [25] An Ensemble Filtering and Supervised Clustering based Informative Gene Selection Algorithm in Microarray Gene Expression Data
    Bose, Shilpi
    Das, Chandra
    Banerjee, Abhik
    Chattopadhyay, Matangini
    Chattopadhyay, Samiran
    2020 4TH INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND NETWORKS (CINE 2020), 2020,
  • [26] A temporal precedence based clustering method for gene expression microarray data
    Ritesh Krishna
    Chang-Tsun Li
    Vicky Buchanan-Wollaston
    BMC Bioinformatics, 11
  • [27] A temporal precedence based clustering method for gene expression microarray data
    Krishna, Ritesh
    Li, Chang-Tsun
    Buchanan-Wollaston, Vicky
    BMC BIOINFORMATICS, 2010, 11
  • [28] An effective fuzzy kernel clustering analysis approach for gene expression data
    Sun, Lin
    Xu, Jiucheng
    Yin, Jiaojiao
    BIO-MEDICAL MATERIALS AND ENGINEERING, 2015, 26 : S1863 - S1869
  • [29] Performance assessment of kernel density clustering for gene expression profile data
    Shu, GP
    Zeng, BY
    Chen, YPP
    Smith, OH
    COMPARATIVE AND FUNCTIONAL GENOMICS, 2003, 4 (03): : 287 - 299
  • [30] Online kernel-based clustering
    Alam, Abrar
    Malhotra, Akshay
    Schizas, Ioannis D.
    PATTERN RECOGNITION, 2025, 158