A kernel-based clustering method for gene selection with gene expression data

被引:48
|
作者
Chen, Huihui [1 ]
Zhang, Yusen [1 ]
Gutman, Ivan [2 ]
机构
[1] Shandong Univ Weihai, Sch Math & Stat, Weihai 264209, Peoples R China
[2] Univ Kragujevac, Fac Sci, POB 60, Kragujevac 34000, Serbia
关键词
Gene expression data; Kernel-based clustering; Adaptive distance; Gene selection; Cancer classification; CANCER CLASSIFICATION; PREDICTION; ALGORITHM; DISCOVERY;
D O I
10.1016/j.jbi.2016.05.007
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Gene selection is important for cancer classification based on gene expression data, because of high dimensionality and small sample size. In this paper, we present a new gene selection method based on clustering, in which dissimilarity measures are obtained through kernel functions. It searches for best weights of genes iteratively at the same time to optimize the clustering objective function. Adaptive distance is used in the process, which is suitable to learn the weights of genes during the clustering process, improving the performance of the algorithm. The proposed algorithm is simple and does not require any modification or parameter optimization for each dataset. We tested it on eight publicly available datasets, using two classifiers (support vector machine, k-nearest neighbor), compared with other six competitive feature selectors. The results show that the proposed algorithm is capable of achieving better accuracies and may be an efficient tool for finding possible biomarkers from gene expression data. (C) 2016 Elsevier Inc. All rights reserved.
引用
收藏
页码:12 / 20
页数:9
相关论文
共 50 条
  • [41] Gene selection and classification using non-linear kernel support vector machines based on gene expression data
    Zhang Qizhong
    2007 IEEE/ICME INTERNATIONAL CONFERENCE ON COMPLEX MEDICAL ENGINEERING, VOLS 1-4, 2007, : 1606 - 1611
  • [42] FEED: a feature selection method based on gene expression decomposition for single cell clustering
    Zhang, Chao
    Duan, Zhi-Wei
    Xu, Yun-Pei
    Liu, Jin
    Li, Hong-Dong
    BRIEFINGS IN BIOINFORMATICS, 2023, 24 (06)
  • [43] Performance of kernel-based fuzzy clustering
    Graves, D.
    Pedrycz, W.
    ELECTRONICS LETTERS, 2007, 43 (25) : 1445 - 1446
  • [44] Kernel-based clustering via Isolation Distributional Kernel
    Zhu, Ye
    Ting, Kai Ming
    INFORMATION SYSTEMS, 2023, 117
  • [45] Performance Assessment of Kernel-Based Clustering
    Tushir, Meena
    Srivastava, Smriti
    COMPUTATIONAL INTELLIGENCE, CYBER SECURITY AND COMPUTATIONAL MODELS, 2014, 246 : 139 - 145
  • [46] Fuzzy Rule Based Clustering for Gene Expression Data
    Sinaee, Mehrnoosh
    Mansoori, Eghbal G.
    FOURTH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS, MODELLING AND SIMULATION (ISMS 2013), 2013, : 7 - 11
  • [47] Kernel-Based Persian Viseme Clustering
    Dehshibi, Mohammad Mahdi
    Alavi, Meysam
    Shanbehzadeh, Jamshid
    2013 13TH INTERNATIONAL CONFERENCE ON HYBRID INTELLIGENT SYSTEMS (HIS), 2013, : 129 - 133
  • [48] A kernel-based fuzzy clustering algorithm
    Wang, Jiun-Hau
    Lee, Wan-Jui
    Lee, Shie-Jue
    ICICIC 2006: FIRST INTERNATIONAL CONFERENCE ON INNOVATIVE COMPUTING, INFORMATION AND CONTROL, VOL 1, PROCEEDINGS, 2006, : 550 - +
  • [49] Clustering of Gene Expression Data Based on Shape Similarity
    Hestilow, Travis J.
    Huang, Yufei
    EURASIP JOURNAL ON BIOINFORMATICS AND SYSTEMS BIOLOGY, 2009, (01)
  • [50] DYNAMIC CORE BASED CLUSTERING OF GENE EXPRESSION DATA
    Bocicor, Maria-Iuliana
    Sirbu, Adela
    Czibula, Gabriela
    INTERNATIONAL JOURNAL OF INNOVATIVE COMPUTING INFORMATION AND CONTROL, 2014, 10 (03): : 1051 - 1069