Identification and Analysis of Cancer Diagnosis Using Probabilistic Classification Vector Machines with Feature Selection

被引:21
|
作者
Du, Xiuquan [1 ,2 ]
Li, Xinrui [2 ]
Li, Wen [2 ]
Yan, Yuanting [1 ,2 ]
Zhang, Yanping [1 ,2 ]
机构
[1] Anhui Univ, Key Lab Intelligent Comp & Signal Proc, Minist Educ, Hefei, Anhui, Peoples R China
[2] Anhui Univ, Sch Comp Sci & Technol, Hefei 230601, Anhui, Peoples R China
基金
美国国家科学基金会;
关键词
Probabilistic classification vector; feature selection; tumor classification; DX; machine learning; kernel function; GENE; PREDICTION;
D O I
10.2174/1574893612666170405125637
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: The accurate classification of tumors types is mainly important for the treatment of cancer. With the progress of the microarray expression profile, many methods are proposed to deal with these data. However, because of the feature dimension of tumor gene expression profile is very high; many machine learning algorithms are failure. Objective & Methods: In this paper, a novel method named probabilistic classification vector machines (PCVM) with feature selection is proposed for tumor types detection using gene expression data, PCVM adopt a signed and truncated Gaussian prior to solve the problem of unstable solutions caused, and the complexity of the model can be controlled by the truncated Gaussian prior. The performance of PCVM is evaluated on two datasets by using four metrics. Results: This method achieves 84.21% accuracy and 95.24 % accuracy in the leukemia and prostrate dataset respectively. As compared to other methods, PCVM obtain much higher performance than Support Vector Machines (SVM), Naive Bayes (NB), RBF Neural Networks (RBF), K-nearest Neighbor (KNN), and Random Forest (RF) except SVM on Prostate dataset. In order to reduce computational time, we adopt a feature selection method (DX) to rank the features and search the optimal feature combination based on PCVM, PCVM with DX method (PCVM-DX) achieves 94.74% accuracy, 100% sensitivity, 85.71% specificity and 92.31% precision on the leukemia dataset. PCVM-DX method obtained the same result as PCVM on the prostate dataset. We also compare DX with other feature selection method; the result reveals that the PCVM-DX is efficient for tumor classification in terms of performance. Conclusion: PCVM-DX is observed to be better than the other methods in two data sets. The novelty of this approach lies in applying PCVM to tackle the same prior for different classes may lead to unstable solutions by RVMs and also exploring the important feature subset in the microarray expression profile with feature selection.
引用
收藏
页码:625 / 632
页数:8
相关论文
共 50 条
  • [1] Feature selection and classification of breast cancer diagnosis based on support vector machines
    Purnami, Santi Wulan
    Rahayu, S. P.
    Embong, Abdullah
    INTERNATIONAL SYMPOSIUM OF INFORMATION TECHNOLOGY 2008, VOLS 1-4, PROCEEDINGS: COGNITIVE INFORMATICS: BRIDGING NATURAL AND ARTIFICIAL KNOWLEDGE, 2008, : 500 - 505
  • [2] Radar Target Identification using Probabilistic Classification Vector Machines
    Jouny, I.
    AUTOMATIC TARGET RECOGNITION XXVI, 2016, 9844
  • [3] Probabilistic Feature Selection and Classification Vector Machine
    Jiang, Bingbing
    Li, Chang
    de Rijke, Maarten
    Yao, Xin
    Chen, Huanhuan
    ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2019, 13 (02)
  • [4] Support vector machines combined with feature selection for breast cancer diagnosis
    Akay, Mehmet Fatih
    EXPERT SYSTEMS WITH APPLICATIONS, 2009, 36 (02) : 3240 - 3247
  • [5] Feature Selection Algorithm in Classification Learning Using Support Vector Machines
    Goncharov, Yu. V.
    Muchnik, I. B.
    Shvartser, L. V.
    COMPUTATIONAL MATHEMATICS AND MATHEMATICAL PHYSICS, 2008, 48 (07) : 1243 - 1260
  • [6] Feature selection algorithm in classification learning using support vector machines
    Yu. V. Goncharov
    I. B. Muchnik
    L. V. Shvartser
    Computational Mathematics and Mathematical Physics, 2008, 48 : 1243 - 1260
  • [7] Gene selection for cancer classification using support vector machines
    Guyon, I
    Weston, J
    Barnhill, S
    Vapnik, V
    MACHINE LEARNING, 2002, 46 (1-3) : 389 - 422
  • [8] Gene Selection for Cancer Classification using Support Vector Machines
    Isabelle Guyon
    Jason Weston
    Stephen Barnhill
    Vladimir Vapnik
    Machine Learning, 2002, 46 : 389 - 422
  • [9] High dimensional data classification and feature selection using support vector machines
    Ghaddar, Bissan
    Naoum-Sawaya, Joe
    EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2018, 265 (03) : 993 - 1004
  • [10] Probabilistic Classification Vector Machines
    Chen, Huanhuan
    Tino, Peter
    Yao, Xin
    IEEE TRANSACTIONS ON NEURAL NETWORKS, 2009, 20 (06): : 901 - 914