Identification and Analysis of Cancer Diagnosis Using Probabilistic Classification Vector Machines with Feature Selection

被引:21
|
作者
Du, Xiuquan [1 ,2 ]
Li, Xinrui [2 ]
Li, Wen [2 ]
Yan, Yuanting [1 ,2 ]
Zhang, Yanping [1 ,2 ]
机构
[1] Anhui Univ, Key Lab Intelligent Comp & Signal Proc, Minist Educ, Hefei, Anhui, Peoples R China
[2] Anhui Univ, Sch Comp Sci & Technol, Hefei 230601, Anhui, Peoples R China
基金
美国国家科学基金会;
关键词
Probabilistic classification vector; feature selection; tumor classification; DX; machine learning; kernel function; GENE; PREDICTION;
D O I
10.2174/1574893612666170405125637
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: The accurate classification of tumors types is mainly important for the treatment of cancer. With the progress of the microarray expression profile, many methods are proposed to deal with these data. However, because of the feature dimension of tumor gene expression profile is very high; many machine learning algorithms are failure. Objective & Methods: In this paper, a novel method named probabilistic classification vector machines (PCVM) with feature selection is proposed for tumor types detection using gene expression data, PCVM adopt a signed and truncated Gaussian prior to solve the problem of unstable solutions caused, and the complexity of the model can be controlled by the truncated Gaussian prior. The performance of PCVM is evaluated on two datasets by using four metrics. Results: This method achieves 84.21% accuracy and 95.24 % accuracy in the leukemia and prostrate dataset respectively. As compared to other methods, PCVM obtain much higher performance than Support Vector Machines (SVM), Naive Bayes (NB), RBF Neural Networks (RBF), K-nearest Neighbor (KNN), and Random Forest (RF) except SVM on Prostate dataset. In order to reduce computational time, we adopt a feature selection method (DX) to rank the features and search the optimal feature combination based on PCVM, PCVM with DX method (PCVM-DX) achieves 94.74% accuracy, 100% sensitivity, 85.71% specificity and 92.31% precision on the leukemia dataset. PCVM-DX method obtained the same result as PCVM on the prostate dataset. We also compare DX with other feature selection method; the result reveals that the PCVM-DX is efficient for tumor classification in terms of performance. Conclusion: PCVM-DX is observed to be better than the other methods in two data sets. The novelty of this approach lies in applying PCVM to tackle the same prior for different classes may lead to unstable solutions by RVMs and also exploring the important feature subset in the microarray expression profile with feature selection.
引用
收藏
页码:625 / 632
页数:8
相关论文
共 50 条
  • [41] Predicting Protein in Cancer Diagnosis using Effective Classification and Feature Selection Technique
    Lobo, Sophia
    Pallavi, M. S.
    PROCEEDINGS OF THE 2018 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATION AND SIGNAL PROCESSING (ICCSP), 2018, : 156 - 159
  • [42] Speech segmentation using probabilistic phonetic feature hierarchy and support vector machines
    Juneja, A
    Espy-Wilson, C
    PROCEEDINGS OF THE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS 2003, VOLS 1-4, 2003, : 675 - 679
  • [43] Feature Selection for Cancer Classification Based on Support Vector Machine
    Luo, Wei
    Wang, Lipo
    Sun, Jingjing
    PROCEEDINGS OF THE 2009 WRI GLOBAL CONGRESS ON INTELLIGENT SYSTEMS, VOL IV, 2009, : 422 - +
  • [44] Hyperspectral Imagery Classification Based on Probabilistic Classification Vector Machines
    Xue, Zhixiang
    Yu, Xuchu
    Fu, Qiongying
    Wei, Xiangpo
    Liu, Bing
    EIGHTH INTERNATIONAL CONFERENCE ON DIGITAL IMAGE PROCESSING (ICDIP 2016), 2016, 10033
  • [45] Feature selection for bagging of support vector machines
    Li, Guo-Zheng
    Liu, Tian-Yu
    PRICAI 2006: TRENDS IN ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2006, 4099 : 271 - 277
  • [46] Feature selection for multiclass support vector machines
    Aazi, F. Z.
    Abdesselam, R.
    Achchab, B.
    Elouardighi, A.
    AI COMMUNICATIONS, 2016, 29 (05) : 583 - 593
  • [47] Stable Feature Selection with Support Vector Machines
    Kamkar, Iman
    Gupta, Sunil Kumar
    Dinh Phung
    Venkatesh, Svetha
    AI 2015: ADVANCES IN ARTIFICIAL INTELLIGENCE, 2015, 9457 : 298 - 308
  • [48] Optimal feature selection for support vector machines
    Nguyen, Minh Hoai
    de la Torre, Fernando
    PATTERN RECOGNITION, 2010, 43 (03) : 584 - 591
  • [49] Feature selection using random probes and linear support vector machines
    Chi, Hoi-Ming
    Ersoy, Okan K.
    Moskowitz, Herbert
    2005 ICSC CONGRESS ON COMPUTATIONAL INTELLIGENCE METHODS AND APPLICATIONS (CIMA 2005), 2005, : 111 - 115
  • [50] Feature selection for linear support vector machines
    Liang, Zhizheng
    Zhao, Tuo
    18TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL 2, PROCEEDINGS, 2006, : 606 - 609