Two-stage gene selection for support vector machine classification of microarray data

被引:7
|
作者
Xia, Xiao-Lei [1 ]
Li, Kang [1 ]
Irwin, George W. [1 ]
机构
[1] Queens Univ Belfast, Sch Elect Elect Engn & Comp Sci, Ashby Bldg,Stranmillis Rd, Belfast BT9 5AH, Antrim, North Ireland
关键词
support vector machines; SVM; two-stage linear regression; gene selection; baseline method; significance analysis of microarrays; SAM;
D O I
10.1504/IJMIC.2009.029029
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper proposes a new stable gene selection method for support vector machines (SVM) classification of microarray data, aiming to improve the classification accuracy. A two-stage algorithm is used to select genes, leading to the construction of a compact multivariate linear regression model, which contains only genes less than the number of experiments as well as a weight vector for each gene index. An SVM then learns the microarray data based on this linear regression model. The experimental results, from two well-known microarray datasets, show that SVMs with two-stage gene selection maintains a consistently high accuracy with a small number of genes. It is also shown that the proposed method outperforms the two other typical gene selection methods - baseline method and significance analysis of microarrays in terms of accuracy.
引用
收藏
页码:164 / 171
页数:8
相关论文
共 50 条
  • [41] SVM Classifier – a comprehensive java interface for support vector machine classification of microarray data
    Mehdi Pirooznia
    Youping Deng
    BMC Bioinformatics, 7
  • [42] Gene selection from microarray data for cancer classification - a machine learning approach
    Wang, Y
    Tetko, IV
    Hall, MA
    Frank, E
    Facius, A
    Mayer, KFX
    Mewes, HW
    COMPUTATIONAL BIOLOGY AND CHEMISTRY, 2005, 29 (01) : 37 - 46
  • [43] Analyzing Support Vector Machine Overfitting on Microarray Data
    Han, Henry
    INTELLIGENT COMPUTING IN BIOINFORMATICS, 2014, 8590 : 148 - 156
  • [44] Efficient gene selection for classification of microarray data
    Ho, SY
    Lee, CC
    Chen, HM
    Huang, HL
    2005 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION, VOLS 1-3, PROCEEDINGS, 2005, : 1753 - 1760
  • [45] Gene selection for cancer classification in microarray data
    Zhang, Lijuan
    Li, Zhoujun
    Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2009, 46 (05): : 794 - 802
  • [46] A Two-Stage Fault Detection and Classification for Electric Pitch Drives in Offshore Wind Farms using Support Vector Machine
    Kandukuri, Surya Teja
    Senanayaka, Jagath Sri Lal
    Van Khang Huynh
    Robbersmyr, Kjell G.
    2017 20TH INTERNATIONAL CONFERENCE ON ELECTRICAL MACHINES AND SYSTEMS (ICEMS), 2017,
  • [47] Two-Stage Hybrid Gene Selection Using Mutual Information and Genetic Algorithm for Cancer Data Classification
    Rani, M. Jansi
    Devaraj, D.
    JOURNAL OF MEDICAL SYSTEMS, 2019, 43 (08)
  • [48] Two-Stage Hybrid Gene Selection Using Mutual Information and Genetic Algorithm for Cancer Data Classification
    M. Jansi Rani
    D. Devaraj
    Journal of Medical Systems, 2019, 43
  • [49] Two-stage feature selection for classification of gene expression data based on an improved Salp Swarm Algorithm
    Qin, Xiwen
    Zhang, Shuang
    Yin, Dongmei
    Chen, Dongxue
    Dong, Xiaogang
    MATHEMATICAL BIOSCIENCES AND ENGINEERING, 2022, 19 (12) : 13747 - 13781
  • [50] Variable selection using probability density function similarity for support vector machine classification of high-dimensional microarray data
    Tang, Li-Juan
    Jiang, Jian-Hui
    Wu, Hai-Long
    Shen, Guo-Li
    Yu, Ru-Qin
    TALANTA, 2009, 79 (02) : 260 - 267