Two-stage gene selection for support vector machine classification of microarray data

被引:7
|
作者
Xia, Xiao-Lei [1 ]
Li, Kang [1 ]
Irwin, George W. [1 ]
机构
[1] Queens Univ Belfast, Sch Elect Elect Engn & Comp Sci, Ashby Bldg,Stranmillis Rd, Belfast BT9 5AH, Antrim, North Ireland
关键词
support vector machines; SVM; two-stage linear regression; gene selection; baseline method; significance analysis of microarrays; SAM;
D O I
10.1504/IJMIC.2009.029029
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper proposes a new stable gene selection method for support vector machines (SVM) classification of microarray data, aiming to improve the classification accuracy. A two-stage algorithm is used to select genes, leading to the construction of a compact multivariate linear regression model, which contains only genes less than the number of experiments as well as a weight vector for each gene index. An SVM then learns the microarray data based on this linear regression model. The experimental results, from two well-known microarray datasets, show that SVMs with two-stage gene selection maintains a consistently high accuracy with a small number of genes. It is also shown that the proposed method outperforms the two other typical gene selection methods - baseline method and significance analysis of microarrays in terms of accuracy.
引用
收藏
页码:164 / 171
页数:8
相关论文
共 50 条
  • [31] Huberized multiclass support vector machine for microarray classification
    Li J.-T.
    Jia Y.-M.
    Zidonghua Xuebao/ Acta Automatica Sinica, 2010, 36 (03): : 399 - 405
  • [32] Efficient two-stage fuzzy clustering of microarray gene expression data
    Mukhopadhyay, Anirban
    Maulik, Ujjwal
    Bandyopadhyay, Sanghamitra
    ICIT 2006: 9TH INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY, PROCEEDINGS, 2006, : 11 - 14
  • [33] Data Classification with Support Vector Machine and Generalized Support Vector Machine
    Qi, Xiaomin
    Silvestrov, Sergei
    Nazir, Talat
    ICNPAA 2016 WORLD CONGRESS: 11TH INTERNATIONAL CONFERENCE ON MATHEMATICAL PROBLEMS IN ENGINEERING, AEROSPACE AND SCIENCES, 2017, 1798
  • [34] A two-stage fault diagnosis methodology for rotating machinery combining optimized support vector data description and optimized support vector machine
    Zhang, Jianqun
    Zhang, Qing
    Qin, Xianrong
    Sun, Yuantao
    MEASUREMENT, 2022, 200
  • [35] Gene Expression Data Classification using Support Vector Machine and Mutual Information-based Gene Selection
    Vanitha, Devi Arockia C.
    Devaraj, D.
    Venkatesulu, M.
    GRAPH ALGORITHMS, HIGH PERFORMANCE IMPLEMENTATIONS AND ITS APPLICATIONS (ICGHIA 2014), 2015, 47 : 13 - 21
  • [36] A Two-Stage Feature Selection Method for Gene Expression Data
    Chuang, Li-Yeh
    Ke, Chao-Hsuan
    Chang, Hsueh-Wei
    Yang, Cheng-Hong
    OMICS-A JOURNAL OF INTEGRATIVE BIOLOGY, 2009, 13 (02) : 127 - 137
  • [37] Two-stage optimization support vector machine for the construction of investment strategy model
    Wen, Chih-Hung
    Pan, Wen-Tsao
    FIRST IITA INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2009, : 248 - +
  • [38] Two-stage optimization support vector machine for the construction of investment strategy model
    Pan, Wen-Tsao
    JOURNAL OF STATISTICS & MANAGEMENT SYSTEMS, 2009, 12 (03): : 561 - 573
  • [39] Architectures for detecting and solving conflicts: two-stage classification and support vector classifiers
    Louis Vuurpijl
    Lambert Schomaker
    Merijn van Erp
    Document Analysis and Recognition, 2003, 5 (4): : 213 - 223
  • [40] Model selection for support vector machine classification
    Gold, C
    Sollich, P
    NEUROCOMPUTING, 2003, 55 (1-2) : 221 - 249