A method for feature selection on microarray data using support vector machine

Cited by: 0
Authors
Huang, Xiao Bing [1 ]
Tang, Jian [1 ]
Affiliations
[1] Mem Univ Newfoundland, Dept Comp Sci, St John, NF A1B 3X5, Canada
Keywords
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
The data collected from a typical microarray experiment usually consist of tens of samples and thousands of genes (i.e., features). Typically, only a small subset of these features is relevant and non-redundant for differentiating the samples. Identifying an optimal subset of relevant genes is crucial for accurate classification of samples. In this paper, we propose a method for relevant gene subset selection for microarray gene expression data. Our method is based on the gap tolerant classifier, a variation of the support vector machine, and uses a hill-climbing search strategy. Unlike most other hill-climbing approaches, where classification accuracy is used as the criterion for feature selection, the proposed method uses a mixture of accuracy and SVM margin to select features. Our experimental results show that this strategy is effective both in selecting relevant features and in eliminating redundant ones.
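The kind of search the abstract describes can be sketched as greedy forward hill climbing over feature subsets, scored by a blend of accuracy and margin. This is only an illustration under stated assumptions and not the authors' algorithm: a plain linear soft-margin SVM trained by subgradient descent stands in for the gap tolerant classifier, and the weight `alpha`, the squashing `margin / (1 + margin)`, the use of training accuracy, the stopping threshold `min_gain`, and the toy data are all hypothetical choices made for this example.

```python
from math import sqrt

def train_linear_svm(X, y, lam=0.01, lr=0.1, epochs=300):
    """Full-batch subgradient descent on the L2-regularised hinge loss."""
    n, d = len(X), len(X[0])
    w, b = [0.0] * d, 0.0
    for _ in range(epochs):
        grad_w, grad_b = [lam * wj for wj in w], 0.0
        for xi, yi in zip(X, y):
            # Only samples inside the margin contribute to the hinge subgradient.
            if yi * (sum(wj * xj for wj, xj in zip(w, xi)) + b) < 1:
                for j in range(d):
                    grad_w[j] -= yi * xi[j] / n
                grad_b -= yi / n
        w = [wj - lr * gj for wj, gj in zip(w, grad_w)]
        b -= lr * grad_b
    return w, b

def mixed_score(X, y, subset, alpha=0.7):
    """Blend training accuracy with a squashed margin (assumed form of the mixture)."""
    Xs = [[row[j] for j in subset] for row in X]
    w, b = train_linear_svm(Xs, y)
    correct = sum(
        1 for xi, yi in zip(Xs, y)
        if yi * (sum(wj * xj for wj, xj in zip(w, xi)) + b) > 0
    )
    accuracy = correct / len(y)
    norm = sqrt(sum(wj * wj for wj in w))
    margin = 1.0 / norm if norm > 0 else 0.0   # margin of the canonical hyperplane
    return alpha * accuracy + (1 - alpha) * margin / (1.0 + margin)

def hill_climb(X, y, alpha=0.7, min_gain=1e-3):
    """Add the best-scoring feature at each step; stop when the gain is negligible."""
    remaining = set(range(len(X[0])))
    selected, best = [], 0.0
    while remaining:
        score, j = max(
            (mixed_score(X, y, selected + [j], alpha), j) for j in sorted(remaining)
        )
        if score - best <= min_gain:
            break
        selected.append(j)
        remaining.remove(j)
        best = score
    return selected, best

# Made-up toy data: column 0 is informative, column 1 is a half-scale copy of
# column 0 (redundant), column 2 is noise. Labels are -1 / +1.
X = [[-2.0, -1.00,  0.3], [-2.2, -1.10, -0.4], [-1.8, -0.90,  0.1],
     [ 2.0,  1.00, -0.2], [ 2.1,  1.05,  0.4], [ 1.9,  0.95, -0.1]]
y = [-1, -1, -1, 1, 1, 1]

selected, best = hill_climb(X, y)
print(selected, round(best, 3))
```

A raw margin of this kind never shrinks when a feature is added, so a mixture like this one can still reward redundant features; normalising the margin, as a gap tolerant classifier does relative to the size of the enclosing ball, is one way to counter that effect.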
Pages: 513-523
Page count: 11
Related papers
50 in total
  • [31] Gene selection algorithms for microarray data based on least squares support vector machine
    E Ke Tang
    PN Suganthan
    Xin Yao
    BMC Bioinformatics, 7 (1)
  • [32] Automatic feature scaling and selection for support vector machine classification with functional data
    Asunción Jiménez-Cordero
    Sebastián Maldonado
    Applied Intelligence, 2021, 51 : 161 - 184
  • [33] Unsupervised feature selection algorithm based on support vector machine for network data
    Dai, Kun
    Yu, Hong-Yi
    Qiu, Wen-Bo
    Li, Qing
    Jilin Daxue Xuebao (Gongxueban)/Journal of Jilin University (Engineering and Technology Edition), 2015, 45 (02): : 576 - 582
  • [34] Automatic feature scaling and selection for support vector machine classification with functional data
    Jimenez-Cordero, Asuncion
    Maldonado, Sebastian
    APPLIED INTELLIGENCE, 2021, 51 (01) : 161 - 184
  • [35] NONPARAMETRIC FEATURE SELECTION AND SUPPORT VECTOR MACHINE FOR POLARIMETRIC SAR DATA CLASSIFICATION
    Maghsoudi, Yasser
    Collins, Michael
    Leckie, Donald G.
    2011 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS), 2011, : 2857 - 2860
  • [36] Spectrum Data Feature Analysis Based on Support Vector Machine Method
    Wu, Jiayi
    Cui, Shuo
    Su, Donglin
    2017 IEEE SIXTH ASIA-PACIFIC CONFERENCE ON ANTENNAS AND PROPAGATION (APCAP), 2017,
  • [37] Hybrid Support Vector Machine based Feature Selection Method for Text Classification
    Sabbah, Thabit
    Ayyash, Mosab
    Ashraf, Mahmood
    INTERNATIONAL ARAB JOURNAL OF INFORMATION TECHNOLOGY, 2018, 15 (3A) : 599 - 609
  • [38] sigFeature: Novel Significant Feature Selection Method for Classification of Gene Expression Data Using Support Vector Machine and t Statistic
    Das, Pijush
    Roychowdhury, Anirban
    Das, Subhadeep
    Roychoudhury, Susanta
    Tripathy, Sucheta
    FRONTIERS IN GENETICS, 2020, 11
  • [39] Using a Feature Subset Selection method and Support Vector Machine to address curse of dimensionality and redundancy in Hyperion hyperspectral data classification
    Salimi, Amir
    Ziaii, Mansour
    Amiri, Ali
    Zadeh, Mahdieh Hosseinjani
    Karimpouli, Sadegh
    Moradkhani, Mostafa
    EGYPTIAN JOURNAL OF REMOTE SENSING AND SPACE SCIENCES, 2018, 21 (01): : 27 - 36
  • [40] A Method Based on Support Vector Machine for Feature Selection of Latent Semantic Features
    Li, Min-Song
    ADVANCED MATERIALS SCIENCE AND TECHNOLOGY, PTS 1-2, 2011, 181-182 : 830 - 835