Analysis of metabolomic data using support vector machines

被引:286
|
作者
Mahadevan, Sankar [2 ]
Shah, Sirish L. [2 ]
Marrie, Thomas J. [1 ]
Slupsky, Carolyn M. [1 ]
机构
[1] Univ Alberta, Dept Med, Edmonton, AB, Canada
[2] Univ Alberta, Dept Chem & Mat Engn, Edmonton, AB, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
D O I
10.1021/ac800954c
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
Metabolomics is an emerging field providing insight into physiological processes. It is an effective tool to investigate disease diagnosis or conduct toxicological studies by observing changes in metabolite concentrations in various biofluids. Multivariate statistical analysis is generally employed with nuclear magnetic resonance (NMR) or mass spectrometry (MS) data to determine differences between groups (for instance diseased vs healthy). Characteristic predictive models may be built based on a set of training data, and these models are subsequently used to predict whether new test data falls under a specific class. In this study, metabolomic data is obtained by doing a H-1 NMR spectroscopy on urine samples obtained from healthy subjects (male and female) and patients suffering from Streptococcus pneumoniae. We compare the performance of traditional PLS-DA multivariate analysis to support vector machines (SVMs), a technique widely used in genome studies on two case studies: (1) a case where nearly complete distinction may be seen (healthy versus pneumonia) and (2) a case where distinction is more ambiguous (male versus female). We show that SVMs are superior to PLS-DA in both cases in terms of predictive accuracy with the least number of features. With fewer number of features, SVMs are able to give better predictive model when compared to that of PLS-DA.
引用
收藏
页码:7562 / 7570
页数:9
相关论文
共 50 条
  • [41] Aerial LiDAR data classification using weighted support vector machines
    Wu, J. (wujun93161@hotmail.com), 1600, Editorial Board of Medical Journal of Wuhan University (38):
  • [42] Anomaly Detection in Time Series Data Using Support Vector Machines
    Yokkampon, Umaporn
    Chumkamon, Sakmongkon
    Mowshowitz, Abbe
    Hayashi, Eiji
    PROCEEDINGS OF THE 2021 INTERNATIONAL CONFERENCE ON ARTIFICIAL LIFE AND ROBOTICS (ICAROB 2021), 2021, : 581 - 587
  • [43] Aerial Lidar Data Classification Using Weighted Support Vector Machines
    Wu Jun
    Guo Ning
    Liu Rong
    Liu Lijuan
    Xu Gang
    THIRD INTERNATIONAL CONFERENCE ON DIGITAL IMAGE PROCESSING (ICDIP 2011), 2011, 8009
  • [44] Strengths and weaknesses of support vector machines within marketing data analysis
    Monien, K
    Decker, R
    INNOVATIONS IN CLASSIFICATION, DATA SCIENCE, AND INFORMATION SYSTEMS, 2005, : 355 - 362
  • [45] Behavior-Constrained Support Vector Machines for fMRI Data Analysis
    Chen, Danmei
    Li, Sheng
    Kourtzi, Zoe
    Wu, Si
    IEEE TRANSACTIONS ON NEURAL NETWORKS, 2010, 21 (10): : 1680 - 1685
  • [46] Bagged ensembles of Support Vector Machines for gene expression data analysis
    Valentini, G
    Muselli, M
    Ruffino, F
    PROCEEDINGS OF THE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS 2003, VOLS 1-4, 2003, : 1844 - 1849
  • [47] Beamforming using support vector machines
    Ramón, MM
    Xu, N
    Christodoulou, CG
    IEEE ANTENNAS AND WIRELESS PROPAGATION LETTERS, 2005, 4 : 439 - 442
  • [48] Support Vector Machines for Program Analysis
    Flexeder, Andrea
    Putz, Matthias
    Runkler, Thomas
    2012 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2012,
  • [49] Analysis of Support Vector Machines Regression
    Tong, Hongzhi
    Chen, Di-Rong
    Peng, Lizhong
    FOUNDATIONS OF COMPUTATIONAL MATHEMATICS, 2009, 9 (02) : 243 - 257
  • [50] ANALYSIS OF LAPLACIAN SUPPORT VECTOR MACHINES
    Huang, Juan
    Chen, Hong
    Tao, Yan-Fang
    PROCEEDINGS OF 2009 INTERNATIONAL CONFERENCE ON WAVELET ANALYSIS AND PATTERN RECOGNITION, 2009, : 128 - +