Prediction of breast cancer by profiling of urinary RNA metabolites using Support Vector Machine-based feature selection

Cited by: 61
Authors
Henneges, Carsten [1]
Bullinger, Dino [2]
Fux, Richard [2]
Friese, Natascha [2]
Seeger, Harald [3]
Neubauer, Hans [3]
Laufer, Stefan [4]
Gleiter, Christoph H. [2]
Schwab, Matthias [2,5]
Zell, Andreas [1]
Kammerer, Bernd [2]
Affiliations
[1] Ctr Bioinformat Tubingen ZBIT, D-72076 Tubingen, Germany
[2] Univ Tubingen Hosp, Inst Pharmacol & Toxicol, Dept Clin Pharmacol, D-72076 Tubingen, Germany
[3] Univ Tubingen Hosp, Univ Frauenklin, D-72076 Tubingen, Germany
[4] Inst Pharm, D-72076 Tubingen, Germany
[5] Dr Margarete Fischer Bosch Inst Clin Pharmacol, D-70376 Stuttgart, Germany
Keywords
MODIFIED NUCLEOSIDES; CLINICAL CORRELATIONS; BIOLOGICAL MARKERS; BIOMEDICAL MARKERS; DIAGNOSIS; CARCINOMA; IDENTIFICATION; MS; PSEUDOURIDINE; METABONOMICS;
DOI
10.1186/1471-2407-9-104
Chinese Library Classification number
R73 [Oncology];
Discipline classification code
100214;
Abstract
Background: Breast cancer is among the most frequent and severe cancer types in humans. Because the excretion of modified nucleosides arising from increased RNA metabolism has been proposed as a potential target in the pathogenesis of breast cancer, the present study aimed to assess how well breast cancer can be predicted from urinary excreted nucleosides.
Methods: We analyzed urine samples from 85 women with breast cancer and respective healthy controls to assess the metabolic profiles of nucleosides using a comprehensive bioinformatic approach. All included nucleosides/ribosylated metabolites were isolated by cis-diol-specific affinity chromatography and measured by liquid chromatography ion trap mass spectrometry (LC-ITMS). A valid set of urinary metabolites was selected by excluding all candidates with poor linearity and/or reproducibility in the analytical setting. The Oscillating Search Algorithm for Feature Selection (OSAF) was applied to iteratively improve the features used for training Support Vector Machines (SVM) to better predict breast cancer.
Results: After identification of 51 nucleosides/ribosylated metabolites in the urine of women with breast cancer and/or controls by LC-ITMS, a valid set of 35 candidates was selected for subsequent computational analyses. Iterative optimization with OSAF yielded 44 pairwise ratios of metabolite features. With this approach, the best prediction of breast cancer achieved estimated sensitivity and specificity of 83.5% and 90.6%, respectively. Classification performance was dominated by metabolite pairs involving SAH, which highlights its importance for RNA methylation in cancer pathogenesis.
Conclusion: Extensive RNA-pathway analysis based on mass spectrometric analysis of metabolites and subsequent bioinformatic feature selection allowed the identification of significant metabolic features related to breast cancer pathogenesis. The combination of mass spectrometric analysis and subsequent SVM-based feature selection represents a promising tool for the development of a non-invasive prediction system.
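As a rough illustration of the pairwise-ratio features and the sensitivity/specificity estimates mentioned in the abstract, the Python sketch below builds all metabolite ratios for one sample and scores a set of binary predictions. It is not the authors' OSAF/SVM pipeline: the function names, the placeholder `metabolite_X`, and every numeric value are hypothetical. If ratios are formed from unordered pairs of the 35 valid metabolites (an assumption, since the abstract does not say how pairs were enumerated), there would be 595 candidate ratios, from which 44 were retained.

```python
from itertools import combinations
from math import comb


def pairwise_ratios(sample, names):
    """Return {"a/b": value} for every unordered pair of metabolites.

    `sample` maps metabolite name -> measured intensity (hypothetical units).
    Pairs with a zero denominator are skipped.
    """
    return {
        f"{a}/{b}": sample[a] / sample[b]
        for a, b in combinations(names, 2)
        if sample[b] != 0
    }


def sensitivity_specificity(y_true, y_pred):
    """Compute (sensitivity, specificity) for labels 1 = case, 0 = control."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    tn = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 0)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    return tp / (tp + fn), tn / (tn + fp)


# Number of unordered pairs among 35 metabolites (assumed enumeration).
n_candidate_ratios = comb(35, 2)

# Hypothetical sample; intensities are invented for illustration only.
names = ["SAH", "pseudouridine", "metabolite_X"]
sample = {"SAH": 2.0, "pseudouridine": 4.0, "metabolite_X": 1.0}
ratios = pairwise_ratios(sample, names)

# Hypothetical labels and predictions (not study data).
y_true = [1, 1, 1, 1, 0, 0, 0, 0]
y_pred = [1, 1, 1, 0, 0, 0, 0, 1]
sens, spec = sensitivity_specificity(y_true, y_pred)
```

In a fuller workflow, a feature-selection step would choose a subset of the ratio features before training a classifier; that step is omitted here because the abstract does not describe OSAF in enough detail to reproduce it.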
Pages: 11