A comprehensive sensitivity analysis of microarray breast cancer classification under feature variability

被引:16
|
作者
Sontrop, Herman M. J. [1 ]
Moerland, Perry D. [2 ]
van den Ham, Rene [3 ]
Reinders, Marcel J. T. [4 ]
Verhaegh, Wim F. J. [1 ]
机构
[1] Philips Res Labs, Mol Diagnost Dept, NL-5656 AE Eindhoven, Netherlands
[2] Acad Med Ctr, Dept Clin Epidemiol Biostat & Bioinformat, Bioinformat Lab, NL-1100 AZ Amsterdam, Netherlands
[3] Philips Res Labs, Dept Biomol Engn, NL-5656 AE Eindhoven, Netherlands
[4] Delft Univ Technol, Delft Bioinformat Lab, NL-2628 CD Delft, Netherlands
来源
BMC BIOINFORMATICS | 2009年 / 10卷
关键词
PROPAGATING UNCERTAINTY; PROBABILISTIC MODEL; EXPRESSION; LEVEL; PREDICTION; SIGNATURE; NORMALIZATION; METASTASIS; GENES; NOISE;
D O I
10.1186/1471-2105-10-389
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: Large discrepancies in signature composition and outcome concordance have been observed between different microarray breast cancer expression profiling studies. This is often ascribed to differences in array platform as well as biological variability. We conjecture that other reasons for the observed discrepancies are the measurement error associated with each feature and the choice of preprocessing method. Microarray data are known to be subject to technical variation and the confidence intervals around individual point estimates of expression levels can be wide. Furthermore, the estimated expression values also vary depending on the selected preprocessing scheme. In microarray breast cancer classification studies, however, these two forms of feature variability are almost always ignored and hence their exact role is unclear. Results: We have performed a comprehensive sensitivity analysis of microarray breast cancer classification under the two types of feature variability mentioned above. We used data from six state of the art preprocessing methods, using a compendium consisting of eight diferent datasets, involving 1131 hybridizations, containing data from both one and two-color array technology. For a wide range of classifiers, we performed a joint study on performance, concordance and stability. In the stability analysis we explicitly tested classifiers for their noise tolerance by using perturbed expression profiles that are based on uncertainty information directly related to the preprocessing methods. Our results indicate that signature composition is strongly influenced by feature variability, even if the array platform and the stratification of patient samples are identical. In addition, we show that there is often a high level of discordance between individual class assignments for signatures constructed on data coming from different preprocessing schemes, even if the actual signature composition is identical. Conclusion: Feature variability can have a strong impact on breast cancer signature composition, as well as the classification of individual patient samples. We therefore strongly recommend that feature variability is considered in analyzing data from microarray breast cancer expression profiling experiments.
引用
收藏
页数:22
相关论文
共 50 条
  • [21] Cortactin in Breast Cancer: Analysis with Tissue Microarray
    Sheen-Chen, Shyr-Ming
    Huang, Chun-Ying
    Liu, Yu-Yin
    Huang, Chao-Cheng
    Tang, Rei-Ping
    ANTICANCER RESEARCH, 2011, 31 (01) : 293 - 297
  • [22] AN INTEGRATED BREAST CANCER MICROARRAY ANALYSIS APPROACH
    Lixandru-Petre, Irina-Oana
    Buiu, Catalin
    UPB Scientific Bulletin, Series C: Electrical Engineering and Computer Science, 2022, 84 (02): : 79 - 90
  • [23] AN INTEGRATED BREAST CANCER MICROARRAY ANALYSIS APPROACH
    Lixandru-Petre, Irina-Oana
    Buiu, Catalin
    UNIVERSITY POLITEHNICA OF BUCHAREST SCIENTIFIC BULLETIN SERIES C-ELECTRICAL ENGINEERING AND COMPUTER SCIENCE, 2022, 84 (02): : 79 - 90
  • [24] Microarray-Based Class Discovery for Molecular Classification of Breast Cancer: Analysis of Interobserver Agreement
    Mackay, Alan
    Weigelt, Britta
    Grigoriadis, Anita
    Kreike, Bas
    Natrajan, Rachael
    A'Hern, Roger
    Tan, David S. P.
    Dowsett, Mitch
    Ashworth, Alan
    Reis-Filho, Jorge S.
    JNCI-JOURNAL OF THE NATIONAL CANCER INSTITUTE, 2011, 103 (08): : 662 - 673
  • [25] Feature Selection Facilitated Classification For Breast Cancer Prediction
    Arunadevi, J.
    Ganeshamoorthi, K.
    PROCEEDINGS OF THE 2019 3RD INTERNATIONAL CONFERENCE ON COMPUTING METHODOLOGIES AND COMMUNICATION (ICCMC 2019), 2019, : 560 - 563
  • [26] Comprehensive feature analysis for sample classification with comprehensive two-dimensional LC
    Reichenbach, Stephen E.
    Tian, Xue
    Tao, Qingping
    Stoll, Dwight R.
    Carr, Peter W.
    JOURNAL OF SEPARATION SCIENCE, 2010, 33 (10) : 1365 - 1374
  • [27] Microarray Lung Cancer Data Classification Using Similarity based Feature Selection
    Amrane, Meriem
    Oukid, Saliha
    Ensari, Tolga
    Benblidia, Nadjia
    Orman, Zeynep
    2019 SCIENTIFIC MEETING ON ELECTRICAL-ELECTRONICS & BIOMEDICAL ENGINEERING AND COMPUTER SCIENCE (EBBT), 2019,
  • [28] Analysis of breast cancer classification robustness with radiomics feature extraction and deep learning techniques
    Rashid, Harun Ur
    Ibrikci, Turgay
    Paydas, Semra
    Binokay, Figen
    Cevik, Ulus
    EXPERT SYSTEMS, 2022, 39 (08)
  • [29] A comprehensive evaluation of multicategory classification methods for microarray gene expression cancer diagnosis
    Statnikov, A
    Aliferis, CF
    Tsamardinos, I
    Hardin, D
    Levy, S
    BIOINFORMATICS, 2005, 21 (05) : 631 - 643
  • [30] Classification of breast cancer using microarray gene expression data: A survey
    Abd-Elnaby, Muhammed
    Alfonse, Marco
    Roushdy, Mohamed
    JOURNAL OF BIOMEDICAL INFORMATICS, 2021, 117