A comprehensive sensitivity analysis of microarray breast cancer classification under feature variability

被引:16
|
作者
Sontrop, Herman M. J. [1 ]
Moerland, Perry D. [2 ]
van den Ham, Rene [3 ]
Reinders, Marcel J. T. [4 ]
Verhaegh, Wim F. J. [1 ]
机构
[1] Philips Res Labs, Mol Diagnost Dept, NL-5656 AE Eindhoven, Netherlands
[2] Acad Med Ctr, Dept Clin Epidemiol Biostat & Bioinformat, Bioinformat Lab, NL-1100 AZ Amsterdam, Netherlands
[3] Philips Res Labs, Dept Biomol Engn, NL-5656 AE Eindhoven, Netherlands
[4] Delft Univ Technol, Delft Bioinformat Lab, NL-2628 CD Delft, Netherlands
来源
BMC BIOINFORMATICS | 2009年 / 10卷
关键词
PROPAGATING UNCERTAINTY; PROBABILISTIC MODEL; EXPRESSION; LEVEL; PREDICTION; SIGNATURE; NORMALIZATION; METASTASIS; GENES; NOISE;
D O I
10.1186/1471-2105-10-389
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: Large discrepancies in signature composition and outcome concordance have been observed between different microarray breast cancer expression profiling studies. This is often ascribed to differences in array platform as well as biological variability. We conjecture that other reasons for the observed discrepancies are the measurement error associated with each feature and the choice of preprocessing method. Microarray data are known to be subject to technical variation and the confidence intervals around individual point estimates of expression levels can be wide. Furthermore, the estimated expression values also vary depending on the selected preprocessing scheme. In microarray breast cancer classification studies, however, these two forms of feature variability are almost always ignored and hence their exact role is unclear. Results: We have performed a comprehensive sensitivity analysis of microarray breast cancer classification under the two types of feature variability mentioned above. We used data from six state of the art preprocessing methods, using a compendium consisting of eight diferent datasets, involving 1131 hybridizations, containing data from both one and two-color array technology. For a wide range of classifiers, we performed a joint study on performance, concordance and stability. In the stability analysis we explicitly tested classifiers for their noise tolerance by using perturbed expression profiles that are based on uncertainty information directly related to the preprocessing methods. Our results indicate that signature composition is strongly influenced by feature variability, even if the array platform and the stratification of patient samples are identical. In addition, we show that there is often a high level of discordance between individual class assignments for signatures constructed on data coming from different preprocessing schemes, even if the actual signature composition is identical. Conclusion: Feature variability can have a strong impact on breast cancer signature composition, as well as the classification of individual patient samples. We therefore strongly recommend that feature variability is considered in analyzing data from microarray breast cancer expression profiling experiments.
引用
收藏
页数:22
相关论文
共 50 条
  • [31] A Comprehensive Analysis on Breast Cancer Classification with Radial Basis Function and Gaussian Mixture Model
    Rajaguru, Harikumar
    Prabhakar, Sunil Kumar
    16TH INTERNATIONAL CONFERENCE ON BIOMEDICAL ENGINEERING, 2017, 61 : 21 - 27
  • [32] A Comprehensive Analysis on the Efficacy of Machine Learning-Based Algorithms for Breast Cancer Classification
    Senthilkumar, K. P.
    Narmatha, P.
    Narasimharao, Jonnadula
    Mustare, Narendra
    Rufus, N. Herald Anantha
    Singh, Yashapl
    JOURNAL OF ELECTRICAL SYSTEMS, 2024, 20 (02) : 857 - 866
  • [33] Comprehensive analysis of microRNAs in breast cancer
    Chang, Hong-Tai
    Li, Sung-Chou
    Ho, Meng-Ru
    Pan, Hung-Wei
    Ger, Luo-Ping
    Hu, Ling-Yueh
    Yu, Shou-Yu
    Li, Wen-Hsiung
    Tsai, Kuo-Wang
    BMC GENOMICS, 2012, 13
  • [34] Comprehensive analysis of microRNAs in breast cancer
    Hong-Tai Chang
    Sung-Chou Li
    Meng-Ru Ho
    Hung-Wei Pan
    Luo-Ping Ger
    Ling-Yueh Hu
    Shou-Yu Yu
    Wen-Hsiung Li
    Kuo-Wang Tsai
    BMC Genomics, 13
  • [35] Comprehensive analysis of NuMAvariation in breast cancer
    Outi Kilpivaara
    Matias Rantanen
    Anitta Tamminen
    Kristiina Aittomäki
    Carl Blomqvist
    Heli Nevanlinna
    BMC Cancer, 8
  • [36] Survival feature and trend of female breast cancer: A comprehensive review of survival analysis from cancer registration data
    Tang, Dan-Dan
    Ye, Zhuo-Jun
    Liu, Wan-Wan
    Wu, Jing
    Tan, Jing-Yu
    Zhang, Yan
    Xu, Qun
    Xiang, Yong-Bing
    BREAST, 2025, 79
  • [37] CFSES optimization Feature Selection with neural network classification for microarray data analysis
    Patra, Bichitrananda
    Bisoyi, Sudhansu Sekhar
    2ND INTERNATIONAL CONFERENCE ON DATA SCIENCE AND BUSINESS ANALYTICS (ICDSBA 2018), 2018, : 45 - 50
  • [38] Ensemble Feature Selection Compares to Meta-analysis for Breast Cancer Biomarker Identification from Microarray Data
    Trevizan, Bernardo
    Recamonde-Mendoza, Mariana
    COMPUTATIONAL SCIENCE AND ITS APPLICATIONS, ICCSA 2021, PT I, 2021, 12949 : 162 - 178
  • [39] A comparative analysis of sensitivity of convolutional neural networks for histopathology image classification in breast cancer
    Cano, Fabian
    Madabhushi, Anant
    Cruz-Roa, Angel
    14TH INTERNATIONAL SYMPOSIUM ON MEDICAL INFORMATION PROCESSING AND ANALYSIS, 2018, 10975
  • [40] NMD microarray analysis of mutated genes in breast cancer
    Muggerud, Aslaug Aa
    Wolf, Maija
    Edgren, Henrik
    Kilpinen, Sami
    Sorlie, Therese
    Kallioniemi, Olli
    CANCER RESEARCH, 2006, 66 (08)