Preprocessing differential methylation hybridization microarray data

被引:11
|
作者
Sun, Shuying [1 ,2 ]
Huang, Yi-Wen [3 ]
Yan, Pearlly S. [3 ]
Huang, Tim H. M. [3 ]
Lin, Shili [4 ]
机构
[1] Case Western Reserve Univ, Case Comprehens Canc Ctr, Cleveland, OH 44106 USA
[2] Case Western Reserve Univ, Dept Epidemiol & Biostat, Cleveland, OH 44106 USA
[3] Ohio State Univ, Human Canc Genet Program, Columbus, OH 43210 USA
[4] Ohio State Univ, Dept Stat, Columbus, OH 43210 USA
来源
BIODATA MINING | 2011年 / 4卷
基金
美国国家科学基金会;
关键词
DNA METHYLATION; BACKGROUND CORRECTION; HOUSEKEEPING GENES; SYSTEMATIC VARIATION; BREAST-CANCER; NORMALIZATION; EXPRESSION; WIDE;
D O I
10.1186/1756-0381-4-13
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Background: DNA methylation plays a very important role in the silencing of tumor suppressor genes in various tumor types. In order to gain a genome-wide understanding of how changes in methylation affect tumor growth, the differential methylation hybridization (DMH) protocol has been developed and large amounts of DMH microarray data have been generated. However, it is still unclear how to preprocess this type of microarray data and how different background correction and normalization methods used for two-color gene expression arrays perform for the methylation microarray data. In this paper, we demonstrate our discovery of a set of internal control probes that have log ratios (M) theoretically equal to zero according to this DMH protocol. With the aid of this set of control probes, we propose two LOESS (or LOWESS, locally weighted scatter-plot smoothing) normalization methods that are novel and unique for DMH microarray data. Combining with other normalization methods (global LOESS and no normalization), we compare four normalization methods. In addition, we compare five different background correction methods. Results: We study 20 different preprocessing methods, which are the combination of five background correction methods and four normalization methods. In order to compare these 20 methods, we evaluate their performance of identifying known methylated and un-methylated housekeeping genes based on two statistics. Comparison details are illustrated using breast cancer cell line and ovarian cancer patient methylation microarray data. Our comparison results show that different background correction methods perform similarly; however, four normalization methods perform very differently. In particular, all three different LOESS normalization methods perform better than the one without any normalization. Conclusions: It is necessary to do within-array normalization, and the two LOESS normalization methods based on specific DMH internal control probes produce more stable and relatively better results than the global LOESS normalization method.
引用
收藏
页数:15
相关论文
共 50 条
  • [21] Multivariate analysis of microarray data: differential expression and differential connection
    Kiiveri, Harri T.
    BMC BIOINFORMATICS, 2011, 12
  • [22] Multivariate analysis of microarray data: differential expression and differential connection
    Harri T Kiiveri
    BMC Bioinformatics, 12
  • [23] Microarray Data Analysis for Differential Expression: a Tutorial
    Suarez, Erick
    Burguete, Ana
    Mclachlan, Geoffrey J.
    PUERTO RICO HEALTH SCIENCES JOURNAL, 2009, 28 (02) : 89 - 104
  • [24] Adjustments and measures of differential expression for microarray data
    Tsodikov, A
    Szabo, A
    Jones, D
    BIOINFORMATICS, 2002, 18 (02) : 251 - 260
  • [25] CpGassoc: an R function for analysis of DNA methylation microarray data
    Barfield, Richard T.
    Kilaru, Varun
    Smith, Alicia K.
    Conneely, Karen N.
    BIOINFORMATICS, 2012, 28 (09) : 1280 - 1281
  • [26] An evaluation of statistical methods for DNA methylation microarray data analysis
    Dongmei Li
    Zidian Xie
    Marc Le Pape
    Timothy Dye
    BMC Bioinformatics, 16 (1)
  • [27] An evaluation of statistical methods for DNA methylation microarray data analysis
    Li, Dongmei
    Xie, Zidian
    Le Pape, Marc
    Dye, Timothy
    BMC BIOINFORMATICS, 2015, 16
  • [28] A Differential Geometry Perspective about Multiple Data Streams Preprocessing
    Li Wen-Ping
    Yang Jing
    Zhang Jian-Pei
    INTERNATIONAL ARAB JOURNAL OF INFORMATION TECHNOLOGY, 2015, 12 (06) : 556 - 565
  • [29] A data-driven approach to preprocessing Illumina 450K methylation array data
    Ruth Pidsley
    Chloe C Y Wong
    Manuela Volta
    Katie Lunnon
    Jonathan Mill
    Leonard C Schalkwyk
    BMC Genomics, 14
  • [30] A data-driven approach to preprocessing Illumina 450K methylation array data
    Pidsley, Ruth
    Wong, Chloe C. Y.
    Volta, Manuela
    Lunnon, Katie
    Mill, Jonathan
    Schalkwyk, Leonard C.
    BMC GENOMICS, 2013, 14