Information recovery from low coverage whole-genome bisulfite sequencing

被引:28
|
作者
Libertini, Emanuele [1 ]
Heath, Simon C. [2 ]
Hamoudi, Rifat A. [3 ]
Gut, Marta [2 ]
Ziller, Michael J. [4 ,5 ,6 ]
Czyz, Agata [7 ]
Ruotti, Victor [7 ]
Stunnenberg, Hendrik G. [8 ]
Frontini, Mattia [9 ,10 ,11 ]
Ouwehand, Willem H. [9 ,10 ,12 ]
Meissner, Alexander [4 ,5 ,6 ]
Gut, Ivo G. [2 ]
Beck, Stephan [1 ]
机构
[1] UCL, Inst Canc, Med Genom, London WC1E 6BT, England
[2] CNAG, Parc Cient Barcelona, Barcelona 08028, Spain
[3] UCL, Div Surg & Intervent Sci, London W1W 7EJ, England
[4] MIT & Harvard, Broad Inst, Cambridge, MA 02142 USA
[5] Harvard Stem Cell Inst, Cambridge, MA 02138 USA
[6] Harvard Univ, Dept Stem Cell & Regenerat Biol, Cambridge, MA 02138 USA
[7] Illumina Inc, San Diego, CA 92121 USA
[8] Radboud Univ Nijmegen, Dept Mol Biol, NL-6525 GA Nijmegen, Netherlands
[9] Univ Cambridge, Dept Haematol, Cambridge CB2 0XY, England
[10] Natl Hlth Serv Blood & Transplant, Cambridge Biomedical Campus, Cambridge CB2 0XY, England
[11] Univ Cambridge, British Heart Fdn Ctr Excellence, Cambridge CB2 0QQ, England
[12] Wellcome Trust Sanger Inst, Wellcome Trust Genome Campus, Cambridge CB10 1SA, England
来源
NATURE COMMUNICATIONS | 2016年 / 7卷
基金
英国惠康基金;
关键词
EPIGENOME-WIDE ASSOCIATION; DNA METHYLATION; IDENTIFICATION; IMPUTATION; PACKAGE; REGIONS;
D O I
10.1038/ncomms11306
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
The cost of whole-genome bisulfite sequencing (WGBS) remains a bottleneck for many studies and it is therefore imperative to extract as much information as possible from a given dataset. This is particularly important because even at the recommend 30X coverage for reference methylomes, up to 50% of high-resolution features such as differentially methylated positions (DMPs) cannot be called with current methods as determined by saturation analysis. To address this limitation, we have developed a tool that dynamically segments WGBS methylomes into blocks of comethylation (COMETs) from which lost information can be recovered in the form of differentially methylated COMETs (DMCs). Using this tool, we demonstrate recovery of similar to 30% of the lost DMP information content as DMCs even at very low (5X) coverage. This constitutes twice the amount that can be recovered using an existing method based on differentially methylated regions (DMRs). In addition, we explored the relationship between COMETs and haplotypes in lymphoblastoid cell lines of African and European origin. Using best fit analysis, we show COMETs to be correlated in a population-specific manner, suggesting that this type of dynamic segmentation may be useful for integrated (epi) genome-wide association studies in the future.
引用
收藏
页数:7
相关论文
共 50 条
  • [31] Methylated DNA is over-represented in whole-genome bisulfite sequencing data
    Ji, Lexiang
    Sasaki, Takahiko
    Sun, Xiaoxiao
    Ma, Ping
    Lewis, Zachary A.
    Schmitz, Robert J.
    FRONTIERS IN GENETICS, 2014, 5
  • [32] Indexcov: fast coverage quality control for whole-genome sequencing
    Pedersen, Brent S.
    Collins, Ryan L.
    Talkowski, Michael E.
    Quinlan, Aaron R.
    GIGASCIENCE, 2017, 6 (11):
  • [33] Performance of Mapping Approaches for Whole-Genome Bisulfite Sequencing Data in Crop Plants
    Grehl, Claudius
    Wagner, Marc
    Lemnian, Ioana
    Glaser, Bruno
    Grosse, Ivo
    FRONTIERS IN PLANT SCIENCE, 2020, 11
  • [34] Software updates in the Illumina HiSeq platform affect whole-genome bisulfite sequencing
    Toh, Hidehiro
    Shirane, Kenjiro
    Miura, Fumihito
    Kubo, Naoki
    Ichiyanagi, Kenji
    Hayashi, Katsuhiko
    Saitou, Mitinori
    Suyama, Mikita
    Ito, Takashi
    Sasaki, Hiroyuki
    BMC GENOMICS, 2017, 18
  • [35] Software updates in the Illumina HiSeq platform affect whole-genome bisulfite sequencing
    Hidehiro Toh
    Kenjiro Shirane
    Fumihito Miura
    Naoki Kubo
    Kenji Ichiyanagi
    Katsuhiko Hayashi
    Mitinori Saitou
    Mikita Suyama
    Takashi Ito
    Hiroyuki Sasaki
    BMC Genomics, 18
  • [36] Whole-genome sequencing
    Morris, Huw R.
    Houlden, Henry
    Polke, James
    PRACTICAL NEUROLOGY, 2021, 21 (04) : 322 - +
  • [37] Construction of high coverage whole-genome sequencing libraries from single colon crypts without DNA extraction or whole-genome amplification
    Manojlovic, Zarko
    Wlodarczyk, Jordan
    Okitsu, Cindy
    Jin, Yuxin
    Van Den Berg, David
    Lieber, Michael R.
    Hsieh, Chih-Lin
    BMC RESEARCH NOTES, 2023, 16 (01)
  • [38] Construction of high coverage whole-genome sequencing libraries from single colon crypts without DNA extraction or whole-genome amplification
    Zarko Manojlovic
    Jordan Wlodarczyk
    Cindy Okitsu
    Yuxin Jin
    David Van Den Berg
    Michael R. Lieber
    Chih-Lin Hsieh
    BMC Research Notes, 16
  • [39] Non-Invasive Detection of Breast Cancer by Low-Coverage Whole-Genome Sequencing from Plasma
    Peng, Li
    Yao, Ru
    Gao, Sihang
    Qu, Yang
    Qu, Li
    Zhang, Jingbo
    Zhou, Yidong
    CLINICAL AND EXPERIMENTAL OBSTETRICS & GYNECOLOGY, 2023, 50 (07):
  • [40] Mechanisms governing melon fruit skin pigmentation: Insights from transcriptome sequencing and whole-genome bisulfite sequencing analyses
    Wu, Ting
    Liu, Bin
    Xiong, Tao
    Yan, Miao
    Zhang, Jun-Ling
    Yang, Yong
    Hu, Guo-Zhi
    SCIENTIA HORTICULTURAE, 2024, 333