De novo prediction of cis-regulatory elements and modules through integrative analysis of a large number of ChIP datasets

被引:10
|
作者
Niu, Meng [1 ]
Tabari, Ehsan S. [1 ]
Su, Zhengchang [1 ]
机构
[1] Univ N Carolina, Coll Comp & Informat, Dept Bioinformat & Gen, Charlotte, NC 28223 USA
来源
BMC GENOMICS | 2014年 / 15卷
基金
美国国家科学基金会;
关键词
cis-regulatory elements; cis-regulatory modules; ChIP-chip; ChIP-seq; Drosophila melanogaster; TRANSCRIPTION FACTOR-BINDING; MOTIF DISCOVERY; GENE-REGULATION; OPEN CHROMATIN; NONCODING DNA; FRUIT-FLY; GENOME; SEQ; SITES; EXPRESSION;
D O I
10.1186/1471-2164-15-1047
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
Background: In eukaryotes, transcriptional regulation is usually mediated by interactions of multiple transcription factors (TFs) with their respective specific cis-regulatory elements (CREs) in the so-called cis-regulatory modules (CRMs) in DNA. Although the knowledge of CREs and CRMs in a genome is crucial to elucidate gene regulatory networks and understand many important biological phenomena, little is known about the CREs and CRMs in most eukaryotic genomes due to the difficulty to characterize them by either computational or traditional experimental methods. However, the exponentially increasing number of TF binding location data produced by the recent wide adaptation of chromatin immunoprecipitation coupled with microarray hybridization (ChIP-chip) or high-throughput sequencing (ChIP-seq) technologies has provided an unprecedented opportunity to identify CRMs and CREs in genomes. Nonetheless, how to effectively mine these large volumes of ChIP data to identify CREs and CRMs at nucleotide resolution is a highly challenging task. Results: We have developed a novel graph-theoretic based algorithm DePCRM for genome-wide de novo predictions of CREs and CRMs using a large number of ChIP datasets. DePCRM predicts CREs and CRMs by identifying overrepresented combinatorial CRE motif patterns in multiple ChIP datasets in an effective way. When applied to 168 ChIP datasets of 56 TFs from D. melanogaster, DePCRM identified 184 and 746 overrepresented CRE motifs and their combinatorial patterns, respectively, and predicted a total of 115,932 CRMs in the genome. The predictions recover 77.9% of known CRMs in the datasets and 89.3% of known CRMs containing at least one predicted CRE. We found that the putative CRMs as well as CREs as a whole in a CRM are more conserved than randomly selected sequences. Conclusion: Our results suggest that the CRMs predicted by DePCRM are highly likely to be functional. Our algorithm is the first of its kind for de novo genome-wide prediction of CREs and CRMs using larger number of transcription factor ChIP datasets. The algorithm and predictions will hopefully facilitate the elucidation of gene regulatory networks in eukaryotes. All the predicted CREs, CRMs, and their target genes are available at http://bioinfo.uncc.edu/mniu/pcrms/www/.
引用
收藏
页数:20
相关论文
共 24 条
  • [1] De novo prediction of cis-regulatory elements and modules through integrative analysis of a large number of ChIP datasets
    Meng Niu
    Ehsan S Tabari
    Zhengchang Su
    BMC Genomics, 15
  • [2] BICORN: An R package for integrative inference of de novo cis-regulatory modules
    Chen, Xi
    Gu, Jinghua
    Neuwald, Andrew F.
    Hilakivi-Clarke, Leena
    Clarke, Robert
    Xuan, Jianhua
    SCIENTIFIC REPORTS, 2020, 10 (01)
  • [3] Author Correction: BICORN: An R package for integrative inference of de novo cis-regulatory modules
    Xi Chen
    Jinghua Gu
    Andrew F. Neuwald
    Leena Hilakivi-Clarke
    Robert Clarke
    Jianhua Xuan
    Scientific Reports, 10
  • [4] Bounded search for de novo identification of degenerate cis-regulatory elements
    Carlson, Jonathan M.
    Chakravarty, Arijit
    Khetani, Radhika S.
    Gross, Robert H.
    BMC BIOINFORMATICS, 2006, 7 (1)
  • [5] Bounded search for de novo identification of degenerate cis-regulatory elements
    Jonathan M Carlson
    Arijit Chakravarty
    Radhika S Khetani
    Robert H Gross
    BMC Bioinformatics, 7
  • [6] CisModule:: De novo discovery of' cis-regulatory modules by hierarchical mixture modeling
    Zhou, Q
    Wong, WH
    PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2004, 101 (33) : 12114 - 12119
  • [7] i-cisTarget: an integrative genomics method for the prediction of regulatory features and cis-regulatory modules
    Herrmann, Carl
    Van de Sande, Bram
    Potier, Delphine
    Aerts, Stein
    NUCLEIC ACIDS RESEARCH, 2012, 40 (15)
  • [8] Evolution of cis-regulatory elements in yeast de novo and duplicated new genes
    Tsai, Zing Tsung-Yeh
    Tsai, Huai-Kuang
    Cheng, Jen-Hao
    Lin, Chih-Hsu
    Tsai, Yuan-Fan
    Wang, Daryi
    BMC GENOMICS, 2012, 13
  • [9] Evolution of cis-regulatory elements in yeast de novo and duplicated new genes
    Zing Tsung-Yeh Tsai
    Huai-Kuang Tsai
    Jen-Hao Cheng
    Chih-Hsu Lin
    Yuan-Fan Tsai
    Daryi Wang
    BMC Genomics, 13
  • [10] Identification of putative cis-regulatory elements in Cryptosporidium parvum by de novo pattern finding
    Nandita Mullapudi
    Cheryl A Lancto
    Mitchell S Abrahamsen
    Jessica C Kissinger
    BMC Genomics, 8