共 50 条
A penalized Bayesian approach to predicting sparse protein-DNA binding landscapes
被引:4
|作者:
Levinson, Matthew
[1
]
Zhou, Qing
[1
]
机构:
[1] Univ Calif Los Angeles, Dept Stat, Los Angeles, CA 90095 USA
基金:
美国国家科学基金会;
关键词:
TRANSCRIPTION-FACTOR-BINDING;
NETWORK;
PLURIPOTENCY;
GENE;
DISCOVERY;
PATTERNS;
MODELS;
D O I:
10.1093/bioinformatics/btt585
中图分类号:
Q5 [生物化学];
学科分类号:
071010 ;
081704 ;
摘要:
Motivation: Cellular processes are controlled, directly or indirectly, by the binding of hundreds of different DNA binding factors (DBFs) to the genome. One key to deeper understanding of the cell is discovering where, when and how strongly these DBFs bind to the DNA sequence. Direct measurement of DBF binding sites (BSs; e.g. through ChIP-Chip or ChIP-Seq experiments) is expensive, noisy and not available for every DBF in every cell type. Naive and most existing computational approaches to detecting which DBFs bind in a set of genomic regions of interest often perform poorly, due to the high false discovery rates and restrictive requirements for prior knowledge. Results: We develop SparScape, a penalized Bayesian method for identifying DBFs active in the considered regions and predicting a joint probabilistic binding landscape. Using a sparsity-inducing penalization, SparScape is able to select a small subset of DBFs with enriched BSs in a set of DNA sequences from a much larger candidate set. This substantially reduces the false positives in prediction of BSs. Analysis of ChIP-Seq data in mouse embryonic stem cells and simulated data show that SparScape dramatically outperforms the naive motif scanning method and the comparable computational approaches in terms of DBF identification and BS prediction.
引用
收藏
页码:636 / 643
页数:8
相关论文