A penalized Bayesian approach to predicting sparse protein-DNA binding landscapes

Cited by: 4
Authors
Levinson, Matthew [1]
Zhou, Qing [1]
Affiliations
[1] Univ Calif Los Angeles, Dept Stat, Los Angeles, CA 90095 USA
Funding
National Science Foundation (USA);
Keywords
TRANSCRIPTION-FACTOR-BINDING; NETWORK; PLURIPOTENCY; GENE; DISCOVERY; PATTERNS; MODELS;
DOI
10.1093/bioinformatics/btt585
Chinese Library Classification number
Q5 [Biochemistry];
Discipline classification code
071010 ; 081704 ;
Abstract
Motivation: Cellular processes are controlled, directly or indirectly, by the binding of hundreds of different DNA binding factors (DBFs) to the genome. One key to a deeper understanding of the cell is discovering where, when and how strongly these DBFs bind to the DNA sequence. Direct measurement of DBF binding sites (BSs; e.g. through ChIP-Chip or ChIP-Seq experiments) is expensive, noisy and not available for every DBF in every cell type. Naive and most existing computational approaches to detecting which DBFs bind in a set of genomic regions of interest often perform poorly, owing to high false discovery rates and restrictive requirements for prior knowledge.
Results: We develop SparScape, a penalized Bayesian method for identifying DBFs active in the considered regions and predicting a joint probabilistic binding landscape. Using a sparsity-inducing penalization, SparScape is able to select a small subset of DBFs with enriched BSs in a set of DNA sequences from a much larger candidate set. This substantially reduces the false positives in prediction of BSs. Analysis of ChIP-Seq data in mouse embryonic stem cells and of simulated data shows that SparScape dramatically outperforms the naive motif scanning method and comparable computational approaches in terms of DBF identification and BS prediction.
Pages: 636-643
Number of pages: 8
Related papers
50 in total
  • [1] A Biophysical Approach to Predicting Protein-DNA Binding Energetics
    Locke, George
    Morozov, Alexandre V.
    GENETICS, 2015, 200 (04) : 1349 - +
  • [2] Predicting protein-DNA binding modes
    Perez, Alberto
    BIOPHYSICAL JOURNAL, 2022, 121 (03) : 131 - 132
  • [3] PreDBA: A heterogeneous ensemble approach for predicting protein-DNA binding affinity
    Wenyi Yang
    Lei Deng
    Scientific Reports, 10
  • [4] PreDBA: A heterogeneous ensemble approach for predicting protein-DNA binding affinity
    Yang, Wenyi
    Deng, Lei
    SCIENTIFIC REPORTS, 2020, 10 (01)
  • [5] A Bayesian approach to joint modeling of protein-DNA binding, gene expression and sequence data
    Xie, Yang
    Pan, Wei
    Jeong, Kyeong S.
    Xiao, Guanghua
    Khodursky, Arkady B.
    STATISTICS IN MEDICINE, 2010, 29 (04) : 489 - 503
  • [6] Discovery of protein-DNA interactions by penalized multivariate regression
    Zamdborg, Leonid
    Ma, Ping
    NUCLEIC ACIDS RESEARCH, 2009, 37 (16) : 5246 - 5254
  • [7] Weak Frustration Regulates Sliding and Binding Kinetics on Rugged Protein-DNA Landscapes
    Marcovitz, Amir
    Levy, Yaakov
    JOURNAL OF PHYSICAL CHEMISTRY B, 2013, 117 (42) : 13005 - 13014
  • [8] Predicting Protein-DNA Binding Sites by Fine-Tuning BERT
    Zhang, Yue
    Chen, Yuehui
    Chen, Baitong
    Cao, Yi
    Chen, Jiazi
    Cong, Hanhan
    INTELLIGENT COMPUTING THEORIES AND APPLICATION, ICIC 2022, PT II, 2022, 13394 : 663 - 669
  • [9] Linkage of protein assembly to protein-DNA binding
    Wong, I
    Lohman, TM
    ENERGETICS OF BIOLOGICAL MACROMOLECULES, 1995, 259 : 95 - 127
  • [10] From sequence to structure and back again: Approaches for predicting protein-DNA binding
    Höglund A.
    Kohlbacher O.
    Proteome Science, 2 (1)