A Bayesian data fusion based approach for learning genome-wide transcriptional regulatory networks

被引:3
|
作者
Sauta, Elisabetta [1 ]
Demartini, Andrea [1 ]
Vitali, Francesca [2 ]
Riva, Alberto [3 ]
Bellazzi, Riccardo [1 ]
机构
[1] Univ Pavia, Dept Elect Comp & Biomed Engn, Via Ferrata 5, I-27100 Pavia, Italy
[2] Univ Arizona Hlth Sci, Dept Med, Ctr Biomed Informat & Biostat, 1230 Cherry Ave, Tucson, AZ 85719 USA
[3] Univ Florida, Interdisciplinary Ctr Biotechnol Res, Bioinformat Core, Gainesville, FL 32610 USA
关键词
Genomic transcriptional networks; omics-data fusion; Bayesian networks; Hybrid structure learning algorithm; STEM-CELL ENHANCER; GENE-EXPRESSION; DNA BINDING; MLL FAMILY; C-MYC; HEMATOPOIESIS; LEUKEMIA; PROTEIN; SETDB1; TAL1;
D O I
10.1186/s12859-020-3510-1
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background Reverse engineering of transcriptional regulatory networks (TRN) from genomics data has always represented a computational challenge in System Biology. The major issue is modeling the complex crosstalk among transcription factors (TFs) and their target genes, with a method able to handle both the high number of interacting variables and the noise in the available heterogeneous experimental sources of information. Results In this work, we propose a data fusion approach that exploits the integration of complementary omics-data as prior knowledge within a Bayesian framework, in order to learn and model large-scale transcriptional networks. We develop a hybrid structure-learning algorithm able to jointly combine TFs ChIP-Sequencing data and gene expression compendia to reconstruct TRNs in a genome-wide perspective. Applying our method to high-throughput data, we verified its ability to deal with the complexity of a genomic TRN, providing a snapshot of the synergistic TFs regulatory activity. Given the noisy nature of data-driven prior knowledge, which potentially contains incorrect information, we also tested the method's robustness to false priors on a benchmark dataset, comparing the proposed approach to other regulatory network reconstruction algorithms. We demonstrated the effectiveness of our framework by evaluating structural commonalities of our learned genomic network with other existing networks inferred by different DNA binding information-based methods. Conclusions This Bayesian omics-data fusion based methodology allows to gain a genome-wide picture of the transcriptional interplay, helping to unravel key hierarchical transcriptional interactions, which could be subsequently investigated, and it represents a promising learning approach suitable for multi-layered genomic data integration, given its robustness to noisy sources and its tailored framework for handling high dimensional data.
引用
收藏
页数:28
相关论文
共 50 条
  • [1] A Bayesian data fusion based approach for learning genome-wide transcriptional regulatory networks
    Elisabetta Sauta
    Andrea Demartini
    Francesca Vitali
    Alberto Riva
    Riccardo Bellazzi
    BMC Bioinformatics, 21
  • [2] Data Fusion Approach for Learning Transcriptional Bayesian Networks
    Sauta, Elisabetta
    Demartini, Andrea
    Vitali, Francesca
    Riva, Alberto
    Bellazzi, Riccardo
    ARTIFICIAL INTELLIGENCE IN MEDICINE, AIME 2017, 2017, 10259 : 76 - 80
  • [3] Learning Hierarchical Bayesian Networks for Genome-Wide Association Studies
    Mourad, Raphael
    Sinoquet, Christine
    Leray, Philippe
    COMPSTAT'2010: 19TH INTERNATIONAL CONFERENCE ON COMPUTATIONAL STATISTICS, 2010, : 549 - 556
  • [4] Predicting eukaryotic transcriptional cooperativity by Bayesian network integration of genome-wide data
    Wang, Yong
    Zhang, Xiang-Sun
    Xia, Yu
    NUCLEIC ACIDS RESEARCH, 2009, 37 (18) : 5943 - 5958
  • [5] Identifying Genetic Interactions in Genome-Wide Data Using Bayesian Networks
    Jiang, Xia
    Barmada, M. Michael
    Visweswaran, Shyam
    GENETIC EPIDEMIOLOGY, 2010, 34 (06) : 575 - 581
  • [6] The genome-wide transcriptional regulatory landscape of ecdysone in the silkworm
    Cheng, Dong
    Cheng, Tingcai
    Yang, Xi
    Zhang, Quan
    Fu, Jianfeng
    Feng, Tieshan
    Gong, Jiao
    Xia, Qingyou
    EPIGENETICS & CHROMATIN, 2018, 11
  • [7] The genome-wide transcriptional regulatory landscape of ecdysone in the silkworm
    Dong Cheng
    Tingcai Cheng
    Xi Yang
    Quan Zhang
    Jianfeng Fu
    Tieshan Feng
    Jiao Gong
    Qingyou Xia
    Epigenetics & Chromatin, 11
  • [8] Genome-Wide Measurement of Bacterial Transcriptional Regulatory States
    Freddolino, Peter
    Tavazoie, Saeed
    BIOPHYSICAL JOURNAL, 2013, 104 (02) : 328A - 328A
  • [9] Genome-wide prediction and prioritization of human aging genes by data fusion: a machine learning approach
    Arabfard, Masoud
    Ohadi, Mina
    Tabar, Vahid Rezaei
    Delbari, Ahmad
    Kavousi, Kaveh
    BMC GENOMICS, 2019, 20 (01)
  • [10] Genome-wide prediction and prioritization of human aging genes by data fusion: a machine learning approach
    Masoud Arabfard
    Mina Ohadi
    Vahid Rezaei Tabar
    Ahmad Delbari
    Kaveh Kavousi
    BMC Genomics, 20