Identification of Functional Modules by Integration of Multiple Data Sources Using a Bayesian Network Classifier

被引:3
|
作者
Wang, Jinlian [1 ]
Zuo, Yiming [1 ,2 ]
Liu, Lun [3 ]
Man, Yangao [4 ]
Tadesse, Mahlet G. [5 ]
Ressom, Habtom W. [1 ]
机构
[1] Georgetown Univ, Med Ctr, Lombardi Comprehens Canc Ctr, Washington, DC 20057 USA
[2] Virginia Polytech Inst & State Univ, Dept Elect & Comp Engn, Arlington, VA USA
[3] Beijing Acad Agr & Forestry Sci, Beijing Res Ctr Informat Technol, Beijing, Peoples R China
[4] Henry Jackson Fdn, Diagnost & Translat Res Ctr, Gaithersburg, MD USA
[5] Georgetown Univ, Dept Math & Stat, Washington, DC 20057 USA
关键词
genomics; systems biology; models; statistical; computational biology; gene expression; genetics; protein interaction domains and motifs; PROTEIN-PROTEIN INTERACTIONS; GENE NETWORKS; DOMAIN-DOMAIN; PATHWAYS;
D O I
10.1161/CIRCGENETICS.113.000087
中图分类号
R5 [内科学];
学科分类号
1002 ; 100201 ;
摘要
Background- Prediction of functional modules is indispensable for detecting protein deregulation in human complex diseases such as cancer. Bayesian network is one of the most commonly used models to integrate heterogeneous data from multiple sources such as protein domain, interactome, functional annotation, genome-wide gene expression, and the literature. Methods and Results- In this article, we present a Bayesian network classifier that is customized to (1) increase the ability to integrate diverse information from different sources, (2) effectively predict protein-protein interactions, (3) infer aberrant networks with scale-free and small-world properties, and (4) group molecules into functional modules or pathways based on the primary function and biological features. Application of this model in discovering protein biomarkers of hepatocellular carcinoma leads to the identification of functional modules that provide insights into the mechanism of the development and progression of hepatocellular carcinoma. These functional modules include cell cycle deregulation, increased angiogenesis (eg, vascular endothelial growth factor, blood vessel morphogenesis), oxidative metabolic alterations, and aberrant activation of signaling pathways involved in cellular proliferation, survival, and differentiation. Conclusions- The discoveries and conclusions derived from our customized Bayesian network classifier are consistent with previously published results. The proposed approach for determining Bayesian network structure facilitates the integration of heterogeneous data from multiple sources to elucidate the mechanisms of complex diseases.
引用
收藏
页码:206 / 217
页数:12
相关论文
共 50 条
  • [31] Diagnostics and prognostics of multi-mode failure scenarios in miter gates using multiple data sources and a dynamic Bayesian network
    Zihan Wu
    Travis B. Fillmore
    Manuel A. Vega
    Zhen Hu
    Michael D. Todd
    Structural and Multidisciplinary Optimization, 2022, 65
  • [32] Integration of Epigenetic Data in Bayesian Network Modeling of Gene Regulatory Network
    Zheng, Jie
    Chaturvedi, Iti
    Rajapakse, Jagath C.
    PATTERN RECOGNITION IN BIOINFORMATICS, 2011, 7036 : 87 - 96
  • [33] Sales Forecasting using Data warehouse and Naive Bayesian classifier
    Katkar, Vijay
    Gangopadhyay, Surupendu Prakash
    Rathod, Sagar
    Shetty, Aakash
    2015 INTERNATIONAL CONFERENCE ON PERVASIVE COMPUTING (ICPC), 2015,
  • [34] Epidemiological cluster identification using multiple data sources: an approach using logistic regression
    Susvitasari, Kurnia
    Tupper, Paul F.
    Cancino-Munos, Irving
    Lopez, Mariana G.
    Comas, Inaki
    Colijn, Caroline
    MICROBIAL GENOMICS, 2023, 9 (03):
  • [35] IoT streaming data integration from multiple sources
    Doan Quang Tu
    A. S. M. Kayes
    Wenny Rahayu
    Kinh Nguyen
    Computing, 2020, 102 : 2299 - 2329
  • [36] An ontology for the integration of multiple genetic disorder data sources
    Gong, P.
    Qu, W.
    Feng, D. D.
    2005 27TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY, VOLS 1-7, 2005, : 2824 - 2827
  • [37] Identification of functional modules in a PPI network by clique percolation clustering
    Zhang, Shihua
    Ning, Xuemei
    Zhang, Xiang-Sun
    COMPUTATIONAL BIOLOGY AND CHEMISTRY, 2006, 30 (06) : 445 - 451
  • [38] IoT streaming data integration from multiple sources
    Tu, Doan Quang
    Kayes, A. S. M.
    Rahayu, Wenny
    Nguyen, Kinh
    COMPUTING, 2020, 102 (10) : 2299 - 2329
  • [39] A Bayesian Approach to Discovering Truth from Conflicting Sources for Data Integration
    Zhao, Bo
    Rubinstein, Benjamin I. P.
    Gemmell, Jim
    Han, Jiawei
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2012, 5 (06): : 550 - 561
  • [40] A graph-based integrative method of detecting consistent protein functional modules from multiple data sources
    Zhang, Yuan
    Cheng, Yue
    Ge, Liang
    Du, Nan
    Jia, Kebin
    Zhang, Aidong
    INTERNATIONAL JOURNAL OF DATA MINING AND BIOINFORMATICS, 2015, 13 (02) : 122 - 140