Identification of Functional Modules by Integration of Multiple Data Sources Using a Bayesian Network Classifier

被引:3
|
作者
Wang, Jinlian [1 ]
Zuo, Yiming [1 ,2 ]
Liu, Lun [3 ]
Man, Yangao [4 ]
Tadesse, Mahlet G. [5 ]
Ressom, Habtom W. [1 ]
机构
[1] Georgetown Univ, Med Ctr, Lombardi Comprehens Canc Ctr, Washington, DC 20057 USA
[2] Virginia Polytech Inst & State Univ, Dept Elect & Comp Engn, Arlington, VA USA
[3] Beijing Acad Agr & Forestry Sci, Beijing Res Ctr Informat Technol, Beijing, Peoples R China
[4] Henry Jackson Fdn, Diagnost & Translat Res Ctr, Gaithersburg, MD USA
[5] Georgetown Univ, Dept Math & Stat, Washington, DC 20057 USA
关键词
genomics; systems biology; models; statistical; computational biology; gene expression; genetics; protein interaction domains and motifs; PROTEIN-PROTEIN INTERACTIONS; GENE NETWORKS; DOMAIN-DOMAIN; PATHWAYS;
D O I
10.1161/CIRCGENETICS.113.000087
中图分类号
R5 [内科学];
学科分类号
1002 ; 100201 ;
摘要
Background- Prediction of functional modules is indispensable for detecting protein deregulation in human complex diseases such as cancer. Bayesian network is one of the most commonly used models to integrate heterogeneous data from multiple sources such as protein domain, interactome, functional annotation, genome-wide gene expression, and the literature. Methods and Results- In this article, we present a Bayesian network classifier that is customized to (1) increase the ability to integrate diverse information from different sources, (2) effectively predict protein-protein interactions, (3) infer aberrant networks with scale-free and small-world properties, and (4) group molecules into functional modules or pathways based on the primary function and biological features. Application of this model in discovering protein biomarkers of hepatocellular carcinoma leads to the identification of functional modules that provide insights into the mechanism of the development and progression of hepatocellular carcinoma. These functional modules include cell cycle deregulation, increased angiogenesis (eg, vascular endothelial growth factor, blood vessel morphogenesis), oxidative metabolic alterations, and aberrant activation of signaling pathways involved in cellular proliferation, survival, and differentiation. Conclusions- The discoveries and conclusions derived from our customized Bayesian network classifier are consistent with previously published results. The proposed approach for determining Bayesian network structure facilitates the integration of heterogeneous data from multiple sources to elucidate the mechanisms of complex diseases.
引用
收藏
页码:206 / 217
页数:12
相关论文
共 50 条
  • [21] miniTUBA: medical inference by network integration of temporal data using Bayesian analysis
    Xiang, Zuoshuang
    Minter, Rebecca M.
    Bi, Xiaoming
    Woolf, Peter J.
    He, Yongqun
    BIOINFORMATICS, 2007, 23 (18) : 2423 - 2432
  • [22] INTEGRATION OF MULTIPLE DATA SOURCES IN IMMIGRANT STUDIES
    TIENDA, M
    SULLIVAN, T
    POPULATION INDEX, 1984, 50 (03) : 413 - 414
  • [23] INTEGRATION OF MULTIPLE DATA SOURCES IN IMMIGRANT STUDIES
    SULLIVAN, TA
    TIENDA, M
    REVIEW OF PUBLIC DATA USE, 1984, 12 (04): : 233 - 244
  • [24] Network anomaly identification using supervised classifier
    1600, Slovene Society Informatika (37):
  • [25] Deep neural network classifier for multidimensional functional data
    Wang, Shuoyang
    Cao, Guanqun
    Shang, Zuofeng
    SCANDINAVIAN JOURNAL OF STATISTICS, 2023, 50 (04) : 1667 - 1686
  • [26] Network Anomaly Identification using Supervised Classifier
    Gogoi, Prasanta
    Borah, B.
    Bhattacharyya, D. K.
    INFORMATICA-JOURNAL OF COMPUTING AND INFORMATICS, 2013, 37 (01): : 93 - 106
  • [27] Towards the identification of protein complexes and functional modules by integrating PPI network and gene expression data
    Min Li
    Xuehong Wu
    Jianxin Wang
    Yi Pan
    BMC Bioinformatics, 13
  • [28] Towards the identification of protein complexes and functional modules by integrating PPI network and gene expression data
    Li, Min
    Wu, Xuehong
    Wang, Jianxin
    Pan, Yi
    BMC BIOINFORMATICS, 2012, 13
  • [29] NEURAL NETWORK FOR THE IDENTIFICATION OF A FUNCTIONAL DEPENDENCE USING DATA PRESELECTION
    Hlavac, V
    NEURAL NETWORK WORLD, 2021, 31 (02) : 109 - 124
  • [30] Diagnostics and prognostics of multi-mode failure scenarios in miter gates using multiple data sources and a dynamic Bayesian network
    Wu, Zihan
    Fillmore, Travis B.
    Vega, Manuel A.
    Hu, Zhen
    Todd, Michael D.
    STRUCTURAL AND MULTIDISCIPLINARY OPTIMIZATION, 2022, 65 (09)