A network-based machine-learning framework to identify both functional modules and disease genes

被引:0
|
作者
Kuo Yang
Kezhi Lu
Yang Wu
Jian Yu
Baoyan Liu
Yi Zhao
Jianxin Chen
Xuezhong Zhou
机构
[1] Beijing Jiaotong University,School of Computer and Information Technology, Institute of Medical Intelligence
[2] Tsinghua University,Institute for TCM
[3] Chinese Academy of Sciences,X, MOE Key Laboratory of Bioinformatics / Bioinformatics Division, BNRIST, Department of Automation
[4] Beijing Jiaotong University,Key Laboratory of Intelligent Information Processing, Advanced Computer Research Center, Institute of Computing Technology
[5] China Academy of Chinese Medical Sciences,Beijing Key Laboratory of Traffic Data Analysis and Mining, School of Computer and Information Technology
[6] Beijing University of Chinese Medicine,Data Center of Traditional Chinese Medicine
[7] KU Leuven,imec
来源
Human Genetics | 2021年 / 140卷
关键词
D O I
暂无
中图分类号
学科分类号
摘要
Disease gene identification is a critical step towards uncovering the molecular mechanisms of diseases and systematically investigating complex disease phenotypes. Despite considerable efforts to develop powerful computing methods, candidate gene identification remains a severe challenge owing to the connectivity of an incomplete interactome network, which hampers the discovery of true novel candidate genes. We developed a network-based machine-learning framework to identify both functional modules and disease candidate genes. In this framework, we designed a semi-supervised non-negative matrix factorization model to obtain the functional modules related to the diseases and genes. Of note, we proposed a disease gene-prioritizing method called MapGene that integrates the correlations from both functional modules and network closeness. Our framework identified a set of functional modules with highly functional homogeneity and close gene interactions. Experiments on a large-scale benchmark dataset showed that MapGene performs significantly better than the state-of-the-art algorithms. Further analysis demonstrates MapGene can effectively relieve the impact of the incompleteness of interactome networks and obtain highly reliable rankings of candidate genes. In addition, disease cases on Parkinson’s disease and diabetes mellitus confirmed the generalization of MapGene for novel candidate gene identification. This work proposed, for the first time, an integrated computing framework to predict both functional modules and disease candidate genes. The methodology and results support that our framework has the potential to help discover underlying functional modules and reliable candidate genes in human disease.
引用
收藏
页码:897 / 913
页数:16
相关论文
共 50 条
  • [21] Optimizing network sensors using unsupervised machine-learning approach to identify a pollutant source
    Alaoui, Sidi Mohammed
    Djemal, Khalifa
    Gooya, Ehsan Sedgh
    Feiz, Amir Ali
    Alfalou, Ayman
    Ngae, Pierre
    PATTERN RECOGNITION AND PREDICTION XXXV, 2024, 13040
  • [22] Microarray and network-based identification of functional modules and pathways of active tuberculosis
    Bian, Zhong-Rui
    Yin, Juan
    Sun, Wen
    Lin, Dian-Jie
    MICROBIAL PATHOGENESIS, 2017, 105 : 68 - 73
  • [23] Machine-learning identifies Parkinson's disease patients based on resting-state between-network functional connectivity
    Rubbert, Christian
    Mathys, Christian
    Jockwitz, Christiane
    Hartmann, Christian J.
    Eickhoff, Simon B.
    Hoffstaedter, Felix
    Caspers, Svenja
    Eickhoff, Claudia R.
    Sigl, Benjamin
    Teichert, Nikolas A.
    Suedmeyer, Martin
    Turowski, Bernd
    Schnitzler, Alfons
    Caspers, Julian
    BRITISH JOURNAL OF RADIOLOGY, 2019, 92 (1101):
  • [24] Generative Adversarial Network-based Deep Learning Framework for Cardiovascular Disease Risk Prediction
    Bhagawati, Mrinalini
    Paul, Sudip
    2024 5TH INTERNATIONAL CONFERENCE ON INNOVATIVE TRENDS IN INFORMATION TECHNOLOGY, ICITIIT 2024, 2024,
  • [25] A network-based approach to identify expression modules underlying rejection in pediatric liver transplantation
    Ningappa, Mylarappa
    Rahman, Syed A.
    Higgs, Brandon W.
    Ashokkumar, Chethan S.
    Sahni, Nidhi
    Sindhi, Rakesh
    Das, Jishnu
    CELL REPORTS MEDICINE, 2022, 3 (04)
  • [26] Role of Centrality in Network-Based Prioritization of Disease Genes
    Erten, Sinan
    Koyuturk, Mehmet
    EVOLUTIONARY COMPUTATION, MACHINE LEARNING AND DATA MINING IN BIOINFORMATICS, PROCEEDINGS, 2010, 6023 : 13 - 25
  • [27] Network-based prediction and knowledge mining of disease genes
    Carson, Matthew B.
    Lu, Hui
    BMC MEDICAL GENOMICS, 2015, 8
  • [28] Network-based global inference of human disease genes
    Wu, Xuebing
    Jiang, Rui
    Zhang, Michael Q.
    Li, Shao
    MOLECULAR SYSTEMS BIOLOGY, 2008, 4 (1)
  • [29] Network-based prediction and knowledge mining of disease genes
    Matthew B Carson
    Hui Lu
    BMC Medical Genomics, 8
  • [30] An integrative network-based approach to identify novel disease genes and pathways: a case study in the context of inflammatory bowel disease
    Eguchi, Ryohei
    Karim, Mohammand Bozlul
    Hu, Pingzhao
    Sato, Tetsuo
    Ono, Naoaki
    Kanaya, Shigehiko
    Altaf-Ul-Amin, Md.
    BMC BIOINFORMATICS, 2018, 19