A network-based machine-learning framework to identify both functional modules and disease genes

被引:0
|
作者
Kuo Yang
Kezhi Lu
Yang Wu
Jian Yu
Baoyan Liu
Yi Zhao
Jianxin Chen
Xuezhong Zhou
机构
[1] Beijing Jiaotong University,School of Computer and Information Technology, Institute of Medical Intelligence
[2] Tsinghua University,Institute for TCM
[3] Chinese Academy of Sciences,X, MOE Key Laboratory of Bioinformatics / Bioinformatics Division, BNRIST, Department of Automation
[4] Beijing Jiaotong University,Key Laboratory of Intelligent Information Processing, Advanced Computer Research Center, Institute of Computing Technology
[5] China Academy of Chinese Medical Sciences,Beijing Key Laboratory of Traffic Data Analysis and Mining, School of Computer and Information Technology
[6] Beijing University of Chinese Medicine,Data Center of Traditional Chinese Medicine
[7] KU Leuven,imec
来源
Human Genetics | 2021年 / 140卷
关键词
D O I
暂无
中图分类号
学科分类号
摘要
Disease gene identification is a critical step towards uncovering the molecular mechanisms of diseases and systematically investigating complex disease phenotypes. Despite considerable efforts to develop powerful computing methods, candidate gene identification remains a severe challenge owing to the connectivity of an incomplete interactome network, which hampers the discovery of true novel candidate genes. We developed a network-based machine-learning framework to identify both functional modules and disease candidate genes. In this framework, we designed a semi-supervised non-negative matrix factorization model to obtain the functional modules related to the diseases and genes. Of note, we proposed a disease gene-prioritizing method called MapGene that integrates the correlations from both functional modules and network closeness. Our framework identified a set of functional modules with highly functional homogeneity and close gene interactions. Experiments on a large-scale benchmark dataset showed that MapGene performs significantly better than the state-of-the-art algorithms. Further analysis demonstrates MapGene can effectively relieve the impact of the incompleteness of interactome networks and obtain highly reliable rankings of candidate genes. In addition, disease cases on Parkinson’s disease and diabetes mellitus confirmed the generalization of MapGene for novel candidate gene identification. This work proposed, for the first time, an integrated computing framework to predict both functional modules and disease candidate genes. The methodology and results support that our framework has the potential to help discover underlying functional modules and reliable candidate genes in human disease.
引用
收藏
页码:897 / 913
页数:16
相关论文
共 50 条
  • [41] Network-based investigation of genetic modules associated with functional brain networks in schizophrenia
    Lin, Dongdong
    He, Hao
    Li, Jingyao
    Deng, Hong-Wen
    Calhoun, Vince D.
    Wang, Yu-Ping
    2013 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2013,
  • [42] Network-based Classification of Authentication Attempts using Machine Learning
    Taylor, Curtis R.
    Lanson, Julian P.
    2019 INTERNATIONAL CONFERENCE ON COMPUTING, NETWORKING AND COMMUNICATIONS (ICNC), 2019, : 669 - 673
  • [43] STRUCTURE LEARNING IN A BAYESIAN NETWORK-BASED VIDEO INDEXING FRAMEWORK
    Baghdadi, Siwar
    Gravier, Guillaume
    Demarty, Claire-Helene
    Gros, Patrick
    2008 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOLS 1-4, 2008, : 677 - +
  • [44] Neural network-based leaf classification using machine learning
    Palanisamy, Tamilselvi
    Sadayan, Geetha
    Pathinetampadiyan, Nagasankar
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2022, 34 (08):
  • [45] Network-based EDM learning framework in precision manufacturing engineering
    Liang, Janus S.
    PROCEEDINGS OF THE FIFTH INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY: NEW GENERATIONS, 2008, : 1293 - 1294
  • [46] Network-Based Elucidation of Human Disease Similarities Reveals Common Functional Modules Enriched for Pluripotent Drug Targets
    Suthram, Silpa
    Dudley, Joel T.
    Chiang, Annie P.
    Chen, Rong
    Hastie, Trevor J.
    Butte, Atul J.
    PLOS COMPUTATIONAL BIOLOGY, 2010, 6 (02)
  • [47] A network-based feature selection approach to identify metabolic signatures in disease
    Netzer, Michael
    Kugler, Karl G.
    Mueller, Laurin A. J.
    Weinberger, Klaus M.
    Graber, Armin
    Baumgartner, Christian
    Dehmer, Matthias
    JOURNAL OF THEORETICAL BIOLOGY, 2012, 310 : 216 - 222
  • [48] A claims-based, machine-learning algorithm to identify patients with pulmonary arterial hypertension
    Hyde, Bethany
    Paoli, Carly J.
    Panjabi, Sumeet
    Bettencourt, Katherine C.
    Lynum, Karimah S. Bell S.
    Selej, Mona
    PULMONARY CIRCULATION, 2023, 13 (02)
  • [49] A Network-Based Method for Predicting Disease-Causing Genes
    Karni, Shaul
    Soreq, Hermona
    Sharan, Roded
    JOURNAL OF COMPUTATIONAL BIOLOGY, 2009, 16 (02) : 181 - 189
  • [50] GeneSurrounder: network-based identification of disease genes in expression data
    Shah, Sahil D.
    Braun, Rosemary
    BMC BIOINFORMATICS, 2019, 20 (1)