A network-based machine-learning framework to identify both functional modules and disease genes

被引:0
|
作者
Kuo Yang
Kezhi Lu
Yang Wu
Jian Yu
Baoyan Liu
Yi Zhao
Jianxin Chen
Xuezhong Zhou
机构
[1] Beijing Jiaotong University,School of Computer and Information Technology, Institute of Medical Intelligence
[2] Tsinghua University,Institute for TCM
[3] Chinese Academy of Sciences,X, MOE Key Laboratory of Bioinformatics / Bioinformatics Division, BNRIST, Department of Automation
[4] Beijing Jiaotong University,Key Laboratory of Intelligent Information Processing, Advanced Computer Research Center, Institute of Computing Technology
[5] China Academy of Chinese Medical Sciences,Beijing Key Laboratory of Traffic Data Analysis and Mining, School of Computer and Information Technology
[6] Beijing University of Chinese Medicine,Data Center of Traditional Chinese Medicine
[7] KU Leuven,imec
来源
Human Genetics | 2021年 / 140卷
关键词
D O I
暂无
中图分类号
学科分类号
摘要
Disease gene identification is a critical step towards uncovering the molecular mechanisms of diseases and systematically investigating complex disease phenotypes. Despite considerable efforts to develop powerful computing methods, candidate gene identification remains a severe challenge owing to the connectivity of an incomplete interactome network, which hampers the discovery of true novel candidate genes. We developed a network-based machine-learning framework to identify both functional modules and disease candidate genes. In this framework, we designed a semi-supervised non-negative matrix factorization model to obtain the functional modules related to the diseases and genes. Of note, we proposed a disease gene-prioritizing method called MapGene that integrates the correlations from both functional modules and network closeness. Our framework identified a set of functional modules with highly functional homogeneity and close gene interactions. Experiments on a large-scale benchmark dataset showed that MapGene performs significantly better than the state-of-the-art algorithms. Further analysis demonstrates MapGene can effectively relieve the impact of the incompleteness of interactome networks and obtain highly reliable rankings of candidate genes. In addition, disease cases on Parkinson’s disease and diabetes mellitus confirmed the generalization of MapGene for novel candidate gene identification. This work proposed, for the first time, an integrated computing framework to predict both functional modules and disease candidate genes. The methodology and results support that our framework has the potential to help discover underlying functional modules and reliable candidate genes in human disease.
引用
收藏
页码:897 / 913
页数:16
相关论文
共 50 条
  • [1] A network-based machine-learning framework to identify both functional modules and disease genes
    Yang, Kuo
    Lu, Kezhi
    Wu, Yang
    Yu, Jian
    Liu, Baoyan
    Zhao, Yi
    Chen, Jianxin
    Zhou, Xuezhong
    HUMAN GENETICS, 2021, 140 (06) : 897 - 913
  • [2] Network-Based and Machine-Learning Approaches Identify Diagnostic and Prognostic Models for EMT-Type Gastric Tumors
    Sadeghi, Mehdi
    Karimi, Mohammad Reza
    Karimi, Amir Hossein
    Farshbaf, Nafiseh Ghorbanpour
    Barzegar, Abolfazl
    Schmitz, Ulf
    GENES, 2023, 14 (03)
  • [3] GNN4DM: a graph neural network-based method to identify overlapping functional disease modules
    Gezsi, Andras
    Peter, Antal
    BIOINFORMATICS, 2024, 40 (10)
  • [4] A Bayesian Network-based Machine-learning Algorithm for Enhancing Performance in Typical Scenarios
    Wang Q.
    IEIE Transactions on Smart Processing and Computing, 2023, 12 (03): : 252 - 260
  • [5] Identification of dysfunctional modules and disease genes in congenital heart disease by a network-based approach
    He, Danning
    Liu, Zhi-Ping
    Chen, Luonan
    BMC GENOMICS, 2011, 12
  • [6] Identification of dysfunctional modules and disease genes in congenital heart disease by a network-based approach
    Danning He
    Zhi-Ping Liu
    Luonan Chen
    BMC Genomics, 12
  • [7] NETWORK-BASED GENOME WIDE STUDY OF HIPPOCAMPAL IMAGING PHENOTYPE IN ALZHEIMER'S DISEASE TO IDENTIFY FUNCTIONAL INTERACTION MODULES
    Yao, Xiaohui
    Yan, Jingwen
    Risacher, Shannon
    Moore, Jason
    Saykin, Andrew
    Shen, Li
    2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 6170 - 6174
  • [8] A Machine-Learning Framework to Identify Distinct Phenotypes of Aortic Stenosis Severity
    Sengupta, Partho P.
    Shrestha, Sirish
    Kagiyama, Nobuyuki
    Hamirani, Yasmin
    Kulkarni, Hemant
    Yanamala, Naveena
    Bing, Rong
    Chin, Calvin W. L.
    Pawade, Tania A.
    Messika-Zeitoun, David
    Tastet, Lionel
    Shen, Mylene
    Newby, David E.
    Clavel, Marie-Annick
    Pibarot, Phillippe
    Dweck, Marc R.
    JACC-CARDIOVASCULAR IMAGING, 2021, 14 (09) : 1707 - 1720
  • [9] Functional characterization of drought-responsive modules and genes in Oryza sativa: a network-based approach
    Sircar, Sanchari
    Parekh, Nita
    FRONTIERS IN GENETICS, 2015, 6
  • [10] Identify Inflammatory Bowel Disease-Related Genes Based on Machine Learning
    Ye, Lili
    Lin, Yongwei
    Fan, Xing-di
    Chen, Yaoming
    Deng, Zengli
    Yang, Qian
    Lei, Xiaotian
    Mao, Jizong
    Cui, Chunhui
    FRONTIERS IN CELL AND DEVELOPMENTAL BIOLOGY, 2021, 9