A network-based machine-learning framework to identify both functional modules and disease genes

被引:0
|
作者
Kuo Yang
Kezhi Lu
Yang Wu
Jian Yu
Baoyan Liu
Yi Zhao
Jianxin Chen
Xuezhong Zhou
机构
[1] Beijing Jiaotong University,School of Computer and Information Technology, Institute of Medical Intelligence
[2] Tsinghua University,Institute for TCM
[3] Chinese Academy of Sciences,X, MOE Key Laboratory of Bioinformatics / Bioinformatics Division, BNRIST, Department of Automation
[4] Beijing Jiaotong University,Key Laboratory of Intelligent Information Processing, Advanced Computer Research Center, Institute of Computing Technology
[5] China Academy of Chinese Medical Sciences,Beijing Key Laboratory of Traffic Data Analysis and Mining, School of Computer and Information Technology
[6] Beijing University of Chinese Medicine,Data Center of Traditional Chinese Medicine
[7] KU Leuven,imec
来源
Human Genetics | 2021年 / 140卷
关键词
D O I
暂无
中图分类号
学科分类号
摘要
Disease gene identification is a critical step towards uncovering the molecular mechanisms of diseases and systematically investigating complex disease phenotypes. Despite considerable efforts to develop powerful computing methods, candidate gene identification remains a severe challenge owing to the connectivity of an incomplete interactome network, which hampers the discovery of true novel candidate genes. We developed a network-based machine-learning framework to identify both functional modules and disease candidate genes. In this framework, we designed a semi-supervised non-negative matrix factorization model to obtain the functional modules related to the diseases and genes. Of note, we proposed a disease gene-prioritizing method called MapGene that integrates the correlations from both functional modules and network closeness. Our framework identified a set of functional modules with highly functional homogeneity and close gene interactions. Experiments on a large-scale benchmark dataset showed that MapGene performs significantly better than the state-of-the-art algorithms. Further analysis demonstrates MapGene can effectively relieve the impact of the incompleteness of interactome networks and obtain highly reliable rankings of candidate genes. In addition, disease cases on Parkinson’s disease and diabetes mellitus confirmed the generalization of MapGene for novel candidate gene identification. This work proposed, for the first time, an integrated computing framework to predict both functional modules and disease candidate genes. The methodology and results support that our framework has the potential to help discover underlying functional modules and reliable candidate genes in human disease.
引用
收藏
页码:897 / 913
页数:16
相关论文
共 50 条
  • [31] An integrative network-based approach to identify novel disease genes and pathways: a case study in the context of inflammatory bowel disease
    Ryohei Eguchi
    Mohammand Bozlul Karim
    Pingzhao Hu
    Tetsuo Sato
    Naoaki Ono
    Shigehiko Kanaya
    Md. Altaf-Ul-Amin
    BMC Bioinformatics, 19
  • [32] A network-based approach to identify disease-associated gene modules through integrating DNA methylation and gene expression
    Zhang, Yuanyuan
    Zhang, Junying
    Liu, Zhaowen
    Liu, Yajun
    Tuo, Shouheng
    BIOCHEMICAL AND BIOPHYSICAL RESEARCH COMMUNICATIONS, 2015, 465 (03) : 437 - 442
  • [33] On the improvement of the extrapolation capability of an iterative machine-learning based RANS Framework
    Liu, Weishuo
    Fang, Jian
    Rolfo, Stefano
    Moulinec, Charles
    Emerson, David R.
    COMPUTERS & FLUIDS, 2023, 256
  • [34] A Machine-Learning Framework to Improve Wi-Fi Based Indoorpositioning
    Pichaimani, Venkateswari
    Manjula, K. R.
    INTELLIGENT AUTOMATION AND SOFT COMPUTING, 2022, 33 (01): : 383 - 397
  • [35] Identification of marker genes in Alzheimer's disease using a machine-learning model
    Madar, Inamul Hasan
    Sultan, Ghazala
    Tayubi, Iftikhar Aslam
    Hasan, Atif Noorul
    Pahi, Bandana
    Rai, Anjali
    Sivanandan, Pravitha Kasu
    Loganathan, Tamizhini
    Begum, Mahamuda
    Rai, Sneha
    BIOINFORMATION, 2021, 17 (02) : 348 - 355
  • [36] Transcriptome network-based method to identify genes associated with unruptured intracranial aneurysms
    Wei, L.
    Gao, Y. J.
    Wei, S. P.
    Zhang, Y. F.
    Zhang, W. F.
    Jiang, J. X.
    Sun, Z. Y.
    Xu, W.
    GENETICS AND MOLECULAR RESEARCH, 2013, 12 (03) : 3263 - 3273
  • [37] A graph neural network-based framework to identify flow phenomena on unstructured meshes
    Wang, L.
    Fournier, Y.
    Wald, J. F.
    Mesri, Y.
    PHYSICS OF FLUIDS, 2023, 35 (07)
  • [38] A Heterogeneous Graph Convolutional Network-Based Deep Learning Model to Identify miRNA-Disease Association
    Che, Zicheng
    Peng, Wei
    Dai, Wei
    Wei, Shoulin
    Lan, Wei
    BIOINFORMATICS RESEARCH AND APPLICATIONS, ISBRA 2021, 2021, 13064 : 130 - 141
  • [39] A machine-learning based objective measure for ALS disease severity
    Fernando G. Vieira
    Subhashini Venugopalan
    Alan S. Premasiri
    Maeve McNally
    Aren Jansen
    Kevin McCloskey
    Michael P. Brenner
    Steven Perrin
    npj Digital Medicine, 5
  • [40] A machine-learning based objective measure for ALS disease severity
    Vieira, Fernando G.
    Venugopalan, Subhashini
    Premasiri, Alan S.
    McNally, Maeve
    Jansen, Aren
    McCloskey, Kevin
    Brenner, Michael P.
    Perrin, Steven
    NPJ DIGITAL MEDICINE, 2022, 5 (01)