Identifying Candidate Disease Genes in Multilayer Heterogeneous Biological Networks

被引:0
|
作者
Ding C.-F. [1 ]
Wang J. [1 ]
Zhang Z.-Y. [1 ]
机构
[1] College of Mathematics and Computer Science, Yan'an University, Yan'an
来源
基金
中国国家自然科学基金;
关键词
biased random walk; biological network; candidate gene identification; Multilayer heterogeneous network;
D O I
10.16383/j.aas.c210577
中图分类号
学科分类号
摘要
Most of existing random walk methods to identify candidate disease genes preferentially visit highly-connected genes, while unwell-known or poorly-connected genes probably relevant to known diseases are more easily ignored or complicated to identify. Moreover, these methods access only a single gene network or an aggregated network of various gene data, leading to bias and incompleteness. Therefore, it is a pressing challenge for controlling the motion direction of random walk and for integrating multiple data sources involving different information for disease-gene identification. To this end, we first construct a multilayer network and multilayer heterogeneous genetic network. Then, we propose a topologically biased random walk with restart (BRWR) algorithm applicable to multilayer and multilayer heterogeneous networks for the identification of candidate disease genes. Experimental results show that the BRWR algorithm to identify candidate disease genes outperforms the state-of-the-art ones on different types of networks. Finally, the BRWR algorithm on multilayer heterogeneous networks is used to predict disease genes implicated in the undiagnosed neonatal progeroid syndrome. © 2024 Science Press. All rights reserved.
引用
收藏
页码:1246 / 1260
页数:14
相关论文
共 59 条
  • [41] Aerts S, Lambrechts D, Maity S, Van Loo P, Coessens B, De Smet F, Et al., Gene prioritization through genomic data fusion, Nature Biotechnology, 24, 5, (2006)
  • [42] Pinero J, Bravo A, Queralt-Rosinach N, Gutierrez-Sacristan A, Deu-Pons J, Centeno E, Et al., DisGeNET: A comprehensive platform integrating information on human disease-associated genes and variants, Nucleic Acids Research, 45, D1, pp. D833-D839, (2017)
  • [43] Hanley J A, McNeil B J., The meaning and use of the area under a receiver operating characteristic (ROC) curve, Radiology, 143, 1, (1982)
  • [44] Mordelet F, Vert J P., ProDiGe: Prioritization of disease genes with multitask machine learning from positive and unlabeled examples, BMC Bioinformatics, 12, (2011)
  • [45] Wu X B, Jiang R, Zhang M Q, Li S., Network-based global inference of human disease genes, Molecular Systems Biology, 4, (2008)
  • [46] Chen X, Liu M X, Yan G Y., Drug-target interaction prediction by random walk on the heterogeneous network, Molecular BioSystems, 8, 7, pp. 1970-1978, (2012)
  • [47] Blatti C, Sinha S., Characterizing gene sets using discriminative random walks with restart on heterogeneous biological networks, Bioinformatics, 32, 14, pp. 2167-2175, (2016)
  • [48] De Domenico M, Sole-Ribalta A, Gomez S, Arenas A., Navigability of interconnected networks under random failures, Proceedings of the National Academy of Sciences of the United States of America, 111, 23, pp. 8351-8356, (2014)
  • [49] Pivnick E K, Angle B, Kaufman R A, Hall B D, Pitukcheewanont P, Hersh J H, Et al., Neonatal progeroid (Wiedemann-Rautenstrauch) syndrome: Report of five new cases and review, American Journal of Medical Genetics, 90, 2, (2000)
  • [50] Kiraz A, Ozen S, Tubas F, Usta Y, Aldemir O, Alanay Y., Wiedemann-Rautenstrauch syndrome: Report of a variant case, American Journal of Medical Genetics Part A, 158A, 6, pp. 1434-1436, (2012)