An unsupervised deep learning framework for predicting human essential genes from population and functional genomic data

Authors
LaPolice, Troy M. [1, 2, 3]
Huang, Yi-Fei [1, 3]
Affiliations
[1] Penn State Univ, Dept Biol, University Pk, PA 16802 USA
[2] Penn State Univ, Bioinformat & Genom Grad Program, University Pk, PA 16802 USA
[3] Penn State Univ, Huck Inst Life Sci, University Pk, PA 16802 USA
Keywords
Deep Learning; Unsupervised; Essential Genes; Loss of Function Intolerance; Population Genomics; Functional Genomics; Variants; Etiology
DOI
10.1186/s12859-023-05481-z
Chinese Library Classification
Q5 [Biochemistry]
Discipline classification codes
071010; 081704
Abstract
Background: The ability to accurately predict essential genes intolerant to loss-of-function (LOF) mutations can dramatically improve the identification of disease-associated genes. Numerous computational methods have recently been developed to predict human essential genes from population genomic data. While the existing methods are highly predictive of long essential genes, they have limited power to pinpoint short essential genes because polymorphisms are sparse in the human genome.

Results: Motivated by the premise that population and functional genomic data may provide complementary evidence for gene essentiality, we present an evolution-based deep learning model, DeepLOF, to predict essential genes in an unsupervised manner. Unlike previous population genetic methods, DeepLOF uses a novel deep learning framework to integrate both population and functional genomic data, allowing it to pinpoint short essential genes that can hardly be predicted from population genomic data alone. Compared with previous methods, DeepLOF shows unmatched performance in predicting ClinGen haploinsufficient genes, mouse essential genes, and essential genes in human cell lines. Notably, at a false positive rate of 5%, DeepLOF detects 50% more ClinGen haploinsufficient genes than previous methods. Furthermore, DeepLOF discovers 109 novel essential genes that are too short to be identified by previous methods.

Conclusion: The predictive power of DeepLOF shows that it is a compelling computational method to aid in the discovery of essential genes.
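
The Results figure quoted "at a false positive rate of 5%" is a sensitivity measured at a fixed false positive rate. The Python sketch below illustrates that calculation only; it is not the authors' evaluation code. The function name `sensitivity_at_fpr`, the convention that a higher score means "more likely essential", and the toy scores and labels are all assumptions made for illustration.

```python
def sensitivity_at_fpr(scores, labels, max_fpr=0.05):
    """Fraction of true positives recovered while calling at most
    `max_fpr` of the negatives (hypothetical helper, not from the paper).

    scores: per-gene essentiality scores, higher = more likely essential (assumed)
    labels: truthy for known essential genes, falsy for control genes
    """
    pos = [s for s, y in zip(scores, labels) if y]
    neg = sorted((s for s, y in zip(scores, labels) if not y), reverse=True)
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative gene")

    # Number of negatives we may call without exceeding the FPR budget.
    allowed = int(max_fpr * len(neg))

    # Genes scoring strictly above this threshold are called essential, so
    # at most `allowed` negatives are called, even when scores are tied.
    threshold = neg[allowed] if allowed < len(neg) else float("-inf")

    return sum(s > threshold for s in pos) / len(pos)


# Toy data, made up for illustration: four positives and four negatives.
toy_scores = [0.9, 0.8, 0.7, 0.6, 0.5, 0.4, 0.3, 0.2]
toy_labels = [1, 1, 0, 1, 0, 1, 0, 0]
print(sensitivity_at_fpr(toy_scores, toy_labels, max_fpr=0.25))  # 0.75
```

Comparing two predictors at the same `max_fpr` on the same labeled gene set gives the kind of relative gain the abstract reports, under the assumption that both methods are scored against identical positives and negatives.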
Pages: 21
Journal: BMC Bioinformatics, 24