An unsupervised deep learning framework for predicting human essential genes from population and functional genomic data

Cited by: 4
Authors
Lapolice, Troy M. [1, 2, 3]
Huang, Yi-Fei [1, 3]
Affiliations
[1] Penn State Univ, Dept Biol, University Pk, PA 16802 USA
[2] Penn State Univ, Bioinformat & Genom Grad Program, University Pk, PA 16802 USA
[3] Penn State Univ, Huck Inst Life Sci, University Pk, PA 16802 USA
Keywords
Deep Learning; Unsupervised; Essential Genes; Loss of Function Intolerance; Population Genomics; Functional Genomics; Variants; Etiology
DOI
10.1186/s12859-023-05481-z
Chinese Library Classification (CLC) number
Q5 [Biochemistry]
Discipline classification codes
071010; 081704
Abstract
Background: The ability to accurately predict essential genes intolerant to loss-of-function (LOF) mutations can dramatically improve the identification of disease-associated genes. Recently, there have been numerous computational methods developed to predict human essential genes from population genomic data. While the existing methods are highly predictive of essential genes of long length, they have limited power in pinpointing short essential genes due to the sparsity of polymorphisms in the human genome.
Results: Motivated by the premise that population and functional genomic data may provide complementary evidence for gene essentiality, here we present an evolution-based deep learning model, DeepLOF, to predict essential genes in an unsupervised manner. Unlike previous population genetic methods, DeepLOF utilizes a novel deep learning framework to integrate both population and functional genomic data, allowing us to pinpoint short essential genes that can hardly be predicted from population genomic data alone. Compared with previous methods, DeepLOF shows unmatched performance in predicting ClinGen haploinsufficient genes, mouse essential genes, and essential genes in human cell lines. Notably, at a false positive rate of 5%, DeepLOF detects 50% more ClinGen haploinsufficient genes than previous methods. Furthermore, DeepLOF discovers 109 novel essential genes that are too short to be identified by previous methods.
Conclusion: The predictive power of DeepLOF shows that it is a compelling computational method to aid in the discovery of essential genes.
Pages: 21