ChromDL: a next-generation regulatory DNA classifier

被引:0
|
作者
Hill, Christopher [1 ,2 ]
Hudaiberdiev, Sanjarbek [1 ]
Ovcharenko, Ivan [1 ]
机构
[1] NIH, Computat Biol Branch, Intramural Res Program, Natl Lib Med, Bethesda, MD 20892 USA
[2] Univ Penn, Sch Engn & Appl Sci, Philadelphia, PA 19104 USA
基金
美国国家卫生研究院;
关键词
GENOME BROWSER; ENCODE DATA; ELEMENTS; ENCYCLOPEDIA; DATABASE; UPDATE;
D O I
10.1093/bioinformatics/btad217
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Predicting the regulatory function of non-coding DNA using only the DNA sequence continues to be a major challenge in genomics. With the advent of improved optimization algorithms, faster GPU speeds, and more intricate machine-learning libraries, hybrid convolutional and recurrent neural network architectures can be constructed and applied to extract crucial information from non-coding DNA.Results: Using a comparative analysis of the performance of thousands of Deep Learning architectures, we developed ChromDL, a neural network architecture combining bidirectional gated recurrent units, convolutional neural networks, and bidirectional long short-term memory units, which significantly improves upon a range of prediction metrics compared to its predecessors in transcription factor binding site, histone modification, and DNase-I hyper-sensitive site detection. Combined with a secondary model, it can be utilized for accurate classification of gene regulatory elements. The model can also detect weak transcription factor binding as compared to previously developed methods and has the potential to help delineate transcription factor binding motif specificities.Availability and implementationThe ChromDL source code can be found at .
引用
收藏
页码:i377 / i385
页数:9
相关论文
共 50 条
  • [1] ChromDL: a next-generation regulatory DNA classifier
    Hill, Christopher
    Hudaiberdiev, Sanjarbek
    Ovcharenko, Ivan
    BIOINFORMATICS, 2023, 39 : I377 - I385
  • [2] Next-generation DNA sequencing
    Shendure, Jay
    Ji, Hanlee
    NATURE BIOTECHNOLOGY, 2008, 26 (10) : 1135 - 1145
  • [3] Next-generation DNA sequencing
    Jay Shendure
    Hanlee Ji
    Nature Biotechnology, 2008, 26 : 1135 - 1145
  • [4] Next-generation regulatory T cell therapy
    Leonardo M. R. Ferreira
    Yannick D. Muller
    Jeffrey A. Bluestone
    Qizhi Tang
    Nature Reviews Drug Discovery, 2019, 18 : 749 - 769
  • [5] Next-generation regulatory T cell therapy
    Ferreira, Leonardo M. R.
    Muller, Yannick D.
    Bluestone, Jeffrey A.
    Tang, Qizhi
    NATURE REVIEWS DRUG DISCOVERY, 2019, 18 (10) : 749 - 769
  • [6] Next-Generation DNA Assembly Tools
    Peng, Lansha
    Tsvetanova, Billyana
    Liang, Xiquan
    Katzen, Federico
    GENETIC ENGINEERING & BIOTECHNOLOGY NEWS, 2010, 30 (18): : 32 - 33
  • [7] Next-generation DNA sequencing methods
    Mardis, Elaine R.
    ANNUAL REVIEW OF GENOMICS AND HUMAN GENETICS, 2008, 9 : 387 - 402
  • [8] Next-generation DNA damage sequencing
    Mingard, Cecile
    Wu, Junzhou
    McKeague, Maureen
    Sturla, Shana J.
    CHEMICAL SOCIETY REVIEWS, 2020, 49 (20) : 7354 - 7377
  • [9] Next-generation DNA assembly tools
    Peng, Lansha
    Tsvetanova, Billyana
    Liang, Xiquan
    Katzen, Federico
    Genetic Engineering and Biotechnology News, 2010, 30 (18):
  • [10] Next-Generation DNA Sequencing Technologies
    Kurekci, Gulsum Kayman
    Dincer, Pervin
    ERCIYES MEDICAL JOURNAL, 2014, 36 (03) : 99 - 103