High-resolution transcription factor binding sites prediction improved performance and interpretability by deep learning method

被引:19
|
作者
Zhang, Yongqing [1 ]
Wang, Zixuan [2 ]
Zeng, Yuanqi [2 ]
Zhou, Jiliu [1 ]
Zou, Quan [3 ]
机构
[1] Chengdu Univ Informat Technol, Sch Comp Sci, Chengdu 610225, Peoples R China
[2] Chengdu Univ Informat Technol, Comp Sci, Chengdu, Peoples R China
[3] Univ Elect Sci & Technol China, Chengdu, Peoples R China
基金
中国博士后科学基金; 中国国家自然科学基金;
关键词
transcription factor binding sites; Attention Gate; interpretability; motif discovery; DNA;
D O I
10.1093/bib/bbab273
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Transcription factors (TFs) are essential proteins in regulating the spatiotemporal expression of genes. It is crucial to infer the potential transcription factor binding sites (TFBSs) with high resolution to promote biology and realize precision medicine. Recently, deep learning-based models have shown exemplary performance in the prediction of TFBSs at the base-pair level. However, the previous models fail to integrate nucleotide position information and semantic information without noisy responses. Thus, there is still room for improvement. Moreover, both the inner mechanism and prediction results of these models are challenging to interpret. To this end, the Deep Attentive Encoder-Decoder Neural Network (D-AEDNet) is developed to identify the location of TFs-DNA binding sites in DNA sequences. In particular, our model adopts Skip Architecture to leverage the nucleotide position information in the encoder and removes noisy responses in the information fusion process by Attention Gate. Simultaneously, the Transcription Factor Motif Discovery based on Sliding Window (TF-MoDSW), an approach to discover TFs-DNA binding motifs by utilizing the output of neural networks, is proposed to understand the biological meaning of the predicted result. On ChIP-exo datasets, experimental results show that D-AEDNet has better performance than competing methods. Besides, we authenticate that Attention Gate can improve the interpretability of our model by ways of visualization analysis. Furthermore, we confirm that ability of D-AEDNet to learn TFs-DNA binding motifs outperform the state-of-the-art methods and availability of TF-MoDSW to discover biological sequence motifs in TFs-DNA interaction by conducting experiment on ChIP-seq datasets.
引用
收藏
页数:12
相关论文
共 50 条
  • [21] A blind deconvolution approach to high-resolution mapping of transcription factor binding sites from ChIP-seq data
    Desmond S Lun
    Ashley Sherrid
    Brian Weiner
    David R Sherman
    James E Galagan
    Genome Biology, 10
  • [22] METHOD FOR INCREASING INTERPRETABILITY OF HIGH-RESOLUTION ELECTRON-MICROGRAPHS
    BLOCK, H
    LASCH, R
    BOSECK, S
    EUROPEAN JOURNAL OF CELL BIOLOGY, 1987, 44 : 7 - 7
  • [23] Predicting splicing patterns from the transcription factor binding sites in the promoter with deep learning
    Lin, Tzu-Chieh
    Tsai, Cheng-Hung
    Shiau, Cheng-Kai
    Huang, Jia-Hsin
    Tsai, Huai-Kuang
    BMC GENOMICS, 2024, 25 (SUPPL 3):
  • [24] Comprehensive, high-resolution binding energy landscapes reveal context dependencies of transcription factor binding
    Le, Daniel D.
    Shimko, Tyler C.
    Aditham, Arjun K.
    Keys, Allison M.
    Longwell, Scott A.
    Orenstein, Yaron
    Fordyce, Polly M.
    PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2018, 115 (16) : E3702 - E3711
  • [25] A deep learning model for predicting transcription factor binding location at Single Nucleotide Resolution
    Salekin, Sirajul
    Zhang, Jianqiu
    Huang, Yufei
    2017 IEEE EMBS INTERNATIONAL CONFERENCE ON BIOMEDICAL & HEALTH INFORMATICS (BHI), 2017, : 57 - 60
  • [26] Recognition of transcription factor binding sites by the SiteGA method
    Levitskii V.G.
    Ignat'eva E.V.
    Anan'ko E.A.
    Merkulova T.I.
    Kolchanov N.A.
    Hodgman C.
    Biophysics, 2006, 51 (4) : 565 - 570
  • [27] A subspace method for the detection of transcription factor binding sites
    Pairo, Erola
    Maynou, Joan
    Marco, Santiago
    Perera, Alexandre
    BIOINFORMATICS, 2012, 28 (10) : 1328 - 1335
  • [28] Method SiteGA for the recognition of transcription factor binding sites
    Levitsky, V. G.
    Ignatieva, E. V.
    Ananko, E. A.
    Merkulova, T. I.
    Kolchanov, N. A.
    Hodgman, T. C.
    BIOFIZIKA, 2006, 51 (04): : 633 - 639
  • [29] A SUBSPACE METHOD FOR THE DETECTION OF TRANSCRIPTION FACTOR BINDING SITES
    Pairo, Erola
    Marco, Santiago
    Perera, Alexandre
    BIONFORMATICS 2010: PROCEEDINGS OF THE FIRST INTERNATIONAL CONFERENCE ON BIOINFORMATICS, 2010, : 102 - 107
  • [30] Method for identifying transcription factor binding sites in yeast
    Tsai, Huai-Kuang
    Huang, Grace Tzu-Wei
    Chou, Meng-Yuan
    Lu, Henry Horng-Shing
    Li, Wen-Hsiung
    BIOINFORMATICS, 2006, 22 (14) : 1675 - 1681