Combining Sequence and Epigenomic Data to Predict Transcription Factor Binding Sites Using Deep Learning

被引:1
|
作者
Jing, Fang [1 ]
Zhang, Shao-Wu [1 ]
Cao, Zhen [2 ]
Zhang, Shihua l [2 ,3 ]
机构
[1] Northwestern Polytech Univ, Coll Automat, Minist Educ, Key Lab Informat Fusion Technol, Xian 710072, Shaanxi, Peoples R China
[2] Chinese Acad Sci, Acad Math & Syst Sci, NCMIS, CEMS,RCSDS, Beijing 100190, Peoples R China
[3] Univ Chinese Acad Sci, Sch Math Sci, Beijing 100049, Peoples R China
基金
中国国家自然科学基金;
关键词
Bioinformatics; Machine learning; Transcription factors binding sites; Convolutional neural networks; DNA accessibility; Histone modification; CHROMATIN ACCESSIBILITY PREDICTION; NETWORKS;
D O I
10.1007/978-3-319-94968-0_23
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Knowing the transcription factor binding sites (TFBSs) is essential for modeling the underlying binding mechanisms and follow-up cellular functions. Convolutional neural networks (CNNs) have outperformed methods in predicting TFBSs from the primary DNA sequence. In addition to DNA sequences, histone modifications and chromatin accessibility are also important factors influencing their activity. They have been explored to predict TFBSs recently. However, current methods rarely take into account histone modifications and chromatin accessibility using CNN in an integrative framework. To this end, we developed a general CNN model to integrate these data for predicting TFBSs. We systematically benchmarked a series of architecture variants by changing network structure in terms of width and depth, and explored the effects of sample length at flanking regions. We evaluated the performance of the three types of data and their combinations using 256 ChIP-seq experiments and also compared it with competing machine learning methods. We find that contributions from these three types of data are complementary to each other. Moreover, the integrative CNN framework is superior to traditional machine learning methods with significant improvements.
引用
收藏
页码:241 / 252
页数:12
相关论文
共 50 条
  • [31] BTFBS: Binding Prediction of Bacterial Transcription Factors and Binding Sites Based on Deep Learning
    Jin, Bingbing
    Liang, Song
    Liu, Xiaoqian
    Zhang, Rui
    Zhu, Yun
    Chen, Yuanyuan
    Liu, Guangjin
    Yang, Tao
    MATHEMATICS, 2025, 13 (04)
  • [32] DeepD2V: A Novel Deep Learning-Based Framework for Predicting Transcription Factor Binding Sites from Combined DNA Sequence
    Deng, Lei
    Wu, Hui
    Liu, Xuejun
    Liu, Hui
    INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES, 2021, 22 (11)
  • [33] Identification of transcription factor binding sites using GA and PSO
    Chang, Xiao-Yu
    Zhou, Chun-Guang
    Li, Yan-Wen
    Hu, Ping
    ISDA 2006: SIXTH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS DESIGN AND APPLICATIONS, VOL 1, 2006, : 473 - 477
  • [34] Predicting transcription factor binding sites using structural knowledge
    Kaplan, T
    Friedman, N
    Margalit, H
    RESEARCH IN COMPUTATIONAL MOLECULAR BIOLOGY, PROCEEDINGS, 2005, 3500 : 522 - 537
  • [35] Prediction of transcription factor binding sites using genetic algorithm
    Chang, Xiaoyu
    Zhou, Wengang
    Zhou, Chunguang
    Liang, Yanchun
    ICIEA 2006: 1ST IEEE CONFERENCE ON INDUSTRIAL ELECTRONICS AND APPLICATIONS, VOLS 1-3, PROCEEDINGS, 2006, : 430 - 433
  • [36] Detection of transcription factor binding sites using Renyi entropy
    Maynou, Joan
    Vallverdu, Montserrat
    Claria, Francesc
    Perera, Alexandre
    Caminal, Pere
    8TH IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOENGINEERING, VOLS 1 AND 2, 2008, : 601 - +
  • [37] Prediction of transcription factor binding sites using genetic algorithm
    Chang, Xiaoyu
    Zhou, Wengang
    Zhou, Chunguang
    Liang, Yanchun
    2006 1ST IEEE CONFERENCE ON INDUSTRIAL ELECTRONICS AND APPLICATIONS, VOLS 1-3, 2006, : 932 - +
  • [38] Database of repetitive elements in complete genomes and data mining using transcription factor binding sites
    Horng, JT
    Lin, FM
    Lin, JH
    Huang, HD
    Liu, BJ
    IEEE TRANSACTIONS ON INFORMATION TECHNOLOGY IN BIOMEDICINE, 2003, 7 (02): : 93 - 100
  • [39] High-resolution transcription factor binding sites prediction improved performance and interpretability by deep learning method
    Zhang, Yongqing
    Wang, Zixuan
    Zeng, Yuanqi
    Zhou, Jiliu
    Zou, Quan
    BRIEFINGS IN BIOINFORMATICS, 2021, 22 (06)
  • [40] Pseudocounts for transcription factor binding sites
    Nishida, Keishin
    Frith, Martin C.
    Nakai, Kenta
    NUCLEIC ACIDS RESEARCH, 2009, 37 (03) : 939 - 944