Cross-Species Prediction of Transcription Factor Binding by Adversarial Training of a Novel Nucleotide-Level Deep Neural Network

被引:1
|
作者
Zhang, Qinhu [1 ,2 ,3 ]
Wang, Siguo [1 ]
Li, Zhipeng [1 ]
Pan, Yijie [1 ]
Huang, De-Shuang [1 ,4 ]
机构
[1] Ningbo Inst Digital Twin, Eastern Inst Technol, Ningbo 315201, Peoples R China
[2] Univ Sci & Technol China, Div Life Sci & Med, Hefei 230021, Peoples R China
[3] Guangxi Acad Sci, Big Data & Intelligent Comp Res Ctr, Nanning 530007, Peoples R China
[4] Tongji Univ, Shanghai East Hosp, Inst Regenerat Med, Shanghai 200092, Peoples R China
基金
中国博士后科学基金; 中国国家自然科学基金;
关键词
adversarial training; cross-species transcription factor binding prediction; nucleotide-level models; sequence-level models; DNA; DATABASE; GENOME; MOUSE;
D O I
10.1002/advs.202405685
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Cross-species prediction of TF binding remains a major challenge due to the rapid evolutionary turnover of individual TF binding sites, resulting in cross-species predictive performance being consistently worse than within-species performance. In this study, a novel Nucleotide-Level Deep Neural Network (NLDNN) is first proposed to predict TF binding within or across species. NLDNN regards the task of TF binding prediction as a nucleotide-level regression task, which takes DNA sequences as input and directly predicts experimental coverage values. Beyond predictive performance, it also assesses model performance by locating potential TF binding regions, discriminating TF-specific single-nucleotide polymorphisms (SNPs), and identifying causal disease-associated SNPs. The experimental results show that NLDNN outperforms the competing methods in these tasks. Then, a dual-path framework is designed for adversarial training of NLDNN to further improve the cross-species prediction performance by pulling the domain space of human and mouse species closer. Through comparison and analysis, it finds that adversarial training not only can improve the cross-species prediction performance between humans and mice but also enhance the ability to locate TF binding regions and discriminate TF-specific SNPs. By visualizing the predictions, it is figured out that the framework corrects some mispredictions by amplifying the coverage values of incorrectly predicted peaks. This work proposes a Nucleotide-Level Deep Neural Network (NLDNN) to study transcription factor binding within or across species from the perspective of nucleotide resolution. Moreover, this work designs a dual-path framework for adversarial training of NLDNN to further improve the cross-species prediction performance by pulling the domain space of human and mouse species closer. image
引用
收藏
页数:17
相关论文
共 35 条
  • [1] Domain-adaptive neural networks improve cross-species prediction of transcription factor binding
    Cochran, Kelly
    Srivastava, Divyanshi
    Shrikumar, Avanti
    Balsubramani, Akshay
    Hardison, Ross C.
    Kundaje, Anshul
    Mahony, Shaun
    GENOME RESEARCH, 2022, 32 (03) : 512 - 523
  • [2] Nucleotide-level prediction of CircRNA-protein binding based on fully convolutional neural network
    Shen, Zhen
    Liu, Wei
    Zhao, Shujun
    Zhang, Qinhu
    Wang, Siguo
    Yuan, Lin
    FRONTIERS IN GENETICS, 2023, 14
  • [3] Factors influencing the identification of transcription factor binding sites by cross-species comparison
    McCue, LA
    Thompson, W
    Carmack, CS
    Lawrence, CE
    GENOME RESEARCH, 2002, 12 (10) : 1523 - 1532
  • [4] PhyloScan: identification of transcription factor binding sites using cross-species evidence
    Carmack, C. Steven
    McCue, Lee Ann
    Newberg, Lee A.
    Lawrence, Charles E.
    ALGORITHMS FOR MOLECULAR BIOLOGY, 2007, 2
  • [5] PhyloScan: identification of transcription factor binding sites using cross-species evidence
    C Steven Carmack
    Lee Ann McCue
    Lee A Newberg
    Charles E Lawrence
    Algorithms for Molecular Biology, 2
  • [6] A Novel Adversarial Training Scheme for Deep Neural Network based Speech Enhancement
    Cornell, Samuele
    Principi, Emanuele
    Squartini, Stefano
    2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,
  • [7] Cross-species behavior analysis with attention-based domain-adversarial deep neural networks
    Takuya Maekawa
    Daiki Higashide
    Takahiro Hara
    Kentarou Matsumura
    Kaoru Ide
    Takahisa Miyatake
    Koutarou D. Kimura
    Susumu Takahashi
    Nature Communications, 12
  • [8] Cross-species behavior analysis with attention-based domain-adversarial deep neural networks
    Maekawa, Takuya
    Higashide, Daiki
    Hara, Takahiro
    Matsumura, Kentarou
    Ide, Kaoru
    Miyatake, Takahisa
    Kimura, Koutarou D.
    Takahashi, Susumu
    NATURE COMMUNICATIONS, 2021, 12 (01)
  • [9] Robust Transcription Factor Binding Site Prediction Using Deep Neural Networks
    Geete, Kanu
    Pandey, Manish
    CURRENT BIOINFORMATICS, 2020, 15 (10) : 1137 - 1152
  • [10] Prediction of Transcription Factor Binding Sites With an Attention Augmented Convolutional Neural Network
    Jing, Fang
    Zhang, Shao-Wu
    Zhang, Shihua
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2022, 19 (06) : 3614 - 3623