NetBCE: An Interpretable Deep Neural Network for Accurate Prediction of Linear B-cell Epitopes

被引:9
|
作者
Xu, Haodong [1 ]
Zhao, Zhongming [1 ,2 ,3 ,4 ]
机构
[1] Univ Texas Hlth Sci Ctr Houston, Ctr Precis Hlth, Sch Biomed Informat, Houston, TX 77030 USA
[2] Univ Texas Hlth Sci Ctr Houston, Human Genet Ctr, Sch Publ Hlth, Houston, TX 77030 USA
[3] Univ Texas MD Anderson Canc Ctr UTHlth Houston, Grad Sch Biomed Sci, Houston, TX 77030 USA
[4] Vanderbilt Univ, Dept Biomed Informat, Med Ctr, Nashville, TN 37203 USA
基金
美国国家卫生研究院;
关键词
B-cell epitope; Immunotherapy; Deep learning; Machine learning; Vaccine development; DATABASE; PROGRAM;
D O I
10.1016/j.gpb.2022.11.009
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Identification of B-cell epitopes (BCEs) plays an essential role in the development of peptide vaccines and immuno-diagnostic reagents, as well as antibody design and production. In this work, we generated a large benchmark dataset comprising 124,879 experimentally supported linear epitope-containing regions in 3567 protein clusters from over 1.3 million B cell assays. Analysis of this curated dataset showed large pathogen diversity covering 176 different families. The accuracy in linear BCE prediction was found to strongly vary with different features, while all sequencederived and structural features were informative. To search more efficient and interpretive feature representations, a ten-layer deep learning framework for linear BCE prediction, namely NetBCE, was developed. NetBCE achieved high accuracy and robust performance with the average area under the curve (AUC) value of 0.8455 in five-fold cross-validation through automatically learning the informative classification features. NetBCE substantially outperformed the conventional machine learning algorithms and other tools, with more than 22.06% improvement of AUC value compared to other tools using an independent dataset. Through investigating the output of important network modules in NetBCE, epitopes and non-epitopes tended to be presented in distinct regions with efficient feature representation along the network layer hierarchy. The NetBCE is freely available at https://github.com/bsml320/NetBCE.
引用
收藏
页码:1002 / 1012
页数:11
相关论文
共 50 条
  • [1] NetBCE: An Interpretable Deep Neural Network for Accurate Prediction of Linear B-cell Epitopes
    Haodong Xu
    Zhongming Zhao
    Genomics,Proteomics & Bioinformatics, 2022, Proteomics & Bioinformatics2022 (05) : 1002 - 1012
  • [2] EpiDope: a deep neural network for linear B-cell epitope prediction
    Collatz, Maximilian
    Mock, Florian
    Barth, Emanuel
    Hoelzer, Martin
    Sachse, Konrad
    Marz, Manja
    BIOINFORMATICS, 2021, 37 (04) : 448 - 455
  • [3] Prediction of linear B-cell epitopes
    Davydov, Ya. I.
    Tonevitsky, A. G.
    MOLECULAR BIOLOGY, 2009, 43 (01) : 150 - 158
  • [4] Prediction of linear B-cell epitopes
    Ya. I. Davydov
    A. G. Tonevitsky
    Molecular Biology, 2009, 43 : 150 - 158
  • [5] Prediction of Linear B-cell Epitopes Based on PCA and RNN Network
    Liu, Ling-yun
    Yang, Hong-guang
    Cheng, Bin
    PROCEEDINGS OF 2019 IEEE 7TH INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND COMPUTATIONAL BIOLOGY (ICBCB 2019), 2019, : 39 - 43
  • [6] Prediction of continuous B-cell epitopes in an antigen using recurrent neural network
    Saha, Sudipto
    Raghava, G. P. S.
    PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2006, 65 (01) : 40 - 48
  • [7] The Computational Prediction Methods for Linear B-cell Epitopes
    Jia, Cangzhi
    Gong, Hongyan
    Zhu, Yan
    Shi, Yixia
    CURRENT BIOINFORMATICS, 2019, 14 (03) : 226 - 233
  • [8] Prediction of linear B-cell epitopes using AAT scale
    Wang, Lian
    Liu, Juan
    Zhu, Shanfeng
    Gao, YangYang
    2009 3RD INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICAL ENGINEERING, VOLS 1-11, 2009, : 291 - +
  • [9] DL-TCNN: Deep Learning-based Temporal Convolutional Neural Network for prediction of conformational B-cell epitopes
    Angaitkar, Pratik
    Janghel, Rekh Ram
    Sahu, Tirath Prasad
    3 BIOTECH, 2023, 13 (09)
  • [10] DL-TCNN: Deep Learning-based Temporal Convolutional Neural Network for prediction of conformational B-cell epitopes
    Pratik Angaitkar
    Rekh Ram Janghel
    Tirath Prasad Sahu
    3 Biotech, 2023, 13