PhageTailFinder: A tool for phage tail module detection and annotation

被引:4
|
作者
Zhou, Fengxia [1 ]
Yang, Han [1 ]
Si, Yu [1 ]
Gan, Rui [1 ]
Yu, Ling [1 ]
Chen, Chuangeng [1 ]
Ren, Chunyan [2 ]
Wu, Jiqiu [3 ]
Zhang, Fan [1 ,4 ]
机构
[1] Harbin Inst Technol, HIT Ctr Life Sci, Sch Life Sci & Technol, Harbin, Peoples R China
[2] Harvard Med Sch, Boston Childrens Hosp, Dept Hematol, Dept Oncol, Boston, MA USA
[3] Univ Groningen, Univ Med Ctr Groningen, Dept Genet, Groningen, Netherlands
[4] Chinese Acad Sci, Hefei Inst Phys Sci, Inst Hlth & Med Technol, Anhui Prov Key Lab Med Phys & Technol, Hefei, Peoples R China
基金
中国国家自然科学基金;
关键词
phage; tail gene cluster; two-state HMM; DBSCAN; phage therapy; BACTERIOPHAGES; VIRUSES;
D O I
10.3389/fgene.2023.947466
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Decades of overconsumption of antimicrobials in the treatment and prevention of bacterial infections have resulted in the increasing emergence of drug-resistant bacteria, which poses a significant challenge to public health, driving the urgent need to find alternatives to conventional antibiotics. Bacteriophages are viruses infecting specific bacterial hosts, often destroying the infected bacterial hosts. Phages attach to and enter their potential hosts using their tail proteins, with the composition of the tail determining the range of potentially infected bacteria. To aid the exploitation of bacteriophages for therapeutic purposes, we developed the PhageTailFinder algorithm to predict tail-related proteins and identify the putative tail module in previously uncharacterized phages. The PhageTailFinder relies on a two-state hidden Markov model (HMM) to predict the probability of a given protein being tail-related. The process takes into account the natural modularity of phage tail-related proteins, rather than simply considering amino acid properties or secondary structures for each protein in isolation. The PhageTailFinder exhibited robust predictive power for phage tail proteins in novel phages due to this sequence-independent operation. The performance of the prediction model was evaluated in 13 extensively studied phages and a sample of 992 complete phages from the NCBI database. The algorithm achieved a high true-positive prediction rate (> 80%) in over half (571) of the studied phages, and the ROC value was 0.877 using general models and 0.968 using corresponding morphologic models. It is notable that the median ROC value of 992 complete phages is more than 0.75 even for novel phages, indicating the high accuracy and specificity of the PhageTailFinder. When applied to a dataset containing 189,680 viral genomes derived from 11,810 bulk metagenomic human stool samples, the ROC value was 0.895. In addition, tail protein clusters could be identified for further studies by density-based spatial clustering of applications with the noise algorithm (DBSCAN). The developed PhageTailFinder tool can be accessed either as a web server (http://www.microbiome-bigdata.com/PHISDetector/index/tools/ PhageTailFinder) or as a stand-alone program on a standard desktop computer (https://github.com/HIT-ImmunologyLab/PhageTailFinder).
引用
收藏
页数:10
相关论文
共 50 条
  • [21] Structure dissection of a contractile phage tail tube
    Martinez, D.
    Berbon, M.
    Armel, B.
    Camille, S.
    Fronzes, R.
    Loquet, A.
    Sophie, Z. J.
    Habenstein, B.
    EUROPEAN BIOPHYSICS JOURNAL WITH BIOPHYSICS LETTERS, 2019, 48 : S38 - S38
  • [22] Structure dissection of a contractile phage tail tube
    Martinez, D.
    Berbon, M.
    Armel, B.
    Camille, S.
    Fronzes, R.
    Loquet, A.
    Sophie, Z. J.
    Habenstein, B.
    EUROPEAN BIOPHYSICS JOURNAL WITH BIOPHYSICS LETTERS, 2019, 48 : S92 - S92
  • [23] multiPhATE: bioinformatics pipeline for functional annotation of phage isolates
    Zhou, Carol L. Ecale
    Malfatti, Stephanie
    Kimbrel, Jeffrey
    Philipson, Casandra
    McNair, Katelyn
    Hamilton, Theron
    Edwards, Robert
    Souza, Brian
    BIOINFORMATICS, 2019, 35 (21) : 4402 - 4404
  • [24] Bioeditor - A tool for structure annotation
    Craig, PA
    Yang, P
    Bourne, PE
    FASEB JOURNAL, 2002, 16 (05): : A741 - A742
  • [25] An annotation tool for video understanding
    Rincon, A.
    Martinez-Cantos, J.
    COMPUTER AIDED SYSTEMS THEORY- EUROCAST 2007, 2007, 4739 : 701 - 708
  • [26] brat Rapid Annotation Tool
    Saklofske, Jon
    RENAISSANCE AND REFORMATION, 2019, 42 (02) : 180 - 189
  • [27] CAT: the CELCT Annotation Tool
    Lenzi, Valentina Bartalesi
    Moretti, Giovanni
    Sprugnoli, Rachele
    LREC 2012 - EIGHTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2012, : 333 - 338
  • [28] An annotation tool for semantic documents
    Eriksson, Henrik
    SEMANTIC WEB: RESEARCH AND APPLICATIONS, PROCEEDINGS, 2007, 4519 : 759 - 768
  • [29] ePIANNO: ePIgenomics ANNOtation tool
    Liu, Chia-Hsin
    Ho, Bing-Ching
    Chen, Chun-Ling
    Chang, Ya-Hsuan
    Hsu, Yi-Chiung
    Li, Yu-Cheng
    Yuan, Shin-Sheng
    Huang, Yi-Huan
    Chang, Chi-Sheng
    Li, Ker-Chau
    Chen, Hsuan-Yu
    PLOS ONE, 2016, 11 (02):
  • [30] AnCoraPipe: A tool for multilevel annotation
    Bertran, Manuel
    Borrega, Oriol
    Recasens, Marta
    Soriano, Barbara
    PROCESAMIENTO DEL LENGUAJE NATURAL, 2008, (41): : 291 - 292