PhageTailFinder: A tool for phage tail module detection and annotation

Cited by: 4
Authors
Zhou, Fengxia [1]
Yang, Han [1]
Si, Yu [1]
Gan, Rui [1]
Yu, Ling [1]
Chen, Chuangeng [1]
Ren, Chunyan [2]
Wu, Jiqiu [3]
Zhang, Fan [1, 4]
Affiliations
[1] Harbin Inst Technol, HIT Ctr Life Sci, Sch Life Sci & Technol, Harbin, Peoples R China
[2] Harvard Med Sch, Boston Childrens Hosp, Dept Hematol, Dept Oncol, Boston, MA USA
[3] Univ Groningen, Univ Med Ctr Groningen, Dept Genet, Groningen, Netherlands
[4] Chinese Acad Sci, Hefei Inst Phys Sci, Inst Hlth & Med Technol, Anhui Prov Key Lab Med Phys & Technol, Hefei, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
phage; tail gene cluster; two-state HMM; DBSCAN; phage therapy; BACTERIOPHAGES; VIRUSES;
DOI
10.3389/fgene.2023.947466
Chinese Library Classification
Q3 [Genetics]
Discipline classification codes
071007; 090102
Abstract
Decades of antimicrobial overuse in treating and preventing bacterial infections have driven the emergence of drug-resistant bacteria, which poses a significant public health challenge and creates an urgent need for alternatives to conventional antibiotics. Bacteriophages are viruses that infect specific bacterial hosts and often destroy them. Phages attach to and enter their hosts using tail proteins, and the composition of the tail determines the range of bacteria a phage can infect. To aid the exploitation of bacteriophages for therapeutic purposes, we developed the PhageTailFinder algorithm to predict tail-related proteins and identify the putative tail module in previously uncharacterized phages. PhageTailFinder relies on a two-state hidden Markov model (HMM) to predict the probability that a given protein is tail-related. The model takes into account the natural modularity of phage tail-related proteins, rather than considering the amino acid properties or secondary structures of each protein in isolation. Because of this sequence-independent operation, PhageTailFinder exhibited robust predictive power for tail proteins in novel phages. Performance was evaluated on 13 extensively studied phages and a sample of 992 complete phages from the NCBI database. The algorithm achieved a high true-positive prediction rate (> 80%) in over half (571) of the studied phages, and the ROC value was 0.877 using general models and 0.968 using the corresponding morphologic models. Notably, the median ROC value across the 992 complete phages exceeded 0.75 even for novel phages, indicating the high accuracy and specificity of PhageTailFinder. When applied to a dataset of 189,680 viral genomes derived from 11,810 bulk metagenomic human stool samples, the ROC value was 0.895.
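As a rough illustration of how a two-state HMM can exploit gene-order modularity, the following minimal Python sketch runs forward-backward smoothing over per-gene scores. It is not the PhageTailFinder implementation: the function name `tail_posteriors`, the per-gene "tail-likeness" scores, the emission model, and the `stay` and `prior_tail` values are all assumptions made for this example.

```python
def tail_posteriors(scores, stay=0.9, prior_tail=0.2):
    """Posterior P(tail) per gene from a two-state HMM (0 = non-tail, 1 = tail).

    `scores` are hypothetical per-gene tail-likeness values strictly between
    0 and 1, listed in genome order. All parameters are illustrative
    assumptions, not values taken from the PhageTailFinder paper.
    """
    n = len(scores)
    if n == 0:
        return []
    # trans[i][j] = P(next state j | current state i); "sticky" states model
    # the tendency of tail genes to sit next to each other.
    trans = [[stay, 1 - stay], [1 - stay, stay]]
    init = [1 - prior_tail, prior_tail]
    # Assumed emission likelihoods: P(score | non-tail) = 1 - s, P(score | tail) = s.
    emit = [[1 - s, s] for s in scores]

    # Forward pass, normalized at every step to avoid underflow.
    fwd = []
    for t in range(n):
        if t == 0:
            row = [init[k] * emit[0][k] for k in (0, 1)]
        else:
            prev = fwd[-1]
            row = [emit[t][k] * sum(prev[j] * trans[j][k] for j in (0, 1))
                   for k in (0, 1)]
        z = sum(row)
        fwd.append([v / z for v in row])

    # Backward pass, also normalized.
    bwd = [[1.0, 1.0] for _ in range(n)]
    for t in range(n - 2, -1, -1):
        row = [sum(trans[k][j] * emit[t + 1][j] * bwd[t + 1][j] for j in (0, 1))
               for k in (0, 1)]
        z = sum(row)
        bwd[t] = [v / z for v in row]

    # Combine both passes into the posterior probability of the tail state.
    posteriors = []
    for t in range(n):
        non_tail = fwd[t][0] * bwd[t][0]
        tail = fwd[t][1] * bwd[t][1]
        posteriors.append(tail / (non_tail + tail))
    return posteriors


# Hypothetical scores for eight consecutive genes.
example = tail_posteriors([0.1, 0.1, 0.8, 0.4, 0.9, 0.8, 0.1, 0.1])
```

In this toy input, the gene scoring 0.4 sits between strongly tail-like neighbors, so its posterior ends up above 0.5, whereas the same score in isolation would give a posterior of about 0.14 under the assumed prior. This is the sense in which context, rather than each protein alone, drives the prediction.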
In addition, tail protein clusters can be identified for further study using the density-based spatial clustering of applications with noise (DBSCAN) algorithm. PhageTailFinder is available as a web server (http://www.microbiome-bigdata.com/PHISDetector/index/tools/PhageTailFinder) or as a stand-alone program for a standard desktop computer (https://github.com/HIT-ImmunologyLab/PhageTailFinder).
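To make the clustering step concrete, here is a small self-contained DBSCAN sketch over one-dimensional gene positions. It only illustrates the general algorithm: the function name `dbscan_1d`, the use of gene indices as coordinates, and the `eps` and `min_samples` values are assumptions for this example, not settings documented for PhageTailFinder.

```python
def dbscan_1d(points, eps, min_samples):
    """Plain DBSCAN on 1-D coordinates; returns one label per point (-1 = noise)."""
    n = len(points)
    labels = [None] * n  # None means "not visited yet"

    def neighbors(i):
        # All points within eps of point i, including i itself.
        return [j for j in range(n) if abs(points[i] - points[j]) <= eps]

    cluster = -1
    for i in range(n):
        if labels[i] is not None:
            continue
        nb = neighbors(i)
        if len(nb) < min_samples:
            labels[i] = -1  # provisional noise; may later become a border point
            continue
        cluster += 1
        labels[i] = cluster
        queue = [j for j in nb if j != i]
        while queue:
            j = queue.pop()
            if labels[j] == -1:
                labels[j] = cluster  # border point: joins the cluster, not expanded
                continue
            if labels[j] is not None:
                continue  # already assigned to a cluster
            labels[j] = cluster
            nb_j = neighbors(j)
            if len(nb_j) >= min_samples:
                queue.extend(nb_j)  # core point: keep growing the cluster
    return labels


# Hypothetical genome indices of genes predicted to be tail-related.
tail_gene_indices = [3, 12, 13, 14, 16, 30]
labels = dbscan_1d(tail_gene_indices, eps=2, min_samples=3)
# labels == [-1, 0, 0, 0, 0, -1]
```

With these assumed settings, the genes at indices 12 to 16 form a single putative tail module, while the isolated predictions at indices 3 and 30 are treated as noise.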
Pages: 10