PhageTailFinder: A tool for phage tail module detection and annotation

Cited by: 4
Authors
Zhou, Fengxia [1]
Yang, Han [1]
Si, Yu [1]
Gan, Rui [1]
Yu, Ling [1]
Chen, Chuangeng [1]
Ren, Chunyan [2]
Wu, Jiqiu [3]
Zhang, Fan [1, 4]
Affiliations
[1] Harbin Inst Technol, HIT Ctr Life Sci, Sch Life Sci & Technol, Harbin, Peoples R China
[2] Harvard Med Sch, Boston Childrens Hosp, Dept Hematol, Dept Oncol, Boston, MA USA
[3] Univ Groningen, Univ Med Ctr Groningen, Dept Genet, Groningen, Netherlands
[4] Chinese Acad Sci, Hefei Inst Phys Sci, Inst Hlth & Med Technol, Anhui Prov Key Lab Med Phys & Technol, Hefei, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
phage; tail gene cluster; two-state HMM; DBSCAN; phage therapy; BACTERIOPHAGES; VIRUSES;
DOI
10.3389/fgene.2023.947466
Chinese Library Classification
Q3 [Genetics];
Subject classification codes
071007; 090102;
Abstract
Decades of antimicrobial overuse in the treatment and prevention of bacterial infections have driven the emergence of drug-resistant bacteria, which poses a significant challenge to public health and creates an urgent need for alternatives to conventional antibiotics. Bacteriophages are viruses that infect specific bacterial hosts, often destroying them. Phages attach to and enter their hosts using their tail proteins, and the composition of the tail determines the range of bacteria that can be infected. To aid the exploitation of bacteriophages for therapeutic purposes, we developed the PhageTailFinder algorithm to predict tail-related proteins and identify the putative tail module in previously uncharacterized phages. PhageTailFinder relies on a two-state hidden Markov model (HMM) to predict the probability that a given protein is tail-related. The process takes into account the natural modularity of phage tail-related proteins, rather than simply considering amino acid properties or secondary structures for each protein in isolation. Owing to this sequence-independent operation, PhageTailFinder exhibited robust predictive power for tail proteins in novel phages. The performance of the prediction model was evaluated on 13 extensively studied phages and a sample of 992 complete phages from the NCBI database. The algorithm achieved a high true-positive prediction rate (> 80%) in over half (571) of the studied phages, and the ROC value was 0.877 using general models and 0.968 using the corresponding morphologic models. Notably, the median ROC value across the 992 complete phages exceeded 0.75 even for novel phages, indicating the high accuracy and specificity of PhageTailFinder. When applied to a dataset of 189,680 viral genomes derived from 11,810 bulk metagenomic human stool samples, the ROC value was 0.895.
In addition, tail protein clusters can be identified for further study using the density-based spatial clustering of applications with noise (DBSCAN) algorithm. PhageTailFinder can be accessed either as a web server (http://www.microbiome-bigdata.com/PHISDetector/index/tools/PhageTailFinder) or as a stand-alone program on a standard desktop computer (https://github.com/HIT-ImmunologyLab/PhageTailFinder).
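The two ideas named in the abstract, a two-state HMM over genes and DBSCAN clustering of the predicted tail genes, can be illustrated with a minimal sketch. This is not the published implementation. It assumes, purely for illustration, that each gene carries a binary observation (1 if it has some evidence of being tail-related, 0 otherwise), that the HMM parameters are the made-up values shown, and that clustering is done on gene indices along the genome. The names `tail_posteriors` and `dbscan_1d` are hypothetical.

```python
# Illustrative two-state HMM (state 0 = non-tail, state 1 = tail) over genes in
# genome order. All probabilities are made-up demonstration values (assumption),
# not the parameters used by PhageTailFinder.
START = [0.8, 0.2]      # P(first gene is non-tail), P(first gene is tail)
TRANS = [[0.9, 0.1],    # from non-tail: stay non-tail, switch to tail
         [0.2, 0.8]]    # from tail: switch to non-tail, stay tail
EMIT = [[0.9, 0.1],     # non-tail: P(obs = 0), P(obs = 1)
        [0.4, 0.6]]     # tail:     P(obs = 0), P(obs = 1)
STATES = (0, 1)


def tail_posteriors(obs):
    """Return P(state = tail | all observations) per gene (scaled forward-backward)."""
    n = len(obs)
    if n == 0:
        return []
    fwd, scales = [], []
    for t, o in enumerate(obs):
        if t == 0:
            row = [START[s] * EMIT[s][o] for s in STATES]
        else:
            row = [sum(fwd[-1][p] * TRANS[p][s] for p in STATES) * EMIT[s][o]
                   for s in STATES]
        c = sum(row)                      # scaling factor keeps values stable
        scales.append(c)
        fwd.append([x / c for x in row])
    bwd = [[1.0, 1.0] for _ in range(n)]
    for t in range(n - 2, -1, -1):
        o = obs[t + 1]
        bwd[t] = [sum(TRANS[s][q] * EMIT[q][o] * bwd[t + 1][q] for q in STATES)
                  / scales[t + 1] for s in STATES]
    # Product of scaled forward and backward terms is the posterior.
    return [fwd[t][1] * bwd[t][1] for t in range(n)]


def dbscan_1d(positions, eps, min_pts):
    """Minimal DBSCAN on 1-D positions; returns one label per point (-1 = noise)."""
    labels = [None] * len(positions)

    def neighbors(i):
        return [j for j, p in enumerate(positions) if abs(p - positions[i]) <= eps]

    cluster = -1
    for i in range(len(positions)):
        if labels[i] is not None:
            continue
        nb = neighbors(i)
        if len(nb) < min_pts:
            labels[i] = -1                # provisionally noise
            continue
        cluster += 1
        labels[i] = cluster
        queue = [j for j in nb if j != i]
        while queue:
            j = queue.pop()
            if labels[j] == -1:
                labels[j] = cluster       # noise becomes a border point
            if labels[j] is not None:
                continue
            labels[j] = cluster
            nb_j = neighbors(j)
            if len(nb_j) >= min_pts:      # only core points expand the cluster
                queue.extend(nb_j)
    return labels


if __name__ == "__main__":
    observations = [0, 0, 1, 1, 1, 0, 1, 0, 0, 0]   # hypothetical per-gene evidence
    post = tail_posteriors(observations)
    tail_genes = [i for i, p in enumerate(post) if p > 0.5]
    print(tail_genes, dbscan_1d(tail_genes, eps=1, min_pts=2))
```

Because the transition matrix favours staying in the same state, a gene with weak evidence that sits between strongly supported tail genes can still receive a high posterior, which is one way to read the abstract's point about modularity. The 0.5 threshold, `eps`, and `min_pts` are arbitrary choices for this sketch.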
Pages: 10
Related papers
50 in total
  • [41] DeAnnIso: a tool for online detection and annotation of isomiRs from small RNA sequencing data
    Zhang, Yuanwei
    Zang, Qiguang
    Zhang, Huan
    Ban, Rongjun
    Yang, Yifan
    Iqbal, Furhan
    Li, Ao
    Shi, Qinghua
    NUCLEIC ACIDS RESEARCH, 2016, 44 (W1) : W166 - W175
  • [42] Analysis of enterocoliticin, a phage tail-like bacteriocin
    Strauch, E
    Kaspar, H
    Schaudinn, C
    Damasko, C
    Konietzny, A
    Dersch, P
    Skurnik, M
    Appel, B
    GENUS YERSINIA: ENTERING THE FUNCTIONAL GENOMIC ERA, 2003, 529 : 249 - 251
  • [43] MORPHOGENESIS OF T3 PHAGE - TAIL MORPHOGENESIS
    KATO, H
    FUJISAWA, H
    JAPANESE JOURNAL OF GENETICS, 1975, 49 (05): : 301 - 302
  • [44] STRUCTURE OF TAIL OF TEMPORATE PHAGE NO 1 OF BAC MEGATERIUM
    MIKHAILOV, AM
    BELIAEVA, NN
    ZOGRAPH, ON
    TIKHONENKO, AS
    DOKLADY AKADEMII NAUK SSSR, 1978, 242 (04): : 946 - &
  • [45] STUDIES ON PHAGE TAIL BACTERIOCINS OF PROTEUS-RETTGERI
    TRAUB, WH
    ACKER, G
    KLEBER, I
    ZENTRALBLATT FUR BAKTERIOLOGIE MIKROBIOLOGIE UND HYGIENE SERIES A-MEDICAL MICROBIOLOGY INFECTIOUS DISEASES VIROLOGY PARASITOLOGY, 1974, 229 (03): : 362 - 371
  • [46] Structure of the membrane-piercing phage tail spike
    Leiman, Petr G.
    Browning, Christopher
    Shneider, Mikhail
    ACTA CRYSTALLOGRAPHICA A-FOUNDATION AND ADVANCES, 2011, 67 : C409 - C410
  • [47] Sequence and annotation of the Wizard007 mycobacterium phage genome
    Ejike Anyanwu
    Kaitlyn Cole
    Karlee Driver
    Anthony Falcone
    Elizabeth Farnsworth
    Benjamin Howard
    Brittney Howard
    Courtney Howard
    Rodney King
    Jordan Olberding
    Mackenzie Perkins
    Claire Rinehart
    Heidi Sayre
    Tyler Scaff
    Sarah Schrader
    Prasanna Tamarapu Parthasarathy
    Cynthia Tope
    BMC Bioinformatics, 11 (Suppl 4)
  • [48] Separation and colorimetric detection of Escherichia coli by phage tail fiber protein combined with nano-magnetic beads
    Hong, Bin
    Li, Yanmei
    Wang, Wenhai
    Ma, Yi
    Wang, Jufang
    MICROCHIMICA ACTA, 2023, 190 (06)
  • [49] A Text Annotation Tool with Pre-annotation Based on Deep Learning
    Teng, Fei
    Ma, Minbo
    Ma, Zheng
    Huang, Lufei
    Xiao, Ming
    Li, Xuan
    KNOWLEDGE SCIENCE, ENGINEERING AND MANAGEMENT, KSEM 2019, PT I, 2019, 11775 : 440 - 451
  • [50] P-CSEM: An Attention Module for Improved Laparoscopic Surgical Tool Detection
    Arabian, Herag
    Alshirbaji, Tamer Abdulbaki
    Jalal, Nour Aldeen
    Krueger-Ziolek, Sabine
    Moeller, Knut
    SENSORS, 2023, 23 (16)