Recovering Escherichia coli Plasmids in the Absence of Long-Read Sequencing Data

Cited by: 12
Authors
Paganini, Julian A. [1 ]
Plantinga, Nienke L. [1 ]
Arredondo-Alonso, Sergio [2 ,3 ]
Willems, Rob J. L. [1 ]
Schurch, Anita C. [1 ]
Affiliations
[1] Univ Med Ctr Utrecht, Dept Med Microbiol, NL-3584 CX Utrecht, Netherlands
[2] Univ Oslo, Fac Med, Dept Biostat, N-0372 Oslo, Norway
[3] Wellcome Sanger Inst, Parasites & Microbes, Cambridge CB10 1SA, England
Funding
European Union Horizon 2020;
Keywords
WGS; plasmids; antibiotic resistance; bioinformatics; Escherichia coli; RESISTANCE GENES; EPIDEMIOLOGY;
DOI
10.3390/microorganisms9081613
Chinese Library Classification number
Q93 [Microbiology];
Discipline classification code
071005 ; 100705 ;
Abstract
The incidence of infections caused by multidrug-resistant E. coli strains has risen in the past years. Antibiotic resistance in E. coli is often mediated by acquisition and maintenance of plasmids. The study of E. coli plasmid epidemiology and genomics often requires long-read sequencing information, but recently a number of tools that allow plasmid prediction from short-read data have been developed. Here, we reviewed 25 available plasmid prediction tools and categorized them into binary plasmid/chromosome classification tools and plasmid reconstruction tools. We benchmarked six tools (MOB-suite, plasmidSPAdes, gplas, FishingForPlasmids, HyAsP and SCAPP) that aim to reliably reconstruct distinct plasmids, with a special focus on plasmids carrying antibiotic resistance genes (ARGs) such as extended-spectrum beta-lactamase genes. We found that two thirds (n = 425, 66.3%) of all plasmids were correctly reconstructed by at least one of the six tools, with a range of 92 (14.58%) to 317 (50.23%) correctly predicted plasmids. However, the majority of plasmids that carried antibiotic resistance genes (n = 85, 57.8%) could not be completely recovered as distinct plasmids by any of the tools. MOB-suite was the only tool that was able to correctly reconstruct the majority of plasmids (n = 317, 50.23%), and performed best at reconstructing large plasmids (n = 166, 46.37%) and ARG-plasmids (n = 41, 27.9%), but predictions frequently contained chromosome contamination (40%). In contrast, plasmidSPAdes reconstructed the highest fraction of plasmids smaller than 18 kbp (n = 168, 61.54%). Large ARG-plasmids, however, were frequently merged with sequences derived from distinct replicons. Available bioinformatic tools can provide valuable insight into E. coli plasmids, but also have important limitations. This work will serve as a guideline for selecting the most appropriate plasmid reconstruction tool for studies focusing on E. coli plasmids in the absence of long-read sequencing data.
Pages: 20