Comparison of three source attribution methods applied to whole genome sequencing data of monophasic and biphasic Salmonella Typhimurium isolates from the British Isles and Denmark

被引:0
|
作者
Guzinski, Jaromir [1 ]
Arnold, Mark [2 ]
Whiteley, Tim [2 ]
Tang, Yue [1 ]
Patel, Virag [2 ]
Trew, Jahcub [1 ]
Litrup, Eva [3 ]
Hald, Tine [4 ]
Smith, Richard Piers [2 ]
Petrovska, Liljana [1 ]
机构
[1] Anim & Plant Hlth Agcy, Dept Bacteriol, Addlestone, England
[2] Anim & Plant Hlth Agcy, Dept Epidemiol Sci, Addlestone, England
[3] Statens Serum Inst, Dept Bacteria Parasites & Fungi, Foodborne Infect, Copenhagen, Denmark
[4] Tech Univ Denmark, Natl Food Inst, Res Grp Genom Epidemiol, Kongens Lyngby, Denmark
关键词
source attribution; monophasic and biphasic Salmonella Typhimurium; machine learning; random forest; Bayesian modeling; Accessory genes-Based Source Attribution; bacterial genomics; FOOD SOURCES; INFECTIONS; SELECTION; BURDEN;
D O I
10.3389/fmicb.2024.1393824
中图分类号
Q93 [微生物学];
学科分类号
071005 ; 100705 ;
摘要
Methodologies for source attribution (SA) of foodborne illnesses comprise a rapidly expanding suite of techniques for estimating the most important source or sources of human infection. Recently, the increasing availability of whole genome sequencing (WGS) data for a wide range of bacterial strains has led to the development of novel SA methods. These techniques utilize the unique features of bacterial genomes adapted to different host types and hence offer increased resolution of the outputs. Comparative studies of different SA techniques reliant on WGS data are currently lacking. Here, we critically assessed and compared the outputs of three SA methods: a supervised classification random forest machine learning algorithm (RandomForest), an Accessory genes-Based Source Attribution method (AB_SA), and a Bayesian frequency matching method (Bayesian). Each technique was applied to the WGS data of a panel of 902 reservoir host and human monophasic and biphasic Salmonella enterica subsp. enterica serovar Typhimurium isolates sampled in the British Isles (BI) and Denmark from 2012 to 2016. Additionally, for RandomForest and Bayesian, we explored whether utilization of accessory genome features as model inputs improved attribution accuracy of these methods over using the core genome derived features only. Results indicated that this was the case for RandomForest, but for Bayesian the overall attribution estimates varied little regardless of the inclusion or not of the accessory genome features. All three methods attributed the vast majority of human isolates to the Pigs primary source class, which was expected given the known high relative prevalence rates in pigs, and hence routes of infection into the human population, of monophasic and biphasic S. Typhimurium in the BI and Denmark. The accuracy of AB_SA was lower than of RandomForest when attributing the primary source classes to the 120 animal test set isolates with known primary sources. A major advantage of both AB_SA and Bayesian was a much faster execution time as compared to RandomForest. Overall, the SA method comparison presented in this study describes the strengths and weaknesses of each of the three methods applied to attributing potential monophasic and biphasic S. Typhimurium animal sources to human infections that could be valuable when deciding which SA methodology would be the most applicable to foodborne disease outbreak scenarios involving monophasic and biphasic S. Typhimurium.
引用
收藏
页数:14
相关论文
共 9 条
  • [1] Development and validation of a random forest algorithm for source attribution of animal and human Salmonella Typhimurium and monophasic variants of S. Typhimurium isolates in England and Wales utilising whole genome sequencing data
    Guzinski, Jaromir
    Tang, Yue
    Chattaway, Marie Anne
    Dallman, Timothy J.
    Petrovska, Liljana
    FRONTIERS IN MICROBIOLOGY, 2024, 14
  • [2] Investigation of Outbreaks of Salmonella enterica Serovar Typhimurium and Its Monophasic Variants Using Whole-Genome Sequencing, Denmark
    Gymoese, Pernille
    Sorensen, Gitte
    Litrup, Eva
    Olsen, John Elmerdal
    Nielsen, Eva Moller
    Torpdahl, Mia
    EMERGING INFECTIOUS DISEASES, 2017, 23 (10) : 1631 - 1639
  • [3] A Review on Microbiological Source Attribution Methods of Human Salmonellosis: From Subtyping to Whole-Genome Sequencing
    Cardim Falcao, Rebeca
    Edwards, Megan R.
    Hurst, Matt
    Fraser, Erin
    Otterstatter, Michael
    FOODBORNE PATHOGENS AND DISEASE, 2024, 21 (03) : 137 - 146
  • [4] Comparison of Conventional Molecular and Whole-Genome Sequencing Methods for Differentiating Salmonella enterica Serovar Schwarzengrund Isolates Obtained from Food and Animal Sources
    Li, I-Chen
    Wu, Rayean
    Hu, Chung-Wen
    Wu, Keh-Ming
    Chen, Zeng-Weng
    Chou, Chung-Hsi
    MICROORGANISMS, 2021, 9 (10)
  • [5] Determining antimicrobial susceptibility in Salmonella enterica serovar Typhimurium through whole genome sequencing: a comparison against multiple phenotypic susceptibility testing methods
    Nana Mensah
    Yue Tang
    Shaun Cawthraw
    Manal AbuOun
    Jackie Fenner
    Nicholas R. Thomson
    Alison E. Mather
    Liljana Petrovska-Holmes
    BMC Microbiology, 19
  • [6] Determining antimicrobial susceptibility in Salmonella enterica serovar Typhimurium through whole genome sequencing: a comparison against multiple phenotypic susceptibility testing methods
    Mensah, Nana
    Tang, Yue
    Cawthraw, Shaun
    AbuOun, Manal
    Fenner, Jackie
    Thomson, Nicholas R.
    Mather, Alison E.
    Petrovska-Holmes, Liljana
    BMC MICROBIOLOGY, 2019, 19 (1)
  • [7] Comparison of conventional molecular and whole-genome sequencing methods for subtyping Salmonella enterica serovar Enteritidis strains from Tunisia
    Boutheina Ksibi
    Sonia Ktari
    Houcemeddine Othman
    Kais Ghedira
    Sonda Maalej
    Basma Mnif
    Mohamed salah Abbassi
    Laetitia Fabre
    Faouzia Rhimi
    Simon Le Hello
    Adnene Hammami
    European Journal of Clinical Microbiology & Infectious Diseases, 2021, 40 : 597 - 606
  • [8] Whole-Genome Sequencing of Drug-Resistant Salmonella enterica Isolates from Dairy Cattle and Humans in New York and Washington States Reveals Source and Geographic Associations
    Carroll, Laura M.
    Wiedmann, Martin
    den Bakker, Henk
    Siler, Julie
    Warchocki, Steven
    Kent, David
    Lyalina, Svetlana
    Davis, Margaret
    Sischo, William
    Besser, Thomas
    Warnick, Lorin D.
    Pereira, Richard V.
    APPLIED AND ENVIRONMENTAL MICROBIOLOGY, 2017, 83 (12)
  • [9] Real-Time Pathogen Detection in the Era of Whole-Genome Sequencing and Big Data: Comparison of k-mer and Site-Based Methods for Inferring the Genetic Distances among Tens of Thousands of Salmonella Samples
    Pettengill, James B.
    Pightling, Arthur W.
    Baugher, Joseph D.
    Rand, Hugh
    Strain, Errol
    PLOS ONE, 2016, 11 (11):