Development and implementation of a core genome multilocus sequence typing scheme for Haemophilus influenzae

Cited by: 2
Authors
Krisna, Made Ananda [1, 2, 3]
Jolley, Keith A. [2]
Monteith, William [2, 4]
Boubour, Alexandra [5]
Hamers, Raph L. [1, 3]
Brueggemann, Angela B. [5]
Harrison, Odile B. [2, 5]
Maiden, Martin C. J. [2]
Affiliations
[1] Univ Oxford, Ctr Trop Med & Global Hlth, Nuffield Dept Med, Oxford, England
[2] Univ Oxford, Dept Biol, Oxford, England
[3] Univ Indonesia, Oxford Univ Clin Res Unit Indonesia, Fac Med, Jakarta, Indonesia
[4] Univ Bath, Dept Biol & Biochem, Bath, England
[5] Univ Oxford, Nuffield Dept Populat Hlth, Oxford, England
Source
MICROBIAL GENOMICS | 2024 / Vol. 10 / Issue 08
Funding
Wellcome Trust (UK);
Keywords
cgMLST; core genome; Haemophilus influenzae; population genetics; typing scheme; DISEASE; VACCINE; PAN; IDENTIFICATION; EPIDEMIOLOGY; SEROTYPE;
DOI
10.1099/mgen.0.001281
Chinese Library Classification
Q3 [Genetics];
Discipline classification codes
071007 ; 090102 ;
Abstract
Haemophilus influenzae is part of the human nasopharyngeal microbiota and a pathogen causing invasive disease. The extensive genetic diversity observed in H. influenzae necessitates discriminatory analytical approaches to evaluate its population structure. This study developed a core genome multilocus sequence typing (cgMLST) scheme for H. influenzae using pangenome analysis tools and validated the cgMLST scheme using datasets consisting of complete reference genomes (N = 14) and high-quality draft H. influenzae genomes (N = 2297). The draft genome dataset was divided into a development dataset (N = 921) and a validation dataset (N = 1376). The development dataset was used to identify potential core genes, and the validation dataset was used to refine the final core gene list to ensure the reliability of the proposed cgMLST scheme. Functional classifications were made for all the resulting core genes. Phylogenetic analyses were performed using both allelic profiles and nucleotide sequence alignments of the core genome to test congruence, as assessed by Spearman's correlation and ordinary least squares linear regression tests. Preliminary analyses using the development dataset identified 1067 core genes, which were refined to 1037 with the validation dataset. More than 70% of core genes were predicted to encode proteins essential for metabolism or genetic information processing. Phylogenetic and statistical analyses indicated that the core genome allelic profile accurately represented phylogenetic relatedness among the isolates (R² = 0.945). We used this cgMLST scheme to define a high-resolution population structure for H. influenzae, which enhances the genomic analysis of this clinically relevant human pathogen.
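The congruence test described in the abstract compares two kinds of pairwise distance between isolates: one from cgMLST allelic profiles and one from aligned core-genome nucleotide sequences. The following is a minimal sketch of that idea using only the Python standard library. It is not the authors' pipeline: the function names, the toy isolates, and the choices to skip missing loci and gapped alignment columns are all assumptions made for illustration.

```python
from itertools import combinations


def allelic_distance(p, q):
    """Count loci with different allele numbers; loci missing (None) in either profile are skipped."""
    return sum(1 for a, b in zip(p, q) if a is not None and b is not None and a != b)


def nucleotide_distance(s, t):
    """Count mismatched columns between two aligned sequences, ignoring gap ('-') columns."""
    return sum(1 for a, b in zip(s, t) if a != "-" and b != "-" and a != b)


def ranks(values):
    """Return 1-based ranks; tied values share the mean of their rank positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    result = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            result[order[k]] = mean_rank
        i = j + 1
    return result


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length numeric lists."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / (sxx * syy) ** 0.5


def spearman(x, y):
    """Spearman's rank correlation: Pearson correlation of the ranks."""
    return pearson(ranks(x), ranks(y))


def ols_r_squared(x, y):
    """R^2 of a simple least-squares line with intercept, which equals Pearson r squared."""
    return pearson(x, y) ** 2


def pairwise(items, distance):
    """Distances for every unordered pair of isolates, in a fixed (sorted-name) order."""
    names = sorted(items)
    return [distance(items[a], items[b]) for a, b in combinations(names, 2)]


# Hypothetical toy data: four isolates, four loci, ten aligned sites.
profiles = {"A": [1, 1, 1, 1], "B": [1, 2, 1, 1], "C": [2, 2, 3, 1], "D": [2, 3, 3, None]}
alignment = {"A": "ACGTACGTAC", "B": "ACGTACGTAT", "C": "ATGTACCTAT", "D": "ATGAACCTTT"}

allelic = pairwise(profiles, allelic_distance)
nucleotide = pairwise(alignment, nucleotide_distance)
print("Spearman rho:", round(spearman(allelic, nucleotide), 3))
print("OLS R^2:", round(ols_r_squared(allelic, nucleotide), 3))
```

Note that pairwise distances are not statistically independent observations, so this sketch illustrates how the two measures are computed rather than how their significance should be assessed.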
Pages: 14