Pan-genomic analysis of the species Salmonella enterica: Identification of core essential and putative essential genes

Cited by: 8
Authors
Chand, Yamini [1 ]
Alam, Md. Afroz [2 ]
Singh, Sachidanand [1 ]
Affiliations
[1] Shri Ramswaroop Mem Univ, Inst Biosci & Technol, Fac Biotechnol, Lucknow Deva Rd, Barabanki 225003, Uttar Pradesh, India
[2] Karunya Inst Technol & Sci, Dept Biotechnol, Coimbatore 641114, Tamil Nadu, India
Source
GENE REPORTS | 2020 / Volume 20
Keywords
Salmonella enterica; Comparative genomics; Phylogenetic analysis; BLAST matrix; Pan-genome; Core genome; Dispensable genome; COG classification; Essentiality analysis; DRUG TARGETS; ESCHERICHIA-COLI; DATABASE; DEG; PRIORITIZATION; ANNOTATION; PREDICTION; PROTEINS; SEQUENCE; REVEALS;
DOI
10.1016/j.genrep.2020.100669
Chinese Library Classification
Q3 [Genetics];
Subject classification code
071007 ; 090102 ;
Abstract
Background: Essential genes are defined as the minimal gene set required to support bacterial life. Identifying them is crucial for developing new antimicrobials against multidrug-resistant pathogens such as serovars of Salmonella enterica. Methodology: In the present work, we hypothesize that essential genes within a group of evolutionarily closely related organisms may be highly conserved. We therefore conducted an extensive comparative genomic analysis of 44 genome sequences representing 17 serovars of S. enterica to better understand the conserved genes essential for its survival. Results: Pan-genome estimates indicate that the genus Salmonella displays an open pan-genome structure comprising a reservoir of 10,775 gene families. Of these, 2847, 4657, and 3271 constitute the core gene families (CGFs), dispensable gene families (DGFs), and strain-specific gene families (SSGFs), respectively. The pan-genome family tree based on the presence/absence of gene families is highly concordant with the 16S rRNA tree, though the former provides a more robust phylogenetic resolution. The Clusters of Orthologous Groups of proteins (COGs) database assigned 40.9% of the CGFs to metabolism, whereas a large proportion of the DGFs (70.6%) was uncharacterized. Homology analysis of the CGFs against the Database of Essential Genes (DEG) identified 1695 essential CGFs (E-CGFs). Of these, 687 are experimentally verified as essential in Salmonella, 1157 are identified in >= 2 species, 159 are conserved in >= 7 species, and 538 are present in at least one species. Thus, for the species S. enterica, 69%, 52%, and 31% of the genome are dedicated to the core, essential, and dispensable functions, respectively. Conclusion: The E-CGFs identified may serve as important targets for the development of novel antimicrobials, and their detailed analysis may shed new light on Salmonella's survival.
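The three-way partition of gene families reported in the abstract can be illustrated with a minimal sketch. It assumes the conventional definitions (core: present in every genome; strain-specific: present in exactly one genome; dispensable: everything in between), which the abstract does not spell out. The function name `classify_gene_families` and the toy input are hypothetical and are not taken from the paper. The last lines only check that the reported family counts add up to the stated pan-genome size.

```python
from collections import Counter


def classify_gene_families(presence, n_genomes):
    """Count core, dispensable, and strain-specific gene families.

    presence: dict mapping a gene-family id to the set of genome ids
              that carry it (a presence/absence matrix in sparse form).
    n_genomes: total number of genomes compared.
    Assumed thresholds: core = all genomes, strain-specific = exactly one.
    """
    counts = Counter()
    for genomes in presence.values():
        k = len(genomes)
        if k == n_genomes:
            counts["core"] += 1
        elif k == 1:
            counts["strain_specific"] += 1
        elif k > 1:
            counts["dispensable"] += 1
    return counts


# Toy input (hypothetical): three genomes, three gene families.
toy = {
    "famA": {"g1", "g2", "g3"},  # present everywhere
    "famB": {"g1", "g2"},        # present in some genomes
    "famC": {"g3"},              # present in a single genome
}
result = classify_gene_families(toy, n_genomes=3)

# Consistency check on the counts reported in the abstract.
reported = {"core": 2847, "dispensable": 4657, "strain_specific": 3271}
total = sum(reported.values())
```

With the reported values, `total` equals the 10,775 gene families given for the pan-genome.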
Pages: 16