Pan-genomic analysis of the species Salmonella enterica: Identification of core essential and putative essential genes

被引:8
|
作者
Chand, Yamini [1 ]
Alam, Md. Afroz [2 ]
Singh, Sachidanand [1 ]
机构
[1] Shri Ramswaroop Mem Univ, Inst Biosci & Technol, Fac Biotechnol, Lucknow Deva Rd, Barabanki 225003, Uttar Pradesh, India
[2] Karunya Inst Technol & Sci, Dept Biotechnol, Coimbatore 641114, Tamil Nadu, India
来源
GENE REPORTS | 2020年 / 20卷
关键词
Salmonella enterica; Comparative genomics; Phylogenetic analysis; BLAST matrix; Pan-genome; Core genome; Dispensable genome; COG classification; Essentiality analysis; DRUG TARGETS; ESCHERICHIA-COLI; DATABASE; DEG; PRIORITIZATION; ANNOTATION; PREDICTION; PROTEINS; SEQUENCE; REVEALS;
D O I
10.1016/j.genrep.2020.100669
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Background: Essential genes are defined as the minimal gene set required to support bacterial life. In order to develop new antimicrobials to treat multidrug-resistant pathogens, such as serovars of Salmonella enterica, the identification of essential genes is crucial. Methodology: In the present work, we hypothesize that essential genes within a group of evolutionary closely related organisms may be highly conserved. We, therefore, conducted an extensive comparative genomic analysis of 44 genome sequences representing 17 serovars of S. enterica to gain an improved understanding of conserved essential genes for its survival. Results: Pan-genome estimates indicate that the genus Salmonella displays an open pan-genome structure comprising a reservoir of 10,775 gene families. Of these, 2847, 4657, and 3271 constitute the core gene families (CGFs), dispensable gene families (DGFs), and strain-specific gene families (SSGFs), respectively. The pan-genome family tree based on the presence/absence of gene families is highly concordant with the 16S rRNA tree, though the former provides a more robust phylogenetic resolution. The Clusters of Orthologous Groups of proteins (COGs) database categorized the vast majority of the CGFs (40.9%) to metabolism, whereas a large proportion of the DGFs (70.6%) was uncharacterized. Homology analysis of the CGFs against the Database of essential genes (DEG) identified 1695 essential CGFs (E-CGFs). Of these, 687 are experimentally verified as essential in Salmonella, 1157 are identified in >= 2 species, 159 are conserved in >= 7 species, and 538 were present in at least one species. Thus, for the species, S. enterica 69%, 52%, and 31% of the genome are dedicated to the core, essential, and dispensable functions, respectively. Conclusion: The E-CGFs identified may serve as important targets for the development of novel antimicrobials, and their detailed analysis may shed new light on a better understanding of Salmonella's survival.
引用
收藏
页数:16
相关论文
共 50 条
  • [31] Putative drug and vaccine target protein identification using comparative genomic metabolic pathways of Salmonella enterica serovar Typhimurium
    Lee, S. -J.
    Park, S. -C.
    JOURNAL OF VETERINARY PHARMACOLOGY AND THERAPEUTICS, 2015, 38 : 90 - 91
  • [32] Genomic analysis of the MLST population structure and antimicrobial resistance genes associated with Salmonella enterica in Mexico
    Gomez-Baltazar, Adrian
    Godinez-Oviedo, Angelica
    Vazquez-Marrufo, Gerardo
    Vazquez-Garciduenas, Ma. Soledad
    Hernandez-Iturriaga, Montserrat
    GENOME, 2023, 66 (12) : 319 - 332
  • [33] Identification of Salmonella enterica species- and subgroup-specific genomic regions using Panseq 2.0
    Laing, Chad
    Villegas, Andre
    Taboada, Eduardo N.
    Kropinski, Andrew
    Thomas, James E.
    Gannon, Victor P. J.
    INFECTION GENETICS AND EVOLUTION, 2011, 11 (08) : 2151 - 2161
  • [34] Identification and functional analysis of Salmonella enterica serovar Typhimurium PmrA-regulated genes
    Tamayo, R
    Prouty, AM
    Gunn, JS
    FEMS IMMUNOLOGY AND MEDICAL MICROBIOLOGY, 2005, 43 (02): : 249 - 258
  • [35] Identification and functional analysis of essential, conserved, housekeeping and duplicated genes
    Arun, P. V. Parvati Sai
    Miryala, Sravan Kumar
    Chattopadhyay, Subhayan
    Thiyyagura, Kranthi
    Bawa, Payal
    Bhattacharjee, Madhuchhanda
    Yellaboina, Sailu
    FEBS LETTERS, 2016, 590 (10) : 1428 - 1437
  • [36] Genomic analysis of stationary-phase and exit in Saccharomyces cerevisiae:: Gene expression and identification of novel essential genes
    Martinez, MJ
    Roy, S
    Archuletta, AB
    Wentzell, PD
    Santa Anna-Arriola, S
    Rodriguez, AL
    Aragon, AD
    Quiñones, GA
    Allen, C
    Werner-Washburne, M
    MOLECULAR BIOLOGY OF THE CELL, 2004, 15 (12) : 5295 - 5305
  • [37] Identification of putative ancestors of the multidrug-resistant Salmonella enterica serovar typhimurium DT104 clone harboring the Salmonella genomic island 1
    J. Matiasovicova
    P. Adams
    P. A. Barrow
    H. Hradecka
    M. Malcova
    R. Karpiskova
    E. Budinska
    L. Pilousova
    I. Rychlik
    Archives of Microbiology, 2007, 187
  • [38] Identification of putative ancestors of the multidrug-resistant Salmonella enterica serovar typhimurium DT104 clone harboring the Salmonella genomic island 1
    Matiasovicova, J.
    Adams, P.
    Barrow, P. A.
    Hradecka, H.
    Malcova, M.
    Karpiskova, R.
    Budinska, E.
    Pilousova, L.
    Rychlik, I.
    ARCHIVES OF MICROBIOLOGY, 2007, 187 (05) : 415 - 424
  • [39] Comparative Genomic Analysis of Citrobacter and Key Genes Essential for the Pathogenicity of Citrobacter koseri
    Yuan, Chao
    Yin, Zhiqiu
    Wang, Junyue
    Qian, Chengqian
    Wei, Yi
    Zhang, Si
    Jiang, Lingyan
    Liu, Bin
    FRONTIERS IN MICROBIOLOGY, 2019, 10
  • [40] Comparative Genomics of Mycoplasma: Analysis of Conserved Essential Genes and Diversity of the Pan-Genome
    Liu, Wei
    Fang, Liurong
    Li, Mao
    Li, Sha
    Guo, Shaohua
    Luo, Rui
    Feng, Zhixin
    Li, Bin
    Zhou, Zhemin
    Shao, Guoqing
    Chen, Huanchun
    Xiao, Shaobo
    PLOS ONE, 2012, 7 (04):