Connecting genomic islands across prokaryotic and phage genomes via protein families

被引:2
|
作者
Aldaihani, Reem [1 ,2 ]
Heath, Lenwood S. [2 ]
机构
[1] Kuwait Univ, Dept Comp Sci, Kuwait, Kuwait
[2] Virginia Tech, Dept Comp Sci, Blacksburg, VA 24061 USA
关键词
HORMAECHEI;
D O I
10.1038/s41598-023-27584-6
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Prokaryotic genomes evolve via horizontal gene transfer (HGT), mutations, and rearrangements. A noteworthy part of the HGT process is facilitated by genomic islands (GIs). While previous computational biology research has focused on developing tools to detect GIs in prokaryotic genomes, there has been little research investigating GI patterns and biological connections across species. We have pursued the novel idea of connecting GIs across prokaryotic and phage genomes via patterns of protein families. Such patterns are sequences of protein families frequently present in the genomes of multiple species. We combined the large data set from the IslandViewer4 database with protein families from Pfam while implementing a comprehensive strategy to identify patterns making use of HMMER, BLAST, and MUSCLE. we also implemented Python programs that link the analysis into a single pipeline. Research results demonstrated that related GIs often exist in species that are evolutionarily unrelated and in multiple bacterial phyla. Analysis of the discovered patterns led to the identification of biological connections among prokaryotes and phages. These connections suggest broad HGT connections across the bacterial kingdom and its associated phages. The discovered patterns and connections could provide the basis for additional analysis on HGT breadth and the patterns in pathogenic GIs.
引用
收藏
页数:9
相关论文
共 8 条
  • [1] Connecting genomic islands across prokaryotic and phage genomes via protein families
    Reem Aldaihani
    Lenwood S. Heath
    Scientific Reports, 13
  • [2] Score-based prediction of genomic islands in prokaryotic genomes using hidden Markov models
    Stephan Waack
    Oliver Keller
    Roman Asper
    Thomas Brodag
    Carsten Damm
    Wolfgang Florian Fricke
    Katharina Surovcik
    Peter Meinicke
    Rainer Merkl
    BMC Bioinformatics, 7
  • [3] Score-based prediction of genomic islands in prokaryotic genomes using hidden Markov models
    Waack, Stephan
    Keller, Oliver
    Asper, Roman
    Brodag, Thomas
    Damm, Carsten
    Fricke, Wolfgang Florian
    Surovcik, Katharina
    Meinicke, Peter
    Merkl, Rainer
    BMC BIOINFORMATICS, 2006, 7 (1)
  • [4] GDPF: a data resource for the distribution of prokaryotic protein families across the global biosphere
    Pan, Zhuo
    Li, Dan-dan
    Li, Peng
    Geng, Yu
    Jiang, Yiru
    Liu, Ya
    Li, Yue-zhong
    Zhang, Zheng
    NUCLEIC ACIDS RESEARCH, 2024, 52 (D1) : D724 - D731
  • [5] InPrePPI: an integrated evaluation method based on genomic context for predicting protein-protein interactions in prokaryotic genomes
    Jingchun Sun
    Yan Sun
    Guohui Ding
    Qi Liu
    Chuan Wang
    Youyu He
    Tieliu Shi
    Yixue Li
    Zhongming Zhao
    BMC Bioinformatics, 8
  • [6] InPrePPI: an integrated evaluation method based on genomic context for predicting protein-protein interactions in prokaryotic genomes
    Sun, Jingchun
    Sun, Yan
    Ding, Guohui
    Liu, Qi
    Wang, Chuan
    He, Youyu
    Shi, Tieliu
    Li, Yixue
    Zhao, Zhongming
    BMC BIOINFORMATICS, 2007, 8 (1)
  • [7] The landscape of diversity and divergence across genomes highlights links between genome structure and evolution in the formation of genomic islands
    Orton, R. W.
    Schield, D. R.
    Nikolakis, Z. L.
    Perry, B. W.
    Demuth, J. P.
    Mackessy, S. P.
    Smith, C. F.
    Meik, J. M.
    Castoe, T. A.
    INTEGRATIVE AND COMPARATIVE BIOLOGY, 2020, 60 : E180 - E180
  • [8] A guild of 45 CRISPR-associated (Cas) protein families and multiple CRISPR/Cas subtypes exist in prokaryotic genomes
    Haft, DH
    Selengut, J
    Mongodin, EF
    Nelson, KE
    PLOS COMPUTATIONAL BIOLOGY, 2005, 1 (06) : 474 - 483