PlantTribes2: Tools for comparative gene family analysis in plant genomics

被引:10
|
作者
Wafula, Eric K. K. [1 ]
Zhang, Huiting [2 ,3 ]
Von Kuster, Gregory [4 ]
Leebens-Mack, James H. H. [5 ]
Honaas, Loren A. A. [2 ]
dePamphilis, Claude W. W. [1 ,4 ]
机构
[1] Penn State Univ, Dept Biol, University Pk, PA 16802 USA
[2] USDA, Tree Fruit Res Lab, ARS, Wenatchee, WA USA
[3] Washington State Univ, Dept Hort, Pullman, WA USA
[4] Penn State Univ, Huck Inst Life Sci, University Pk, PA 16802 USA
[5] Univ Georgia, Dept Plant Biol, Athens, GA USA
来源
关键词
gene family phylogenetics; multiple sequence alignment; genome duplication; galaxy; modular tools; applied agriculture; comparative genomics; CROG analysis; MULTIPLE SEQUENCE ALIGNMENT; PHYLOGENETIC ANALYSIS; SOFTWARE; ORTHOLOGY; EVOLUTION; RESOURCE; CLASSIFICATION; DUPLICATION; ORTHOMCL; PLATFORM;
D O I
10.3389/fpls.2022.1011199
中图分类号
Q94 [植物学];
学科分类号
071001 ;
摘要
Plant genome-scale resources are being generated at an increasing rate as sequencing technologies continue to improve and raw data costs continue to fall; however, the cost of downstream analyses remains large. This has resulted in a considerable range of genome assembly and annotation qualities across plant genomes due to their varying sizes, complexity, and the technology used for the assembly and annotation. To effectively work across genomes, researchers increasingly rely on comparative genomic approaches that integrate across plant community resources and data types. Such efforts have aided the genome annotation process and yielded novel insights into the evolutionary history of genomes and gene families, including complex non-model organisms. The essential tools to achieve these insights rely on gene family analysis at a genome-scale, but they are not well integrated for rapid analysis of new data, and the learning curve can be steep. Here we present PlantTribes2, a scalable, easily accessible, highly customizable, and broadly applicable gene family analysis framework with multiple entry points including user provided data. It uses objective classifications of annotated protein sequences from existing, high-quality plant genomes for comparative and evolutionary studies. PlantTribes2 can improve transcript models and then sort them, either genome-scale annotations or individual gene coding sequences, into pre-computed orthologous gene family clusters with rich functional annotation information. Then, for gene families of interest, PlantTribes2 performs downstream analyses and customizable visualizations including, (1) multiple sequence alignment, (2) gene family phylogeny, (3) estimation of synonymous and non-synonymous substitution rates among homologous sequences, and (4) inference of large-scale duplication events. We give examples of PlantTribes2 applications in functional genomic studies of economically important plant families, namely transcriptomics in the weedy Orobanchaceae and a core orthogroup analysis (CROG) in Rosaceae. PlantTribes2 is freely available for use within the main public Galaxy instance and can be downloaded from GitHub or Bioconda. Importantly, PlantTribes2 can be readily adapted for use with genomic and transcriptomic data from any kind of organism.
引用
收藏
页数:17
相关论文
共 50 条
  • [31] Comparative genomics analysis of the MYB gene family in barley: preliminary insights into evolution and biological function in Blue Qingke
    Li, Hongyan
    Yao, Youhua
    Li, Xin
    Cui, Yongmei
    An, Likun
    Ding, Baojun
    Yao, Xiaohua
    Wu, Kunlun
    PEERJ, 2024, 12
  • [32] Comparative genomics and transcriptomics analysis of the bHLH gene family indicate their roles in regulating flavonoid biosynthesis in Sophora flavescens
    Liu, Ake
    Lu, Junjie
    Song, Huifang
    Wang, Xi
    Wang, Mingyang
    Lei, Zhenhong
    Liu, Huixuan
    Lei, Haiying
    Niu, Tianzeng
    FRONTIERS IN PLANT SCIENCE, 2024, 15
  • [33] Comparative Genomics Analysis of Ganoderma Orthologs Involved in Plant-Pathogenesis
    Ho, Chai-Ling
    FORESTS, 2023, 14 (03):
  • [34] Editorial: Microbial comparative genomics and pangenomics: new tools, approaches and insights into gene and genome evolution
    Verma, Digvijay
    Satyanarayana, Tulasi
    Dias, Paulo Jorge
    FRONTIERS IN GENETICS, 2024, 15
  • [35] Characterization and comparative analysis of the EGLN gene family
    Taylor, MS
    GENE, 2001, 275 (01) : 125 - 132
  • [36] DNA microarrays for comparative genomics and analysis of gene expression in Trypanosoma cruzi
    Baptista, CS
    Vêncio, RZN
    Abdala, S
    Valadares, MP
    Martins, C
    Pereira, CAD
    Zingales, B
    MOLECULAR AND BIOCHEMICAL PARASITOLOGY, 2004, 138 (02) : 183 - 194
  • [37] Analysis of the Prefoldin Gene Family in 14 Plant Species
    Cao, Jun
    FRONTIERS IN PLANT SCIENCE, 2016, 7
  • [38] Comparative genomics and adaptive selection of the ATP-binding-cassette gene family in Caenorhabditis species
    Zhao, Zhongying
    Thomas, James H.
    Chen, Nansheng
    Sheps, Jonathan A.
    Baillie, David L.
    GENETICS, 2007, 175 (03) : 1407 - 1418
  • [39] Comparative genomics of six Juglans species reveals disease-associated gene family contractions
    Trouern-Trend, Alexander J.
    Falk, Taylor
    Zaman, Sumaira
    Caballero, Madison
    Neale, David B.
    Langley, Charles H.
    Dandekar, Abhaya M.
    Stevens, Kristian A.
    Wegrzyn, Jill L.
    PLANT JOURNAL, 2020, 102 (02): : 410 - 423
  • [40] Origin, evolution and functional diversity of bHLH gene family: insights from comparative genomics and interactomics
    Min, Mingwei
    Xu, Junli
    Yue, Bisong
    Zhang, Yizheng
    Li, Xiao
    2010 4TH INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICAL ENGINEERING (ICBBE 2010), 2010,