PlantTribes2: Tools for comparative gene family analysis in plant genomics

被引:10
|
作者
Wafula, Eric K. K. [1 ]
Zhang, Huiting [2 ,3 ]
Von Kuster, Gregory [4 ]
Leebens-Mack, James H. H. [5 ]
Honaas, Loren A. A. [2 ]
dePamphilis, Claude W. W. [1 ,4 ]
机构
[1] Penn State Univ, Dept Biol, University Pk, PA 16802 USA
[2] USDA, Tree Fruit Res Lab, ARS, Wenatchee, WA USA
[3] Washington State Univ, Dept Hort, Pullman, WA USA
[4] Penn State Univ, Huck Inst Life Sci, University Pk, PA 16802 USA
[5] Univ Georgia, Dept Plant Biol, Athens, GA USA
来源
关键词
gene family phylogenetics; multiple sequence alignment; genome duplication; galaxy; modular tools; applied agriculture; comparative genomics; CROG analysis; MULTIPLE SEQUENCE ALIGNMENT; PHYLOGENETIC ANALYSIS; SOFTWARE; ORTHOLOGY; EVOLUTION; RESOURCE; CLASSIFICATION; DUPLICATION; ORTHOMCL; PLATFORM;
D O I
10.3389/fpls.2022.1011199
中图分类号
Q94 [植物学];
学科分类号
071001 ;
摘要
Plant genome-scale resources are being generated at an increasing rate as sequencing technologies continue to improve and raw data costs continue to fall; however, the cost of downstream analyses remains large. This has resulted in a considerable range of genome assembly and annotation qualities across plant genomes due to their varying sizes, complexity, and the technology used for the assembly and annotation. To effectively work across genomes, researchers increasingly rely on comparative genomic approaches that integrate across plant community resources and data types. Such efforts have aided the genome annotation process and yielded novel insights into the evolutionary history of genomes and gene families, including complex non-model organisms. The essential tools to achieve these insights rely on gene family analysis at a genome-scale, but they are not well integrated for rapid analysis of new data, and the learning curve can be steep. Here we present PlantTribes2, a scalable, easily accessible, highly customizable, and broadly applicable gene family analysis framework with multiple entry points including user provided data. It uses objective classifications of annotated protein sequences from existing, high-quality plant genomes for comparative and evolutionary studies. PlantTribes2 can improve transcript models and then sort them, either genome-scale annotations or individual gene coding sequences, into pre-computed orthologous gene family clusters with rich functional annotation information. Then, for gene families of interest, PlantTribes2 performs downstream analyses and customizable visualizations including, (1) multiple sequence alignment, (2) gene family phylogeny, (3) estimation of synonymous and non-synonymous substitution rates among homologous sequences, and (4) inference of large-scale duplication events. We give examples of PlantTribes2 applications in functional genomic studies of economically important plant families, namely transcriptomics in the weedy Orobanchaceae and a core orthogroup analysis (CROG) in Rosaceae. PlantTribes2 is freely available for use within the main public Galaxy instance and can be downloaded from GitHub or Bioconda. Importantly, PlantTribes2 can be readily adapted for use with genomic and transcriptomic data from any kind of organism.
引用
收藏
页数:17
相关论文
共 50 条
  • [1] PlantTribes:: a gene and gene family resource for comparative genomics in plants
    Wall, P. Kerr
    Leebens-Mack, Jim
    Mueller, Kai F.
    Field, Dawn
    Altman, Naomi S.
    dePamphilis, Claude W.
    NUCLEIC ACIDS RESEARCH, 2008, 36 : D970 - D976
  • [2] Gene traps: Tools for plant development and genomics
    Springer, PS
    PLANT CELL, 2000, 12 (07): : 1007 - 1020
  • [3] Comparative genomics of the NKG2D ligand gene family
    Kasahara, Masanori
    Sutoh, Yoichi
    IMMUNOLOGICAL REVIEWS, 2015, 267 (01) : 72 - 87
  • [4] Comparative functional genomics analysis of bHLH gene family in rice, maize and wheat
    Wei, Kaifa
    Chen, Huiqin
    BMC PLANT BIOLOGY, 2018, 18
  • [5] Comparative functional genomics analysis of bHLH gene family in rice, maize and wheat
    Kaifa Wei
    Huiqin Chen
    BMC Plant Biology, 18
  • [6] MOSGA 2: Comparative genomics and validation tools
    Martin, Roman
    Dressler, Hagen
    Hattab, Georges
    Hackl, Thomas
    Fischer, Matthias G.
    Heider, Dominik
    COMPUTATIONAL AND STRUCTURAL BIOTECHNOLOGY JOURNAL, 2021, 19 : 5504 - 5509
  • [7] Genomics tools for QTL analysis and gene discovery
    Borevitz, JO
    Chory, J
    CURRENT OPINION IN PLANT BIOLOGY, 2004, 7 (02) : 132 - 136
  • [8] Comparative Genomics, Evolutionary and Gene Regulatory Regions Analysis of Casein Gene Family in Bubalus bubalis
    Rehman, Saif Ur
    Feng, Tong
    Wu, Siwen
    Luo, Xier
    Lei, An
    Luobu, Basang
    Hassan, Faiz-ul
    Liu, Qingyou
    FRONTIERS IN GENETICS, 2021, 12
  • [9] A comparative analysis of the plant cellulose synthase (CesA) gene family
    Holland, N
    Holland, D
    Helentjaris, T
    Dhugga, KS
    Xoconostle-Cazares, B
    Delmer, DP
    PLANT PHYSIOLOGY, 2000, 123 (04) : 1313 - 1323
  • [10] Comparative Genomics Studies on the dmrt Gene Family in Fish
    Dong, Junjian
    Li, Jia
    Hu, Jie
    Sun, Chengfei
    Tian, Yuanyuan
    Li, Wuhui
    Yan, Ningning
    Sun, Chengxi
    Sheng, Xihui
    Yang, Song
    Shi, Qiong
    Ye, Xing
    FRONTIERS IN GENETICS, 2020, 11