PlantTribes2: Tools for comparative gene family analysis in plant genomics

Cited by: 10
Authors
Wafula, Eric K. [1]
Zhang, Huiting [2,3]
Von Kuster, Gregory [4]
Leebens-Mack, James H. [5]
Honaas, Loren A. [2]
dePamphilis, Claude W. [1,4]
Affiliations
[1] Penn State Univ, Dept Biol, University Pk, PA 16802 USA
[2] USDA, Tree Fruit Res Lab, ARS, Wenatchee, WA USA
[3] Washington State Univ, Dept Hort, Pullman, WA USA
[4] Penn State Univ, Huck Inst Life Sci, University Pk, PA 16802 USA
[5] Univ Georgia, Dept Plant Biol, Athens, GA USA
Keywords
gene family phylogenetics; multiple sequence alignment; genome duplication; galaxy; modular tools; applied agriculture; comparative genomics; CROG analysis; MULTIPLE SEQUENCE ALIGNMENT; PHYLOGENETIC ANALYSIS; SOFTWARE; ORTHOLOGY; EVOLUTION; RESOURCE; CLASSIFICATION; DUPLICATION; ORTHOMCL; PLATFORM;
DOI
10.3389/fpls.2022.1011199
Chinese Library Classification
Q94 [Botany];
Discipline classification code
071001;
Abstract
Plant genome-scale resources are being generated at an increasing rate as sequencing technologies continue to improve and raw data costs continue to fall; however, the cost of downstream analyses remains large. This has resulted in a considerable range of genome assembly and annotation qualities across plant genomes due to their varying sizes, complexity, and the technology used for the assembly and annotation. To effectively work across genomes, researchers increasingly rely on comparative genomic approaches that integrate across plant community resources and data types. Such efforts have aided the genome annotation process and yielded novel insights into the evolutionary history of genomes and gene families, including complex non-model organisms. The essential tools to achieve these insights rely on gene family analysis at genome scale, but they are not well integrated for rapid analysis of new data, and the learning curve can be steep. Here we present PlantTribes2, a scalable, easily accessible, highly customizable, and broadly applicable gene family analysis framework with multiple entry points, including user-provided data. It uses objective classifications of annotated protein sequences from existing, high-quality plant genomes for comparative and evolutionary studies. PlantTribes2 can improve transcript models and then sort them, either genome-scale annotations or individual gene coding sequences, into pre-computed orthologous gene family clusters with rich functional annotation information. Then, for gene families of interest, PlantTribes2 performs downstream analyses and customizable visualizations, including (1) multiple sequence alignment, (2) gene family phylogeny, (3) estimation of synonymous and non-synonymous substitution rates among homologous sequences, and (4) inference of large-scale duplication events.
We give examples of PlantTribes2 applications in functional genomic studies of economically important plant families, namely transcriptomics in the weedy Orobanchaceae and a core orthogroup analysis (CROG) in Rosaceae. PlantTribes2 is freely available for use within the main public Galaxy instance and can be downloaded from GitHub or Bioconda. Importantly, PlantTribes2 can be readily adapted for use with genomic and transcriptomic data from any kind of organism.
Pages: 17
Related papers
50 items in total
  • [41] COMPARATIVE GENOMICS/TRANSCRIPTOMICS TO STUDY EFFECTOR GENE BIRTH IN PLANT-PARASITIC NEMATODES.
    Eves-van den Akker, S.
    Lilley, C. J.
    Danchin, E. G. J.
    Rancurel, C.
    Cock, P. J. A.
    Jones, L. M.
    Yusup, H. B.
    Jones, J. T.
    Urwin, P. E.
    JOURNAL OF NEMATOLOGY, 2015, 47 (03) : 235 - 235
  • [42] Comparative analysis of the superoxide dismutase gene family in Cetartiodactyla
    Tian, Ran
    Geng, Yuepan
    Guo, Han
    Yang, Chen
    Seim, Inge
    Yang, Guang
    JOURNAL OF EVOLUTIONARY BIOLOGY, 2021, 34 (07) : 1046 - 1060
  • [43] In silico Comparative Analysis of Gene and Protein of Plant Lectins
    Khairiya, Fathiya
    Dwivany, Fenny Martha
    Suhandono, Sony
    Hessel, Sofia Safitri
    Zainuddin, Ima Mulyama
    Tallei, Trina Ekawati
    PERTANIKA JOURNAL OF TROPICAL AGRICULTURAL SCIENCE, 2023, 46 (03) : 815 - 838
  • [44] Comparative analysis of the RTFL peptide family on the control of plant organogenesis
    Pin Guo
    Asami Yoshimura
    Naoko Ishikawa
    Takahiro Yamaguchi
    Youhao Guo
    Hirokazu Tsukaya
    Journal of Plant Research, 2015, 128 : 497 - 510
  • [45] Comparative analysis of the RTFL peptide family on the control of plant organogenesis
    Guo, Pin
    Yoshimura, Asami
    Ishikawa, Naoko
    Yamaguchi, Takahiro
    Guo, Youhao
    Tsukaya, Hirokazu
    JOURNAL OF PLANT RESEARCH, 2015, 128 (03) : 497 - 510
  • [46] Comparative Genomics Uncovers Unique Gene Turnover and Evolutionary Rates in a Gene Family Involved in the Detection of Insect Cuticular Pheromones
    Torres-Oliva, Montserrat
    Almeida, Francisca C.
    Sanchez-Gracia, Alejandro
    Rozas, Julio
    GENOME BIOLOGY AND EVOLUTION, 2016, 8 (06) : 1734 - 1747
  • [47] PlasmoGF: an integrated system for comparative genomics and phylogenetic analysis of Plasmodium gene families
    Xu, Xiang
    Wu, Jinyu
    Xiao, Jian
    Tan, Yi
    Bao, Qiyu
    Zhao, Fangqing
    Li, Xiaokun
    BIOINFORMATICS, 2008, 24 (09) : 1217 - 1220
  • [48] The basic helix-loop-helix protein family: Comparative genomics and phylogenetic analysis
    Ledent, V
    Vervoort, M
    GENOME RESEARCH, 2001, 11 (05) : 754 - 770
  • [49] Comparative Genomics and Evolutionary Analysis of RNA-Binding Proteins of the CsrA Family in the Genus Pseudomonas
    Sobrero, Patricio Martin
    Valverde, Claudio
    FRONTIERS IN MOLECULAR BIOSCIENCES, 2020, 7
  • [50] Identification and comparative analysis of the CIPK gene family and characterization of the cold stress response in the woody plant Prunus mume
    Li, Ping
    Zheng, Tangchun
    Li, Lulu
    Zhuo, Xiaokang
    Jing, Liangbaoi
    Wang, Jia
    Cheng, Tangren
    Zhang, Qixiang
    PEERJ, 2019, 7