The Effect of Methodological Considerations on the Construction of Gene-Based Plant Pan-genomes

Cited by: 5
Authors
Glick, Lior [1 ]
Mayrose, Itay [1 ]
Affiliations
[1] Tel Aviv Univ, Sch Plant Sci & Food Secur, Dept Life Sci, Tel Aviv, Israel
Source
GENOME BIOLOGY AND EVOLUTION | 2023 / Vol. 15 / Issue 07
Keywords
pan-genome; assembly; annotation; gene content; presence-absence variation; READ ALIGNMENT; ARCHITECTURE; ASSEMBLIES; PREDICTION; TOOL;
DOI
10.1093/gbe/evad121
Chinese Library Classification number
Q [Biological Sciences];
Subject classification codes
07 ; 0710 ; 09 ;
Abstract
Pan-genomics is an emerging approach for studying the genetic diversity within plant populations. In contrast to common resequencing studies that compare whole genome sequencing data with a single reference genome, the construction of a pan-genome (PG) involves the direct comparison of multiple genomes to one another, thereby enabling the detection of genomic sequences and genes not present in the reference, as well as the analysis of gene content diversity. Although multiple studies describing PGs of various plant species have been published in recent years, a better understanding regarding the effect of the computational procedures used for PG construction could guide researchers in making more informed methodological decisions. Here, we examine the effect of several key methodological factors on the obtained gene pool and on gene presence-absence detections by constructing and comparing multiple PGs of Arabidopsis thaliana and cultivated soybean, as well as conducting a meta-analysis on published PGs. These factors include the construction method, the sequencing depth, and the extent of input data used for gene annotation. We observe substantial differences between PGs constructed using three common procedures (de novo assembly and annotation, map-to-pan, and iterative assembly) and that results are dependent on the extent of the input data. Specifically, we report low agreement between the gene content inferred using different procedures and input data. Our results should increase the awareness of the community to the consequences of methodological decisions made during the process of PG construction and emphasize the need for further investigation of commonly applied methodologies.
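To make the notion of "agreement between the gene content inferred using different procedures" concrete, the following is a minimal sketch of one possible agreement measure, the per-accession Jaccard index between two gene presence calls. It is an illustration only and not necessarily the metric used in the paper. The function names, the accession labels, and the toy gene sets are hypothetical, and the sketch assumes that gene identifiers have already been matched between the two pan-genomes, which in practice is a nontrivial step.

```python
from typing import Dict, Set


def jaccard(a: Set[str], b: Set[str]) -> float:
    """Jaccard index of two gene sets; two empty sets count as identical."""
    union = a | b
    if not union:
        return 1.0
    return len(a & b) / len(union)


def pav_agreement(
    pav_a: Dict[str, Set[str]], pav_b: Dict[str, Set[str]]
) -> Dict[str, float]:
    """Per-accession agreement between two presence-absence call sets.

    Each input maps an accession name to the set of genes called present.
    Only accessions found in both inputs are compared.
    """
    shared = sorted(set(pav_a) & set(pav_b))
    return {acc: jaccard(pav_a[acc], pav_b[acc]) for acc in shared}


# Toy data (hypothetical): genes called present by two construction procedures.
de_novo = {"acc1": {"g1", "g2", "g3"}, "acc2": {"g1", "g3"}}
map_to_pan = {"acc1": {"g1", "g2"}, "acc2": {"g1", "g2", "g3", "g4"}}

scores = pav_agreement(de_novo, map_to_pan)
for accession, score in scores.items():
    print(f"{accession}: Jaccard = {score:.2f}")
```

A value near 1 means the two procedures call nearly the same genes present in that accession, while lower values indicate disagreement of the kind the abstract reports.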
Pages: 18