conservation;
leucine-rich repeat (LRR);
minisatellites;
phylogenetic analysis;
plant genomics;
protein evolution;
R genes;
tandem repeats (TRs);
LEUCINE-RICH REPEAT;
ARABIDOPSIS-THALIANA;
MICROSATELLITE LOCI;
LRR PROTEINS;
DYNAMICS;
IDENTIFICATION;
HUMANS;
ORIGIN;
DUPLICATION;
RESISTANCE;
D O I:
10.1111/nph.13184
中图分类号:
Q94 [植物学];
学科分类号:
071001 ;
摘要:
Sequence tandem repeats (TRs) are abundant in proteomes across all domains of life. For plants, little is known about their distribution or contribution to protein function. We exhaustively annotated TRs and studied the evolution of TR unit variations for all Ensembl plants. Using phylogenetic patterns of TR units, we detected conserved TRs with unit number and order preserved during evolution, and those TRs that have diverged via recent TR unit gains/losses. We correlated the mode of evolution of TRs to protein function. TR number was strongly correlated with proteome size, with about one-half of all TRs recognized as common protein domains. The majority of TRs have been highly conserved over long evolutionary distances, some since the separation of red algae and green plants c. 1.6 billion yr ago. Conversely, recurrent recent TR unit mutations were rare. Our results suggest that the first TRs by far predate the first plants, and that TR appearance is an ongoing process with similar rates across the plant kingdom. Interestingly, the few detected highly mutable TRs might provide a source of variation for rapid adaptation. In particular, such TRs are enriched in leucine-rich repeats (LRRs) commonly found in R genes, where TR unit gain/loss may facilitate resistance to emerging pathogens.
机构:
Stanford Univ, Sch Med, 300 Pasteur Dr, Palo Alto, CA 94504 USA
Sanford Burnham Prebys Med Discovery Inst, 10901 N Torrey Pines Rd, La Jolla, CA 92037 USAStanford Univ, Sch Med, 300 Pasteur Dr, Palo Alto, CA 94504 USA