Short Insertion and Deletion Discoveries via Whole-Genome Sequencing of 101 Thoroughbred Racehorses

被引:2
|
作者
Tozaki, Teruaki [1 ]
Ohnuma, Aoi [1 ]
Kikuchi, Mio [1 ]
Ishige, Taichiro [1 ]
Kakoi, Hironaga [1 ]
Hirota, Kei-ichi [1 ]
Takahashi, Yuji [2 ]
Nagata, Shun-ichi [1 ]
机构
[1] Lab Racing Chem, Genet Anal Dept, 1731-2 Tsurutamachi, Utsunomiya, Tochigi 3200851, Japan
[2] Japan Racing Assoc, Equine Res Inst, 1400-4 Shiba, Shimotsuke, Tochigi 3290412, Japan
关键词
gene doping; horseracing; INDEL; parentage test; SNV; MYOSTATIN GENE; HORSES; LOCI; POPULATION; VALIDATION; VARIANTS;
D O I
10.3390/genes14030638
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Thoroughbreds are some of the most famous racehorses worldwide and are currently animals of high economic value. To understand genomic variability in Thoroughbreds, we identified genome-wide insertions and deletions (INDELs) and obtained their allele frequencies in this study. INDELs were obtained from whole-genome sequencing data of 101 Thoroughbred racehorses by mapping sequence reads to the horse reference genome. By integrating individual data, 1,453,349 and 113,047 INDELs were identified in the autosomal (1-31) and X chromosomes, respectively, while 18 INDELs were identified on the mitochondrial genome, totaling 1,566,414 INDELs. Of those, 779,457 loci (49.8%) were novel INDELs, while 786,957 loci (50.2%) were already registered in Ensembl. The sizes of diallelic INDELs ranged from -286 to +476, and the majority, 717,736 (52.14%) and 220,672 (16.03%), were 1-bp and 2-bp variants, respectively. Numerous INDELs were found to have lower frequencies of alternative (Alt) alleles. Many rare variants with low Alt allele frequencies (<0.5%) were also detected. In addition, 5955 loci were genotyped as having a minor allele frequency of 0.5 and being heterogeneous genotypes in all the horses. While short-read sequencing and its mapping to reference genome is a simple way of detecting variants, fake variants may be detected. Therefore, our data help to identify true variants in Thoroughbred horses. The INDEL database we constructed will provide useful information for genetic studies and industrial applications in Thoroughbred horses, including a gene-editing test for gene-doping control and a parentage test using INDELs for horse registration and identification.
引用
收藏
页数:12
相关论文
共 50 条
  • [21] Whole-genome sequencing for HCM screening
    Fernandez-Ruiz I.
    Nature Reviews Cardiology, 2018, 15 (10) : 582 - 582
  • [22] Whole-genome sequencing of the UK Biobank
    Halldorsson, Bjarni, V
    Stefansson, Kari
    NATURE, 2022,
  • [23] Whole-Genome Sequencing in Personalized Therapeutics
    Cordero, P.
    Ashley, E. A.
    CLINICAL PHARMACOLOGY & THERAPEUTICS, 2012, 91 (06) : 1001 - 1009
  • [24] Whole-genome sequencing diagnostics for newborns
    Louisa Flintoft
    Nature Reviews Genetics, 2012, 13 (11) : 758 - 758
  • [25] Whole-Genome Sequencing in Outbreak Analysis
    Gilchrist, Carol A.
    Turner, Stephen D.
    Riley, Margaret F.
    Petri, William A., Jr.
    Hewlett, Erik L.
    CLINICAL MICROBIOLOGY REVIEWS, 2015, 28 (03) : 541 - 563
  • [26] PennCNV in whole-genome sequencing data
    Lima, Leandro de Araujo
    Wang, Kai
    BMC BIOINFORMATICS, 2017, 18
  • [27] Human whole-genome shotgun sequencing
    Weber, JL
    Myers, EW
    GENOME RESEARCH, 1997, 7 (05) : 401 - 409
  • [28] PennCNV in whole-genome sequencing data
    Leandro de Araújo Lima
    Kai Wang
    BMC Bioinformatics, 18
  • [29] Novel Partial Exon 51 Deletion in the Duchenne Muscular Dystrophy Gene Identified via Whole Exome Sequencing and Long-Read Whole-Genome Sequencing
    Li, Qianqian
    Chen, Zhanni
    Xiong, Hui
    Li, Ranran
    Yu, Chenguang
    Meng, Jingjing
    Shi, Panlai
    Kong, Xiangdong
    FRONTIERS IN GENETICS, 2021, 12
  • [30] STRScan: targeted profiling of short tandem repeats in whole-genome sequencing data
    Haixu Tang
    Etienne Nzabarushimana
    BMC Bioinformatics, 18