Impact of reference population size and marker density on accuracy of population imputation

被引:2
|
作者
Kranjevicova, Anita [1 ,2 ]
Kasna, Eva [2 ]
Brzakova, Michaela [1 ]
Pribyl, Josef [2 ]
Vostry, Lubos [1 ]
机构
[1] Czech Univ Life Sci Prague, Fac Agrobiol Food & Nat Resources, Dept Genet & Breeding, Prague, Czech Republic
[2] Inst Anim Sci, Dept Genet & Breeding Farm Anim, Prague, Czech Republic
关键词
cattle; genomics; marker density; missing SNPs; simulation; GENOTYPES; CHIPS;
D O I
10.17221/148/2019-CJAS
中图分类号
S8 [畜牧、 动物医学、狩猎、蚕、蜂];
学科分类号
0905 ;
摘要
The effect of the reference population size and the number of missing single nucleotide polymorphisms (SNPs) on imputation accuracy was determined. The population imputation method using the Flmpute software was applied. The dataset used for the purpose of this study was taken from the database of the Holstein Cattle Breeders Association of the Czech Republic. It contains 1000 animals genotyped with the Illumina BovineSNP50 v.2 Bead-Chip. Two datasets were created, the first containing the original genotypes, including the missing SNPs, the second containing the same genotypes modified to avoid missing data. In these datasets, animals were randomly selected for a reference population (10, 25, 50 and 75%) and there were randomly selected SNPs for deletion (15, 30, 55, 70, and 95%) in animals that were not used as the reference population. Subsequently, the data accuracy was determined by two parameters: correlation between original and imputed SNPs and percentage of correctly imputed SNPs. Since animals and SNPs were randomly selected, the process including data imputation was repeated 100 times. Accuracy was determined as the average accuracy over all repetitions. It was found that the imputation accuracy is influenced by both parameters. If the size of the reference population is sufficient, the imputation accuracy is higher despite the large number of missing SNPs.
引用
收藏
页码:405 / 410
页数:6
相关论文
共 50 条
  • [1] Accuracy of genotype imputation based on reference population size and marker density in Hanwoo cattle
    Lee, DooHo
    Kim, Yeongkuk
    Chung, Yoonji
    Lee, Dongjae
    Seo, Dongwon
    Choi, Tae Jeong
    Lim, Dajeong
    Yoon, Duhak
    Lee, Seung Hwan
    JOURNAL OF ANIMAL SCIENCE AND TECHNOLOGY, 2021, 63 (06) : 1232 - 1246
  • [2] Impact of reference population relatedness on imputation quality
    Petty, Lauren E.
    Tucker, Lindsay S.
    Highland, Heather M.
    Ramesh, Naveen
    Karhade, Mandar
    Hanis, Craig L.
    Below, Jennifer E.
    GENETIC EPIDEMIOLOGY, 2015, 39 (07) : 574 - 575
  • [3] Optimising Genomic Selection in Wheat: Effect of Marker Density, Population Size and Population Structure on Prediction Accuracy
    Norman, Adam
    Taylor, Julian
    Edwards, James
    Kuchel, Haydn
    G3-GENES GENOMES GENETICS, 2018, 8 (09): : 2889 - 2899
  • [4] Impact of Imputation, Reference Population Structure, and Single Nucleotide Polymorphism Panel Density on Accuracy of Genomic Evaluation in Purebred and Crossbred Populations
    Barjasteh, S.
    Dashab, G. R.
    Rokouei, M.
    Shariati, M.
    Valleh, M. Vafaye
    JOURNAL OF AGRICULTURAL SCIENCE AND TECHNOLOGY, 2021, 23 (01): : 27 - 40
  • [5] Accuracy of imputation using the most common sires as reference population in layer chickens
    Marzieh Heidaritabar
    Mario P. L. Calus
    Addie Vereijken
    Martien A. M. Groenen
    John W. M. Bastiaansen
    BMC Genetics, 16
  • [6] Accuracy of imputation using the most common sires as reference population in layer chickens
    Heidaritabar, Marzieh
    Calus, Mario P. L.
    Vereijken, Addie
    Groenen, Martien A. M.
    Bastiaansen, John W. M.
    BMC GENETICS, 2015, 16
  • [7] Accuracy of imputation of single-nucleotide polymorphism marker genotypes for water buffaloes (Bubalus bubalis) using different reference population sizes and imputation tools
    Ghoreishifar, Seyed Mohammad
    Moradi-Shahrbabak, Hossein
    Moradi-Shahrbabak, Mohammad
    Nicolazzi, Ezequiel L.
    Williams, John L.
    Iamartino, Daniela
    Nejati-Javaremi, Ardeshir
    LIVESTOCK SCIENCE, 2018, 216 : 174 - 182
  • [8] Genomic Prediction in Pea: Effect of Marker Density and Training Population Size and Composition on Prediction Accuracy
    Tayeh, Nadim
    Klein, Anthony
    Le Paslier, Marie-Christine
    Jacquin, Francoise
    Houtin, Herve
    Rond, Celine
    Chabert-Martinello, Marianne
    Magnin-Robert, Jean-Bernard
    Marget, Pascal
    Aubert, Gregoire
    Burstin, Judith
    FRONTIERS IN PLANT SCIENCE, 2015, 6
  • [9] Marker Genotype Imputation in a Low-Marker-Density Panel with a High-Marker-Density Reference Panel: Accuracy Evaluation in Barley Breeding Lines
    Iwata, Hiroyoshi
    Jannink, Jean-Luc
    CROP SCIENCE, 2010, 50 (04) : 1269 - 1278
  • [10] Effect of reference population size and available ancestor genotypes on imputation of Mexican Holstein genotypes
    Garcia-Ruiz, A.
    Ruiz-Lopez, F. J.
    Wiggans, G. R.
    Van Tassell, C. P.
    Montaldo, H. H.
    JOURNAL OF DAIRY SCIENCE, 2015, 98 (05) : 3478 - 3484