Prediction of biogeographical ancestry from genotype: a comparison of classifiers

被引:20
|
作者
Cheung, Elaine Y. Y. [1 ]
Gahan, Michelle Elizabeth [1 ]
McNevin, Dennis [1 ]
机构
[1] Univ Canberra, Natl Ctr Forens Studies, Fac Educ Sci Technol & Math ESTeM, Bruce, ACT 2601, Australia
关键词
Biogeographical ancestry (BGA); Phenotype prediction; STRUCTURE; Bayesian; Genetic distance; Multinomial logistic regression; DETERMINING CONTINENTAL ORIGIN; GENOME-WIDE PATTERNS; POPULATION-STRUCTURE; INFORMATIVE MARKERS; DIVERSITY; ADMIXTURE; PANEL; ASSAY; AMERICANS; INFERENCE;
D O I
10.1007/s00414-016-1504-3
中图分类号
DF [法律]; D9 [法律]; R [医药、卫生];
学科分类号
0301 ; 10 ;
摘要
DNA can provide forensic intelligence regarding a donor's biogeographical ancestry (BGA) and other externally visible characteristics (EVCs). A number of algorithms have been proposed to assign individual human genotypes to a BGA using ancestry informative marker (AIM) panels. This study compares the BGA assignment accuracy of the population clustering program STRUCTURE and three generic classification approaches including a Bayesian algorithm, genetic distance, and multinomial logistic regression (MLR). A selection of 142 ancestry informative single nucleotide polymorphisms (SNPs) were chosen from existing marker panels (SNPforID 34-plex, Eurasiaplex, Seldin, and Kidd's AIM panels) to assess BGA classification at the continental level for Africans, Europeans, East Asians, and Amerindians. A training set of 1093 individuals with self-declared BGA from the 1000 Genomes phase 1 database was used by each classifier to predict BGA in a test set of 516 individuals from the HGDP-CEPH (Stanford) cell line panel. Tests were repeated with 0, 10, 50, 70, and 90% of the genotypes missing. Comparison of the area under the receiver operating characteristic curves (AUROCs) showed high accuracy in STRUCTURE and the generic Bayesian approach. The latter algorithm offers a computationally simpler alternative to STRUCTURE with little loss in accuracy and is suitable for phenotype prediction while STRUCTURE is not.
引用
收藏
页码:901 / 912
页数:12
相关论文
共 50 条
  • [1] Prediction of biogeographical ancestry from genotype: a comparison of classifiers
    Elaine Y Y Cheung
    Michelle Elizabeth Gahan
    Dennis McNevin
    International Journal of Legal Medicine, 2017, 131 : 901 - 912
  • [2] Forensic inference of biogeographical ancestry from genotype: The Genetic Ancestry Lab
    McNevin, Dennis
    WILEY INTERDISCIPLINARY REVIEWS: FORENSIC SCIENCE, 2020, 2 (02):
  • [3] Biogeographical Ancestry Inference from Genotype: A Comparison of Ancestral Informative SNPs and Genome-wide SNPs
    Qu, Yue
    Tran, Dat
    Martinez-Marroquin, Elisa
    2020 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (SSCI), 2020, : 64 - 70
  • [4] Prediction of biogeographical ancestry in admixed individuals
    Cheung, Elaine Y. Y.
    Gahan, Michelle Elizabeth
    McNevin, Dennis
    FORENSIC SCIENCE INTERNATIONAL-GENETICS, 2018, 36 : 104 - 111
  • [5] Biogeographical ancestry and race
    Gannett, Lisa
    STUDIES IN HISTORY AND PHILOSOPHY OF SCIENCE PART C-STUDIES IN HISTORY AND PHILOSOPHY OF BIOLOGICAL AND BIOMEDIAL SCIENCES, 2014, 47 : 173 - 184
  • [6] Genetic estimation of biogeographical ancestry.
    Pfaff, CL
    Parra, EJ
    Shriver, MD
    AMERICAN JOURNAL OF HUMAN GENETICS, 2000, 67 (04) : 221 - 221
  • [7] Machine Learning overview for biogeographical ancestry prediction-a PLS-DA approach
    Alladio, Eugenio
    Poggiali, Brando
    Cosenza, Giulia
    Cisana, Selena
    Omedei, Monica
    Garofano, Paolo
    Pilli, Elena
    FORENSIC SCIENCE INTERNATIONAL GENETICS SUPPLEMENT SERIES, 2022, 8 : 306 - 307
  • [8] Predictive DNA analysis for biogeographical ancestry
    Cheung, Elaine Y. Y.
    Gahan, Michelle Elizabeth
    McNevin, Dennis
    AUSTRALIAN JOURNAL OF FORENSIC SCIENCES, 2018, 50 (06) : 651 - 658
  • [9] Comparison of Classifiers for the Risk of Diabetes Prediction
    Nai-arun, Nongyao
    Moungmai, Rungruttikarn
    7TH INTERNATIONAL CONFERENCE ON ADVANCES IN INFORMATION TECHNOLOGY, 2015, 69 : 132 - 142
  • [10] Biogeographical Ancestry Analyses Using the ForenSeq™° DNA Signature Prep Kit and Multiple Prediction Tools
    Salvo, Nina Mjolsnes
    Olsen, Gunn-Hege
    Berg, Thomas
    Janssen, Kirstin
    GENES, 2024, 15 (04)