Prediction of biogeographical ancestry from genotype: a comparison of classifiers

被引:20
|
作者
Cheung, Elaine Y. Y. [1 ]
Gahan, Michelle Elizabeth [1 ]
McNevin, Dennis [1 ]
机构
[1] Univ Canberra, Natl Ctr Forens Studies, Fac Educ Sci Technol & Math ESTeM, Bruce, ACT 2601, Australia
关键词
Biogeographical ancestry (BGA); Phenotype prediction; STRUCTURE; Bayesian; Genetic distance; Multinomial logistic regression; DETERMINING CONTINENTAL ORIGIN; GENOME-WIDE PATTERNS; POPULATION-STRUCTURE; INFORMATIVE MARKERS; DIVERSITY; ADMIXTURE; PANEL; ASSAY; AMERICANS; INFERENCE;
D O I
10.1007/s00414-016-1504-3
中图分类号
DF [法律]; D9 [法律]; R [医药、卫生];
学科分类号
0301 ; 10 ;
摘要
DNA can provide forensic intelligence regarding a donor's biogeographical ancestry (BGA) and other externally visible characteristics (EVCs). A number of algorithms have been proposed to assign individual human genotypes to a BGA using ancestry informative marker (AIM) panels. This study compares the BGA assignment accuracy of the population clustering program STRUCTURE and three generic classification approaches including a Bayesian algorithm, genetic distance, and multinomial logistic regression (MLR). A selection of 142 ancestry informative single nucleotide polymorphisms (SNPs) were chosen from existing marker panels (SNPforID 34-plex, Eurasiaplex, Seldin, and Kidd's AIM panels) to assess BGA classification at the continental level for Africans, Europeans, East Asians, and Amerindians. A training set of 1093 individuals with self-declared BGA from the 1000 Genomes phase 1 database was used by each classifier to predict BGA in a test set of 516 individuals from the HGDP-CEPH (Stanford) cell line panel. Tests were repeated with 0, 10, 50, 70, and 90% of the genotypes missing. Comparison of the area under the receiver operating characteristic curves (AUROCs) showed high accuracy in STRUCTURE and the generic Bayesian approach. The latter algorithm offers a computationally simpler alternative to STRUCTURE with little loss in accuracy and is suitable for phenotype prediction while STRUCTURE is not.
引用
收藏
页码:901 / 912
页数:12
相关论文
共 50 条
  • [31] Pigment phenotype and biogeographical ancestry from ancient skeletal remains: inferences from multiplexed autosomal SNP analysis
    Caroline Bouakaze
    Christine Keyser
    Eric Crubézy
    Daniel Montagnon
    Bertrand Ludes
    International Journal of Legal Medicine, 2009, 123 : 315 - 325
  • [32] A COMPARISON BETWEEN DIFFERENT CLASSIFIERS FOR TENNIS MATCH RESULT PREDICTION
    Ghosh, Soumadip
    Sadhu, Shayak
    Biswas, Sushanta
    Sarkar, Debasree
    Sarkar, Partha Pratim
    MALAYSIAN JOURNAL OF COMPUTER SCIENCE, 2019, 32 (02) : 97 - 111
  • [33] An experimental comparison of ensemble of classifiers for bankruptcy prediction and credit scoring
    Nanni, Loris
    Lumini, Alessandra
    EXPERT SYSTEMS WITH APPLICATIONS, 2009, 36 (02) : 3028 - 3033
  • [34] ROC Based Evaluation and Comparison of Classifiers for IVF Implantation Prediction
    Uyar, Ash
    Bener, Ayse
    Ciray, H. Nadir
    Bahceci, Mustafa
    ELECTRONIC HEALTHCARE, SECOND INTERNATIONAL ICST CONFERENCE, EHEALTH 2009, 2010, 27 : 108 - +
  • [35] Comparison of Machine Learning Classifiers for Protein Secondary Structure Prediction
    Aydin, Zafer
    Kaynar, Oguz
    Gormez, Yasin
    Isik, Yunus Emre
    2018 26TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2018,
  • [36] SNPs associated with physical traits: A valuable tool for the inference of biogeographical ancestry
    Daniel, Runa
    Sanchez, Juan J.
    Nassif, Najah T.
    Hernandez, Alexis
    Walsh, Simon J.
    FORENSIC SCIENCE INTERNATIONAL GENETICS SUPPLEMENT SERIES, 2008, 1 (01) : 538 - 540
  • [37] Development of a SNP multiplex assay for the inference of biogeographical ancestry and pigmentation phenotype
    Castel, Charmain
    Piper, Anita
    FORENSIC SCIENCE INTERNATIONAL GENETICS SUPPLEMENT SERIES, 2011, 3 (01) : E411 - E412
  • [38] Chronic kidney disease: a prediction and comparison of ensemble and basic classifiers performance
    Vikas Chaurasia
    Mithilesh Kumar Pandey
    Saurabh Pal
    Human-Intelligent Systems Integration, 2022, 4 (1-2) : 1 - 10
  • [39] A comparison of classifiers for detecting emotion from speech
    Shafran, I
    Mohri, M
    2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING, 2005, : 341 - 344
  • [40] Poster Abstract: Comparison of Classifiers for Prediction of Human Actions in a Smart Home
    Alhafidh, Basman M. Hasan
    Daood, Amar, I
    Allen, William H.
    2018 IEEE/ACM THIRD INTERNATIONAL CONFERENCE ON INTERNET-OF-THINGS DESIGN AND IMPLEMENTATION (IOTDI 2020), 2018, : 287 - 288