Classifying aneuploidy in genotype intensity data using deep learning

Cited by: 3
Authors
Bouwman, Aniek C. [1, 4]
Hulsegge, Ina [1]
Hawken, Rachel J. [2]
Henshall, John M. [3]
Veerkamp, Roel F. [1]
Schokker, Dirkjan [1]
Kamphuis, Claudia [1]
Affiliations
[1] Wageningen Univ & Res, Anim Breeding & Genom, Wageningen, Netherlands
[2] Cobb Vantress Inc, Siloam Springs, AR USA
[3] Cobb Vantress BV, Boxmeer, Netherlands
[4] Wageningen Univ & Res, Anim Breeding & Genom, POB 338, NL-6700 AH Wageningen, Netherlands
Keywords
aneuploidy; B-allele frequency; chromosome; embryo transfer; SNP; populations
DOI
10.1111/jbg.12760
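Chinese Library Classification number
S8 [Animal husbandry, veterinary medicine, hunting, sericulture, apiculture]
Discipline classification code
0905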
Abstract
Aneuploidy is the loss or gain of one or more chromosomes. Although it is rare in liveborn individuals, it is observed in livestock breeding populations. These breeding populations are often routinely genotyped, and the genotype intensity data from single nucleotide polymorphism (SNP) arrays can be exploited to identify aneuploidy cases. Identification is time-consuming and costly because it is usually performed by an expert who visually inspects plots of the intensity data, chromosome by chromosome. Therefore, we wanted to explore the feasibility of automated image classification to replace (part of) the visual detection procedure for any diploid species. The aim of this study was to develop a deep learning Convolutional Neural Network (CNN) classification model, based on chromosome-level plots of SNP array intensity data, that can classify the images into disomic, monosomic and trisomic cases. A multispecies dataset enriched for aneuploidy cases was collected, containing genotype intensity data of 3321 disomic, 1759 monosomic and 164 trisomic chromosomes. The final CNN model had an accuracy of 99.9%, with an overall precision of 1, a recall of 0.98 and an F1 score of 0.99 for classifying images from intensity data. The high precision means that detected cases are most likely true cases; however, some trisomy cases may be missed (the recall of the trisomic class was 0.94). This supervised CNN model performed much better than unsupervised k-means clustering, which reached an accuracy of 0.73 and had particular difficulty classifying trisomic cases correctly. The developed CNN classification model classifies aneuploidy cases with high accuracy based on images of plotted X and Y genotype intensity values.
The classification model can be used as a tool for routine screening in large genotyped diploid populations, both to gain a better understanding of the incidence and inheritance of aneuploidy and to avoid anomalies in breeding candidates.
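The abstract reports accuracy, precision, recall and F1 for a three-class problem (disomic, monosomic, trisomic). As a minimal sketch of how such per-class figures are derived from a confusion matrix, the Python below uses the standard definitions. The confusion matrix is hypothetical and invented for illustration only; it is not taken from the study, and the function names `per_class_metrics` and `accuracy` are likewise assumptions. The abstract does not state how the overall precision and recall were averaged across classes, so only per-class values are computed here.

```python
# Hypothetical confusion matrix: rows = true class, columns = predicted class.
# These counts are invented for illustration and are NOT results from the study.
CLASSES = ["disomic", "monosomic", "trisomic"]
CONFUSION = [
    [50, 0, 0],   # true disomic
    [0, 30, 0],   # true monosomic
    [2, 0, 18],   # true trisomic (2 missed, predicted as disomic)
]


def per_class_metrics(matrix, labels):
    """Return precision, recall and F1 for each class of a square confusion matrix."""
    n = len(labels)
    metrics = {}
    for i, label in enumerate(labels):
        tp = matrix[i][i]
        fn = sum(matrix[i]) - tp                        # true class i, predicted otherwise
        fp = sum(matrix[r][i] for r in range(n)) - tp   # predicted i, true class otherwise
        precision = tp / (tp + fp) if (tp + fp) else 0.0
        recall = tp / (tp + fn) if (tp + fn) else 0.0
        denom = precision + recall
        f1 = 2 * precision * recall / denom if denom else 0.0
        metrics[label] = {"precision": precision, "recall": recall, "f1": f1}
    return metrics


def accuracy(matrix):
    """Fraction of all samples that lie on the diagonal (correct predictions)."""
    correct = sum(matrix[i][i] for i in range(len(matrix)))
    total = sum(sum(row) for row in matrix)
    return correct / total


if __name__ == "__main__":
    print(f"accuracy = {accuracy(CONFUSION):.3f}")
    for name, m in per_class_metrics(CONFUSION, CLASSES).items():
        print(f"{name}: precision={m['precision']:.3f} "
              f"recall={m['recall']:.3f} f1={m['f1']:.3f}")
```

In this toy matrix the trisomic class has perfect precision but a recall below 1, which mirrors the pattern the abstract describes: detected cases are reliable, while some true trisomies are missed.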
Pages: 304-315
Number of pages: 12