Classifying aneuploidy in genotype intensity data using deep learning

被引:3
|
作者
Bouwman, Aniek C. [1 ,4 ]
Hulsegge, Ina [1 ]
Hawken, Rachel J. [2 ]
Henshall, John M. [3 ]
Veerkamp, Roel F. [1 ]
Schokker, Dirkjan [1 ]
Kamphuis, Claudia [1 ]
机构
[1] Wageningen Univ & Res, Anim Breeding & Genom, Wageningen, Netherlands
[2] Cobb Vantress Inc, Siloam Springs, AR USA
[3] Cobb Vantress BV, Boxmeer, Netherlands
[4] Wageningen Univ & Res, Anim Breeding & Genom, POB 338, NL-6700 AH Wageningen, Netherlands
关键词
aneuploidy; B-allele frequency; chromosome; embryo transfer; SNP; POPULATIONS;
D O I
10.1111/jbg.12760
中图分类号
S8 [畜牧、 动物医学、狩猎、蚕、蜂];
学科分类号
0905 ;
摘要
Aneuploidy is the loss or gain of one or more chromosomes. Although it is a rare phenomenon in liveborn individuals, it is observed in livestock breeding populations. These breeding populations are often routinely genotyped and the genotype intensity data from single nucleotide polymorphism (SNP) arrays can be exploited to identify aneuploidy cases. This identification is a time-consuming and costly task, because it is often performed by visual inspection of the data per chromosome, usually done in plots of the intensity data by an expert. Therefore, we wanted to explore the feasibility of automated image classification to replace (part of) the visual detection procedure for any diploid species. The aim of this study was to develop a deep learning Convolutional Neural Network (CNN) classification model based on chromosome level plots of SNP array intensity data that can classify the images into disomic, monosomic and trisomic cases. A multispecies dataset enriched for aneuploidy cases was collected containing genotype intensity data of 3321 disomic, 1759 monosomic and 164 trisomic chromosomes. The final CNN model had an accuracy of 99.9%, overall precision was 1, recall was 0.98 and the F1 score was 0.99 for classifying images from intensity data. The high precision assures that cases detected are most likely true cases, however, some trisomy cases may be missed (the recall of the class trisomic was 0.94). This supervised CNN model performed much better than an unsupervised k-means clustering, which reached an accuracy of 0.73 and had especially difficult to classify trisomic cases correctly. The developed CNN classification model provides high accuracy to classify aneuploidy cases based on images of plotted X and Y genotype intensity values. The classification model can be used as a tool for routine screening in large diploid populations that are genotyped to get a better understanding of the incidence and inheritance, and in addition, avoid anomalies in breeding candidates.
引用
收藏
页码:304 / 315
页数:12
相关论文
共 50 条
  • [41] Classifying diabetic macular edema grades using extended power of deep learning
    Amit Kumar
    Anand Shanker Tewari
    Multimedia Tools and Applications, 2024, 83 : 14151 - 14172
  • [42] Classifying customer’s written questions by topics using deep learning (LSTM)
    Na S.-T.
    Yang G.-D.
    Shin J.-S.
    Transactions of the Korean Institute of Electrical Engineers, 2019, 68 (11): : 1411 - 1416
  • [43] Classifying Chinese Medicine Constitution Using Multimodal Deep-Learning Model
    Tian-yu Gu
    Zhuang-zhi Yan
    Jie-hui Jiang
    Chinese Journal of Integrative Medicine, 2024, 30 : 163 - 170
  • [44] Intuitionistic Fuzzy Recurrence Plots for Classifying Cardiac Arrhythmias Using Deep Learning
    Mujica-Vargas, Dante
    Vela-Rincon, Virna V.
    Luna-alvarez, Antonio
    Muniz, Andres Antonio Arenas
    PROGRAMMING AND COMPUTER SOFTWARE, 2024, 50 (08) : 662 - 673
  • [45] Classifying sex and strain from mouse ultrasonic vocalizations using deep learning
    Ivanenko, A.
    Watkins, P.
    van Gerven, M. A. J.
    Hammerschmidt, K.
    Englitz, B.
    PLOS COMPUTATIONAL BIOLOGY, 2020, 16 (06)
  • [46] Classifying and clustering malicious advertisement uniform resource locators using deep learning
    Zhang, Xichen
    Lashkari, Arash Habibi
    Ghorbani, Ali A.
    COMPUTATIONAL INTELLIGENCE, 2021, 37 (01) : 511 - 537
  • [47] Classifying Alcoholics and Control Patients Using Deep Learning and Peak Visualization Method
    Fayyaz, Asim
    Maqbool, Muaz
    Saeed, Mehreen
    ICVISP 2019: PROCEEDINGS OF THE 3RD INTERNATIONAL CONFERENCE ON VISION, IMAGE AND SIGNAL PROCESSING, 2019,
  • [48] Classifying handedness in chiral nanomaterials using label error robust deep learning
    Groschner, C. K.
    Pattison, Alexander J.
    Ben-Moshe, Assaf
    Alivisatos, A. Paul
    Theis, Wolfgang
    Scott, M. C.
    NPJ COMPUTATIONAL MATERIALS, 2022, 8 (01)
  • [49] Classifying diabetic macular edema grades using extended power of deep learning
    Kumar, Amit
    Tewari, Anand Shanker
    MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (05) : 14151 - 14172
  • [50] Classifying Endangered Species in High-Risk Areas Using Deep Learning
    Brito, Cristian
    Engdahl, Andrea
    Atkinson, John
    ADVANCES AND TRENDS IN ARTIFICIAL INTELLIGENCE: THEORY AND APPLICATIONS, IEA-AIE 2024, 2024, 14748 : 23 - 34