Classifying aneuploidy in genotype intensity data using deep learning

被引:3
|
作者
Bouwman, Aniek C. [1 ,4 ]
Hulsegge, Ina [1 ]
Hawken, Rachel J. [2 ]
Henshall, John M. [3 ]
Veerkamp, Roel F. [1 ]
Schokker, Dirkjan [1 ]
Kamphuis, Claudia [1 ]
机构
[1] Wageningen Univ & Res, Anim Breeding & Genom, Wageningen, Netherlands
[2] Cobb Vantress Inc, Siloam Springs, AR USA
[3] Cobb Vantress BV, Boxmeer, Netherlands
[4] Wageningen Univ & Res, Anim Breeding & Genom, POB 338, NL-6700 AH Wageningen, Netherlands
关键词
aneuploidy; B-allele frequency; chromosome; embryo transfer; SNP; POPULATIONS;
D O I
10.1111/jbg.12760
中图分类号
S8 [畜牧、 动物医学、狩猎、蚕、蜂];
学科分类号
0905 ;
摘要
Aneuploidy is the loss or gain of one or more chromosomes. Although it is a rare phenomenon in liveborn individuals, it is observed in livestock breeding populations. These breeding populations are often routinely genotyped and the genotype intensity data from single nucleotide polymorphism (SNP) arrays can be exploited to identify aneuploidy cases. This identification is a time-consuming and costly task, because it is often performed by visual inspection of the data per chromosome, usually done in plots of the intensity data by an expert. Therefore, we wanted to explore the feasibility of automated image classification to replace (part of) the visual detection procedure for any diploid species. The aim of this study was to develop a deep learning Convolutional Neural Network (CNN) classification model based on chromosome level plots of SNP array intensity data that can classify the images into disomic, monosomic and trisomic cases. A multispecies dataset enriched for aneuploidy cases was collected containing genotype intensity data of 3321 disomic, 1759 monosomic and 164 trisomic chromosomes. The final CNN model had an accuracy of 99.9%, overall precision was 1, recall was 0.98 and the F1 score was 0.99 for classifying images from intensity data. The high precision assures that cases detected are most likely true cases, however, some trisomy cases may be missed (the recall of the class trisomic was 0.94). This supervised CNN model performed much better than an unsupervised k-means clustering, which reached an accuracy of 0.73 and had especially difficult to classify trisomic cases correctly. The developed CNN classification model provides high accuracy to classify aneuploidy cases based on images of plotted X and Y genotype intensity values. The classification model can be used as a tool for routine screening in large diploid populations that are genotyped to get a better understanding of the incidence and inheritance, and in addition, avoid anomalies in breeding candidates.
引用
收藏
页码:304 / 315
页数:12
相关论文
共 50 条
  • [1] Classifying neuromorphic data using a deep learning framework for image classification
    Gopalakrishnan, Roshan
    Chua, Yansong
    Iyer, Laxmi R.
    2018 15TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION, ROBOTICS AND VISION (ICARCV), 2018, : 1520 - 1524
  • [2] DEEP LEARNING APPROACHES FOR CLASSIFYING DATA: A REVIEW
    Bikku, Thulasi
    Sree, K. P. N. V. Satya
    JOURNAL OF ENGINEERING SCIENCE AND TECHNOLOGY, 2020, 15 (04): : 2580 - 2594
  • [3] Classifying Gas Data Measured Under Multiple Conditions Using Deep Learning
    Lee, Hojung
    Hwang, Jaehui
    Park, Hwin Dol
    Choi, Jae Hun
    Lee, Jong-Seok
    IEEE ACCESS, 2022, 10 : 68138 - 68150
  • [4] Using Deep Learning for Classifying Ship Trajectories
    Ljunggren, Henrik
    2018 21ST INTERNATIONAL CONFERENCE ON INFORMATION FUSION (FUSION), 2018, : 2158 - 2164
  • [5] A deep learning framework for characterization of genotype data
    Ausmees, Kristiina
    Nettelblad, Carl
    G3-GENES GENOMES GENETICS, 2022, 12 (03):
  • [6] Automated Prediction of Exercise Intensity Using Physiological Data and Deep Learning
    Aref Smiley
    Joseph Finkelstein
    SN Computer Science, 6 (4)
  • [7] A Deep Learning Approach for Classifying Emotions from Physiological Data
    AlZoubi, Omar
    ALMakhadmeh, Buthina
    Tawalbeh, Saja Khaled
    Yassien, Muneer Bani
    Hmeidi, Ismail
    2020 11TH INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATION SYSTEMS (ICICS), 2020, : 214 - 219
  • [8] Deep Learning for Classifying Physical Activities from Accelerometer Data
    Nunavath, Vimala
    Johansen, Sahand
    Johannessen, Tommy Sandtorv
    Jiao, Lei
    Hansen, Bjorge Herman
    Berntsen, Sveinung
    Goodwin, Morten
    SENSORS, 2021, 21 (16)
  • [9] Classifying Tongue Images using Deep Transfer Learning
    Song, Chao
    Wang, Bin
    Xu, Jiatuo
    2020 5TH INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND APPLICATIONS (ICCIA 2020), 2020, : 103 - 107
  • [10] Classifying Red and Healthy Eyes using Deep Learning
    Verma, Sherry
    Singh, Latika
    Chaudhry, Monica
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2019, 10 (07) : 525 - 531