A multi-task convolutional deep neural network for variant calling in single molecule sequencing

被引:90
|
作者
Luo, Ruibang [1 ,2 ]
Sedlazeck, Fritz J. [3 ]
Lam, Tak-Wah [1 ]
Schatz, Michael C. [2 ]
机构
[1] Univ Hong Kong, Dept Comp Sci, Hong Kong 999077, Peoples R China
[2] Johns Hopkins Univ, Dept Comp Sci, Baltimore, MD 21218 USA
[3] Baylor Coll Med, Human Genome Sequencing Ctr, Houston, TX 77030 USA
基金
美国国家卫生研究院; 美国国家科学基金会;
关键词
GENOME; SNP;
D O I
10.1038/s41467-019-09025-z
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
The accurate identification of DNA sequence variants is an important, but challenging task in genomics. It is particularly difficult for single molecule sequencing, which has a pernucleotide error rate of similar to 5-15%. Meeting this demand, we developed Clairvoyante, a multi-task five-layer convolutional neural network model for predicting variant type (SNP or indel), zygosity, alternative allele and indel length from aligned reads. For the well-characterized NA12878 human sample, Clairvoyante achieves 99.67, 95.78, 90.53% F1-score on 1KP common variants, and 98.65, 92.57, 87.26% F1-score for whole-genome analysis, using Illumina, PacBio, and Oxford Nanopore data, respectively. Training on a second human sample shows Clairvoyante is sample agnostic and finds variants in less than 2 h on a standard server. Furthermore, we present 3,135 variants that are missed using Illumina but supported independently by both PacBio and Oxford Nanopore reads. Clairvoyante is available open-source (https://github.com/aquaskyline/Clairvoyante), with modules to train, utilize and visualize the model.
引用
收藏
页数:11
相关论文
共 50 条
  • [1] A multi-task convolutional deep neural network for variant calling in single molecule sequencing
    Ruibang Luo
    Fritz J. Sedlazeck
    Tak-Wah Lam
    Michael C. Schatz
    Nature Communications, 10
  • [2] Multi-task deep convolutional neural network for cancer diagnosis
    Liao, Qing
    Ding, Ye
    Jiang, Zoe L.
    Wang, Xuan
    Zhang, Chunkai
    Zhang, Qian
    NEUROCOMPUTING, 2019, 348 : 66 - 73
  • [3] A multi-task learning convolutional neural network for source localization in deep ocean
    Liu, Yining
    Niu, Haiqiang
    Li, Zhenglin
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2020, 148 (02): : 873 - 883
  • [4] Deep Convolutional Neural Network with Multi-Task Learning Scheme for Modulations Recognition
    Mossad, Omar S.
    ElNainay, Mustafa
    Torki, Marwan
    2019 15TH INTERNATIONAL WIRELESS COMMUNICATIONS & MOBILE COMPUTING CONFERENCE (IWCMC), 2019, : 1644 - 1649
  • [5] Multi-task convolutional deep neural network for recommendation based on knowledge graphs
    Jiang, Mingyang
    Li, Man
    Cao, Wenming
    Yang, Mingming
    Zhou, Luxin
    NEUROCOMPUTING, 2025, 619
  • [6] Dynamic Multi-Task Learning with Convolutional Neural Network
    Fang, Yuchun
    Ma, Zhengyan
    Zhang, Zhaoxiang
    Zhang, Xu-Yao
    Bai, Xiang
    PROCEEDINGS OF THE TWENTY-SIXTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, : 1668 - 1674
  • [7] Heterogeneous Multi-task Learning for Human Pose Estimation with Deep Convolutional Neural Network
    Li, Sijin
    Liu, Zhi-Qiang
    Chan, Antoni B.
    2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2014, : 488 - +
  • [8] A multi-task fully deep convolutional neural network for contactless fingerprint minutiae extraction
    Zhang, Zhao
    Liu, Shuxin
    Liu, Manhua
    PATTERN RECOGNITION, 2021, 120
  • [9] Atrial Fibrillation Burden Estimation Using Multi-Task Deep Convolutional Neural Network
    Prabhakararao, Eedara
    Dandapat, Samarendra
    IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2022, 26 (12) : 5992 - 6002
  • [10] Heterogeneous Multi-task Learning for Human Pose Estimation with Deep Convolutional Neural Network
    Li, Sijin
    Liu, Zhi-Qiang
    Chan, Antoni B.
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2015, 113 (01) : 19 - 36