SVcnn: an accurate deep learning-based method for detecting structural variation based on long-read data

被引:7
|
作者
Zheng, Yan [1 ]
Shang, Xuequn [1 ]
机构
[1] Northwestern Polytech Univ, Sch Comp Sci, West Youyi Rd 127, Xian 710072, Peoples R China
基金
中国国家自然科学基金;
关键词
Long-read sequencing data; Structural variations; SV caller; Deep learning; PAIRED-END; IMPACT; VARIANTS; INDELS; CANCER;
D O I
10.1186/s12859-023-05324-x
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: Structural variations (SVs) refer to variations in an organism's chromosome structure that exceed a length of 50 base pairs. They play a significant role in genetic diseases and evolutionary mechanisms. While long-read sequencing technology has led to the development of numerous SV caller methods, their performance results have been suboptimal. Researchers have observed that current SV callers often miss true SVs and generate many false SVs, especially in repetitive regions and areas with multi-allelic SVs. These errors are due to the messy alignments of long-read data, which are affected by their high error rate. Therefore, there is a need for a more accurate SV caller method. Result: We propose a new method-SVcnn, a more accurate deep learning-based method for detecting SVs by using long-read sequencing data. We run SVcnn and other SV callers in three real datasets and find that SVcnn improves the F1-score by 2-8% compared with the second-best method when the read depth is greater than 5x. More importantly, SVcnn has better performance for detecting multi-allelic SVs. Conclusions: SVcnn is an accurate deep learning-based method to detect SVs. The program is available at https://github.com/nwpuzhengyan/SVcnn.
引用
收藏
页数:19
相关论文
共 50 条
  • [31] Deep learning-based structural health monitoring
    Cha, Young-Jin
    Ali, Rahmat
    Lewis, John
    Buyukozturk, Oral
    AUTOMATION IN CONSTRUCTION, 2024, 161
  • [32] Vibration data feature extraction and deep learning-based preprocessing method for highly accurate motor fault diagnosis
    Jang, Jun-Gyo
    Noh, Chun-Myoung
    Kim, Sung-Soo
    Shin, Sung-Chul
    Lee, Soon-Sup
    Lee, Jae-Chul
    JOURNAL OF COMPUTATIONAL DESIGN AND ENGINEERING, 2023, 10 (01) : 204 - 220
  • [33] Deep learning-based data processing method for transient thermoreflectance measurements
    Mao, Yali
    Zhou, Shaojie
    Tang, Weiyuan
    Wu, Mei
    Zhang, Haochen
    Sun, Haiding
    Yuan, Chao
    JOURNAL OF APPLIED PHYSICS, 2024, 135 (09)
  • [34] A Deep Learning-Based Classification Method for Different Frequency EEG Data
    Wen, Tingxi
    Du, Yu
    Pan, Ting
    Huang, Chuanbo
    Zhang, Zhongnan
    Wong, Kelvin
    COMPUTATIONAL AND MATHEMATICAL METHODS IN MEDICINE, 2021, 2021
  • [35] Benchmarking long-read aligners and SV callers for structural variation detection in Oxford nanopore sequencing data
    Helal, Asmaa A.
    Saad, Bishoy T.
    Saad, Mina T.
    Mosaad, Gamal S.
    Aboshanab, Khaled M.
    SCIENTIFIC REPORTS, 2024, 14 (01)
  • [36] Benchmarking long-read aligners and SV callers for structural variation detection in Oxford nanopore sequencing data
    Asmaa A. Helal
    Bishoy T. Saad
    Mina T. Saad
    Gamal S. Mosaad
    Khaled M. Aboshanab
    Scientific Reports, 14
  • [37] Machine learning-based detection of adventitious microbes in T-cell therapy cultures using long-read sequencing
    Strutt, James P. B.
    Natarajan, Meenubharathi
    Lee, Elizabeth
    Teo, Denise Bei Lin
    Sin, Wei-Xiang
    Cheung, Ka-Wai
    Chew, Marvin
    Thazin, Khaing
    Barone, Paul W.
    Wolfrum, Jacqueline M.
    Williams, Rohan B. H.
    Rice, Scott A.
    Springs, Stacy L.
    MICROBIOLOGY SPECTRUM, 2023, 11 (05):
  • [38] Machine learning-based detection of adventitious microbes in T-cell therapy cultures using long-read sequencing
    Strutt, James P. B.
    Natarajan, Meenubharathi
    Lee, Elizabeth
    Teo, Denise Bei Lin
    Sin, Wei-Xiang
    Cheung, Ka-Wai
    Chew, Marvin
    Thazin, Khaing
    Barone, Paul W.
    Wolfrum, Jacqueline M.
    Williams, Rohan B. H.
    Rice, Scott A.
    Springs, Stacy L.
    MICROBIOLOGY SPECTRUM, 2023,
  • [39] A deep learning-based method for detecting and classifying the ultrasound images of suspicious thyroid nodules
    Zhao, Zijian
    Yang, Congmin
    Wang, Qian
    Zhang, Huawei
    Shi, Linlin
    Zhang, Zhiwen
    MEDICAL PHYSICS, 2021, 48 (12) : 7959 - 7970
  • [40] Deep learning-based method for detecting anomalies of operating equipment dynamically in livestock farms
    Park, Hyeon
    Park, Dae-Heon
    Kim, Se-Han
    11TH INTERNATIONAL CONFERENCE ON ICT CONVERGENCE: DATA, NETWORK, AND AI IN THE AGE OF UNTACT (ICTC 2020), 2020, : 1182 - 1185