Evaluating Structural Variation Detection Tools for Long-Read Sequencing Datasets in Saccharomyces cerevisiae

被引:14
|
作者
Luan, Mei-Wei [1 ]
Zhang, Xiao-Ming [2 ]
Zhu, Zi-Bin [1 ]
Chen, Ying [3 ]
Xie, Shang-Qian [1 ]
机构
[1] Hainan Univ, Key Lab Genet & Germplasm Innovat Trop Special Fo, Minist Educ,Coll Forestry, Hainan Key Lab Biol Trop Ornamental Plant Germpla, Haikou, Hainan, Peoples R China
[2] Inner Mongolia Agr Univ, Coll Grassland Resources & Environm, Hohhot, Peoples R China
[3] Sun Yat Sen Univ, Zhongshan Ophthalm Ctr, State Key Lab Ophthalmol, Guangzhou, Peoples R China
基金
中国国家自然科学基金;
关键词
structural variation; long-read sequencing; PacBio and ONT; SV caller; Saccharomyces cerevisiae; HUMAN GENOME; INSIGHTS; IMPACT;
D O I
10.3389/fgene.2020.00159
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Structural variation (SV) represents a major form of genetic variations that contribute to polymorphic variations, human diseases, and phenotypes in many organisms. Long-read sequencing has been successfully used to identify novel and complex SVs. However, comparison of SV detection tools for long-read sequencing datasets has not been reported. Therefore, we developed an analysis workflow that combined two alignment tools (NGMLR and minimap2) and five callers (Sniffles, Picky, smartie-sv, PBHoney, and NanoSV) to evaluate the SV detection in six datasets of Saccharomyces cerevisiae. The accuracy of SV regions was validated by re-aligning raw reads in diverse alignment tools, SV callers, experimental conditions, and sequencing platforms. The results showed that SV detection between NGMLR and minimap2 was not significant when using the same caller. The PBHoney was with the highest average accuracy (89.04%) and Picky has the lowest average accuracy (35.85%). The accuracy of NanoSV, Sniffles, and smartie-sv was 68.67%, 60.47%, and 57.67%, respectively. In addition, smartie-sv and NanoSV detected the most and least number of SVs, and SV detection from the PacBio sequencing platform was significantly more than that from ONT (p = 0.000173).
引用
收藏
页数:10
相关论文
共 50 条
  • [31] Genomics in the long-read sequencing era
    van Dijk, Erwin L.
    Naquin, Delphine
    Gorrichon, Kevin
    Jaszczyszyn, Yan
    Ouazahrou, Rania
    Thermes, Claude
    Hernandez, Celine
    TRENDS IN GENETICS, 2023, 39 (09) : 649 - 671
  • [32] Long-read sequencing for brain tumors
    Shelton, William J.
    Zandpazandi, Sara
    Nix, J. Stephen
    Gokden, Murat
    Bauer, Michael
    Ryan, Katie Rose
    Wardell, Christopher P.
    Vaske, Olena Morozova
    Rodriguez, Analiz
    FRONTIERS IN ONCOLOGY, 2024, 14
  • [33] Long-read sequencing in human genetics
    Kraft, Florian
    Kurth, Ingo
    MEDIZINISCHE GENETIK, 2019, 31 (02) : 198 - 204
  • [34] Method of the year: long-read sequencing
    Marx, Vivien
    NATURE METHODS, 2023, 20 (01) : 6 - 11
  • [35] Reimagining Long-Read DNA Sequencing
    不详
    CHEMICAL ENGINEERING PROGRESS, 2017, 113 (10) : 28 - 28
  • [36] Long-read sequencing in fungal identification
    Hoang, Minh Thuy Vi
    Irinyi, Laszlo
    Meyer, Wieland
    MICROBIOLOGY AUSTRALIA, 2022, 43 (01) : 14 - 18
  • [37] Transcriptomics in the era of long-read sequencing
    Monzo, Carolina
    Liu, Tianyuan
    Conesa, Ana
    NATURE REVIEWS GENETICS, 2025,
  • [38] Long-read sequencing goes clinical
    Neveling, K.
    Derks, R.
    Kwint, M.
    van de Vorst, M.
    Gardeitchik, T.
    Nelen, M.
    EUROPEAN JOURNAL OF HUMAN GENETICS, 2019, 27 : 516 - 516
  • [39] Evaluation of taxonomic classification and profiling methods for long-read shotgun metagenomic sequencing datasets
    Daniel M. Portik
    C. Titus Brown
    N. Tessa Pierce-Ward
    BMC Bioinformatics, 23
  • [40] Evaluation of taxonomic classification and profiling methods for long-read shotgun metagenomic sequencing datasets
    Portik, Daniel M.
    Brown, C. Titus
    Pierce-Ward, N. Tessa
    BMC BIOINFORMATICS, 2022, 23 (01)