Evaluating Structural Variation Detection Tools for Long-Read Sequencing Datasets in Saccharomyces cerevisiae

被引:14
|
作者
Luan, Mei-Wei [1 ]
Zhang, Xiao-Ming [2 ]
Zhu, Zi-Bin [1 ]
Chen, Ying [3 ]
Xie, Shang-Qian [1 ]
机构
[1] Hainan Univ, Key Lab Genet & Germplasm Innovat Trop Special Fo, Minist Educ,Coll Forestry, Hainan Key Lab Biol Trop Ornamental Plant Germpla, Haikou, Hainan, Peoples R China
[2] Inner Mongolia Agr Univ, Coll Grassland Resources & Environm, Hohhot, Peoples R China
[3] Sun Yat Sen Univ, Zhongshan Ophthalm Ctr, State Key Lab Ophthalmol, Guangzhou, Peoples R China
基金
中国国家自然科学基金;
关键词
structural variation; long-read sequencing; PacBio and ONT; SV caller; Saccharomyces cerevisiae; HUMAN GENOME; INSIGHTS; IMPACT;
D O I
10.3389/fgene.2020.00159
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Structural variation (SV) represents a major form of genetic variations that contribute to polymorphic variations, human diseases, and phenotypes in many organisms. Long-read sequencing has been successfully used to identify novel and complex SVs. However, comparison of SV detection tools for long-read sequencing datasets has not been reported. Therefore, we developed an analysis workflow that combined two alignment tools (NGMLR and minimap2) and five callers (Sniffles, Picky, smartie-sv, PBHoney, and NanoSV) to evaluate the SV detection in six datasets of Saccharomyces cerevisiae. The accuracy of SV regions was validated by re-aligning raw reads in diverse alignment tools, SV callers, experimental conditions, and sequencing platforms. The results showed that SV detection between NGMLR and minimap2 was not significant when using the same caller. The PBHoney was with the highest average accuracy (89.04%) and Picky has the lowest average accuracy (35.85%). The accuracy of NanoSV, Sniffles, and smartie-sv was 68.67%, 60.47%, and 57.67%, respectively. In addition, smartie-sv and NanoSV detected the most and least number of SVs, and SV detection from the PacBio sequencing platform was significantly more than that from ONT (p = 0.000173).
引用
收藏
页数:10
相关论文
共 50 条
  • [1] Systematic benchmarking of tools for structural variation detection using short- and long-read sequencing data in pigs
    He, Sang
    Song, Bangmin
    Tang, Yueting
    Qu, Xiaolu
    Li, Xingzheng
    Yang, Xintong
    Bao, Qi
    Fang, Lingzhao
    Jiang, Jicai
    Tang, Zhonglin
    Yi, Guoqiang
    ISCIENCE, 2025, 28 (03)
  • [2] Long-read sequencing - for detecting clinically relevant structural variation
    Hoischen, A.
    Wenger, A. M.
    van der Vorst, M.
    Kwint, M.
    Nelen, M.
    Neveling, K.
    Baybayan, P.
    Hickey, L.
    Kuijpers, J.
    Korlach, J.
    Corcoran, K.
    Brunner, H. G.
    Vissers, L. E. L. M.
    Gilissen, C.
    EUROPEAN JOURNAL OF HUMAN GENETICS, 2019, 27 : 849 - 849
  • [3] Long-read sequencing settings for efficient structural variation detection based on comprehensive evaluation
    Jiang, Tao
    Liu, Shiqi
    Cao, Shuqi
    Liu, Yadong
    Cui, Zhe
    Wang, Yadong
    Guo, Hongzhe
    BMC BIOINFORMATICS, 2021, 22 (01)
  • [4] Long-read sequencing settings for efficient structural variation detection based on comprehensive evaluation
    Tao Jiang
    Shiqi Liu
    Shuqi Cao
    Yadong Liu
    Zhe Cui
    Yadong Wang
    Hongzhe Guo
    BMC Bioinformatics, 22
  • [5] Evaluating long-read RNA-sequencing analysis tools with in silico mixtures
    Dong, Xueyi
    Ritchie, Matthew E.
    NATURE METHODS, 2023, 20 (11) : 1643 - 1644
  • [7] On detection of somatic structural variation in highly repetitive regions using long-read sequencing data
    Shiraishi, Yuichi
    CANCER SCIENCE, 2024, 115 : 31 - 31
  • [8] Comparative analysis of metagenomic classifiers for long-read sequencing datasets
    Maric, Josip
    Krizanovic, Kresimir
    Riondet, Sylvain
    Nagarajan, Niranjan
    Sikic, Mile
    BMC BIOINFORMATICS, 2024, 25 (01)
  • [9] Comparative analysis of metagenomic classifiers for long-read sequencing datasets
    Josip Marić
    Krešimir Križanović
    Sylvain Riondet
    Niranjan Nagarajan
    Mile Šikić
    BMC Bioinformatics, 25
  • [10] Benchmarking long-read aligners and SV callers for structural variation detection in Oxford nanopore sequencing data
    Helal, Asmaa A.
    Saad, Bishoy T.
    Saad, Mina T.
    Mosaad, Gamal S.
    Aboshanab, Khaled M.
    SCIENTIFIC REPORTS, 2024, 14 (01)