Evaluating Structural Variation Detection Tools for Long-Read Sequencing Datasets in Saccharomyces cerevisiae

被引:14
|
作者
Luan, Mei-Wei [1 ]
Zhang, Xiao-Ming [2 ]
Zhu, Zi-Bin [1 ]
Chen, Ying [3 ]
Xie, Shang-Qian [1 ]
机构
[1] Hainan Univ, Key Lab Genet & Germplasm Innovat Trop Special Fo, Minist Educ,Coll Forestry, Hainan Key Lab Biol Trop Ornamental Plant Germpla, Haikou, Hainan, Peoples R China
[2] Inner Mongolia Agr Univ, Coll Grassland Resources & Environm, Hohhot, Peoples R China
[3] Sun Yat Sen Univ, Zhongshan Ophthalm Ctr, State Key Lab Ophthalmol, Guangzhou, Peoples R China
基金
中国国家自然科学基金;
关键词
structural variation; long-read sequencing; PacBio and ONT; SV caller; Saccharomyces cerevisiae; HUMAN GENOME; INSIGHTS; IMPACT;
D O I
10.3389/fgene.2020.00159
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Structural variation (SV) represents a major form of genetic variations that contribute to polymorphic variations, human diseases, and phenotypes in many organisms. Long-read sequencing has been successfully used to identify novel and complex SVs. However, comparison of SV detection tools for long-read sequencing datasets has not been reported. Therefore, we developed an analysis workflow that combined two alignment tools (NGMLR and minimap2) and five callers (Sniffles, Picky, smartie-sv, PBHoney, and NanoSV) to evaluate the SV detection in six datasets of Saccharomyces cerevisiae. The accuracy of SV regions was validated by re-aligning raw reads in diverse alignment tools, SV callers, experimental conditions, and sequencing platforms. The results showed that SV detection between NGMLR and minimap2 was not significant when using the same caller. The PBHoney was with the highest average accuracy (89.04%) and Picky has the lowest average accuracy (35.85%). The accuracy of NanoSV, Sniffles, and smartie-sv was 68.67%, 60.47%, and 57.67%, respectively. In addition, smartie-sv and NanoSV detected the most and least number of SVs, and SV detection from the PacBio sequencing platform was significantly more than that from ONT (p = 0.000173).
引用
收藏
页数:10
相关论文
共 50 条
  • [41] SVDF: enhancing structural variation detect from long-read sequencing via automatic filtering strategies
    Hu, Heng
    Gao, Runtian
    Gao, Wentao
    Gao, Bo
    Jiang, Zhongjun
    Zhou, Murong
    Wang, Guohua
    Jiang, Tao
    BRIEFINGS IN BIOINFORMATICS, 2024, 25 (04)
  • [42] De novo-assembled long-read genomes of Saccharomyces cerevisiae strains in commerce
    Shwed, P. S.
    Anoop, V.
    Leveque, G.
    MICROBIOLOGY RESOURCE ANNOUNCEMENTS, 2025, 14 (01)
  • [43] Method of the year: long-read sequencing
    Vivien Marx
    Nature Methods, 2023, 20 : 6 - 11
  • [44] The Application of Long-Read Sequencing to Cancer
    Ermini, Luca
    Driguez, Patrick
    CANCERS, 2024, 16 (07)
  • [45] Nanopore long-read sequencing of circRNAs
    Rahimi, Karim
    Nielsen, Anne Faerch
    Veno, Morten T.
    Kjems, Jorgen
    METHODS, 2021, 196 : 23 - 29
  • [46] Tools and Strategies for Long-Read Sequencing and De Novo Assembly of Plant Genomes
    Jung, Hyungtaek
    Winefield, Christopher
    Bombarely, Aureliano
    Prentis, Peter
    Waterhouse, Peter
    TRENDS IN PLANT SCIENCE, 2019, 24 (08) : 700 - 724
  • [47] Long-read sequencing identifies novel structural variations in colorectal cancer
    Xu, Luming
    Wang, Xingyue
    Lu, Xiaohuan J.
    Liang, Fan
    Liu, Zhibo J.
    Zhang, Hongyan
    Li, Xiaoqiong J.
    Tian, ShaoBo
    Wang, Lin J.
    Wang, Zheng
    PLOS GENETICS, 2023, 19 (02):
  • [48] Genome-wide detection of structural variation in some sheep breeds using whole-genome long-read sequencing data
    Qiao, Guoyan
    Xu, Pan
    Guo, Tingting
    He, Xue
    Yue, Yaojing
    Yang, Bohui
    JOURNAL OF ANIMAL BREEDING AND GENETICS, 2024, 141 (04) : 403 - 414
  • [50] Targeted long-read sequencing identifies missing disease-causing variation
    Miller, Danny E.
    Sulovari, Arvis
    Wang, Tianyun
    Loucks, Hailey
    Hoekzema, Kendra
    Munson, Katherine M.
    Lewis, Alexandra P.
    Fuerte, Edith P. Almanza
    Paschal, Catherine R.
    Walsh, Tom
    Thies, Jenny
    Bennett, James T.
    Glass, Ian
    Dipple, Katrina M.
    Patterson, Karynne
    Bonkowski, Emily S.
    Nelson, Zoe
    Squire, Audrey
    Sikes, Megan
    Beckman, Erika
    Bennett, Robin L.
    Earl, Dawn
    Lee, Winston
    Allikmets, Rando
    Perlman, Seth J.
    Chow, Penny
    Hing, Anne, V
    Wenger, Tara L.
    Adam, Margaret P.
    Sun, Angela
    Lam, Christina
    Chang, Irene
    Zou, Xue
    Austin, Stephanie L.
    Huggins, Erin
    Safi, Alexias
    Iyengar, Apoorva K.
    Reddy, Timothy E.
    Majoros, William H.
    Allen, Andrew S.
    Crawford, Gregory E.
    Kishnani, Priya S.
    King, Mary-Claire
    Cherry, Tim
    Chong, Jessica X.
    Bamshad, Michael J.
    Nickerson, Deborah A.
    Mefford, Heather C.
    Doherty, Dan
    Eichler, Evan E.
    AMERICAN JOURNAL OF HUMAN GENETICS, 2021, 108 (08) : 1436 - 1449