Transformation of alignment files improves performance of variant callers for long-read RNA sequencing data

被引:0
|
作者
Vladimir B. C. de Souza
Ben T. Jordan
Elizabeth Tseng
Elizabeth A. Nelson
Karen K. Hirschi
Gloria Sheynkman
Mark D. Robinson
机构
[1] University of Zurich,Department of Molecular Life Sciences and SIB Swiss Institute of Bioinformatics
[2] University of Virginia,Department of Molecular Physiology and Biological Physics
[3] PacBio,Department of Cell Biology and Cardiovascular Research Center
[4] University of Virginia School of Medicine,Department of Medicine
[5] Yale University School of Medicine,Department of Genetics
[6] Yale University School of Medicine,Department of Biochemistry and Molecular Genetics
[7] Yale Cardiovascular Research Center,Center for Public Health Genomics
[8] Yale University School of Medicine,UVA Comprehensive Cancer Center
[9] University of Virginia,undefined
[10] University of Virginia,undefined
[11] University of Virginia,undefined
来源
关键词
D O I
暂无
中图分类号
学科分类号
摘要
Long-read RNA sequencing (lrRNA-seq) produces detailed information about full-length transcripts, including novel and sample-specific isoforms. Furthermore, there is an opportunity to call variants directly from lrRNA-seq data. However, most state-of-the-art variant callers have been developed for genomic DNA. Here, there are two objectives: first, we perform a mini-benchmark on GATK, DeepVariant, Clair3, and NanoCaller primarily on PacBio Iso-Seq, data, but also on Nanopore and Illumina RNA-seq data; second, we propose a pipeline to process spliced-alignment files, making them suitable for variant calling with DNA-based callers. With such manipulations, high calling performance can be achieved using DeepVariant on Iso-seq data.
引用
收藏
相关论文
共 50 条
  • [41] Comparative assessment of long-read error correction software applied to Nanopore RNA-sequencing data
    Lima, Leandro
    Marchet, Camille
    Caboche, Segolene
    Da Silva, Corinne
    Istace, Benjamin
    Aury, Jean-Marc
    Touzet, Helene
    Chikhi, Rayan
    BRIEFINGS IN BIOINFORMATICS, 2020, 21 (04) : 1164 - 1181
  • [42] Reference-free assembly of long-read transcriptome sequencing data with RNA-Bloom2
    Ka Ming Nip
    Saber Hafezqorani
    Kristina K. Gagalova
    Readman Chiu
    Chen Yang
    René L. Warren
    Inanc Birol
    Nature Communications, 14 (1)
  • [43] Startups use short-read data to expand long-read sequencing market
    Michael Eisenstein
    Nature Biotechnology, 2015, 33 : 433 - 435
  • [44] K-mer analysis of long-read alignment pileups for structural variant genotyping
    English, Adam C.
    Cunial, Fabio
    Metcalf, Ginger A.
    Gibbs, Richard A.
    Sedlazeck, Fritz J.
    NATURE COMMUNICATIONS, 2025, 16 (01)
  • [45] long-read-tools.org: an interactive catalogue of analysis methods for long-read sequencing data
    Amarasinghe, Shanika L.
    Ritchie, Matthew E.
    Gouil, Quentin
    GIGASCIENCE, 2021, 10 (02):
  • [46] Long-read RNA sequencing can probe organelle genome pervasive transcription
    Lima, Matheus Sanita
    Silva Domingues, Douglas
    Rossi Paschoal, Alexandre
    Smith, David Roy
    BRIEFINGS IN FUNCTIONAL GENOMICS, 2024, 23 (06) : 695 - 701
  • [48] Evaluating long-read RNA-sequencing analysis tools with in silico mixtures
    Dong, Xueyi
    Ritchie, Matthew E.
    NATURE METHODS, 2023, 20 (11) : 1643 - 1644
  • [49] Recovering Escherichia coli Plasmids in the Absence of Long-Read Sequencing Data
    Paganini, Julian A.
    Plantinga, Nienke L.
    Arredondo-Alonso, Sergio
    Willems, Rob J. L.
    Schurch, Anita C.
    MICROORGANISMS, 2021, 9 (08)
  • [50] Long-read single-molecule RNA structure sequencing using nanopore
    Bizuayehu, Teshome Tilahun
    Labun, Kornel
    Jakubec, Martin
    Jefimov, Kirill
    Niazi, Adnan Muhammad
    Valen, Eivind
    NUCLEIC ACIDS RESEARCH, 2022, 50 (20)