long-read-tools.org: an interactive catalogue of analysis methods for long-read sequencing data

被引:24
|
作者
Amarasinghe, Shanika L. [1 ,2 ]
Ritchie, Matthew E. [1 ,2 ,3 ]
Gouil, Quentin [1 ,2 ]
机构
[1] Walter & Eliza Hall Inst Med Res, Epigenet & Dev Div, 1G Royal Parade, Parkville, Vic 3052, Australia
[2] Univ Melbourne, Dept Med Biol, 1G Royal Parade, Parkville, Vic 3052, Australia
[3] Univ Melbourne, Sch Math & Stat, 813 Swanston St, Parkville, Vic 3010, Australia
来源
GIGASCIENCE | 2021年 / 10卷 / 02期
基金
英国医学研究理事会; 澳大利亚国家健康与医学研究理事会;
关键词
database; long-read sequencing; data analysis; nanopore; PacBio; ALIGNMENT; RNA;
D O I
10.1093/gigascience/giab003
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Background: The data produced by long-read third-generation sequencers have unique characteristics compared to short-read sequencing data, often requiring tailored analysis tools for tasks ranging from quality control to downstream processing. The rapid growth in software that addresses these challenges for different genomics applications is difficult to keep track of, which makes it hard for users to choose the most appropriate tool for their analysis goal and for developers to identify areas of need and existing solutions to benchmark against. Findings: We describe the implementation of long-read-tools.org, an open-source database that organizes the rapidly expanding collection of long-read data analysis tools and allows its exploration through interactive browsing and filtering. The current database release contains 478 tools across 32 categories. Most tools are developed in Python, and the most frequent analysis tasks include base calling, de novo assembly, error correction, quality checking/filtering, and isoform detection, while long-read single-cell data analysis and transcriptomics are areas with the fewest tools available. Conclusion: Continued growth in the application of long-read sequencing in genomics research positions the long-read-tools.org database as an essential resource that allows researchers to keep abreast of both established and emerging software to help guide the selection of the most relevant tool for their analysis needs.
引用
收藏
页数:7
相关论文
共 50 条
  • [1] Long-read sequencing data analysis for yeasts
    Yue, Jia-Xing
    Liti, Gianni
    NATURE PROTOCOLS, 2018, 13 (06) : 1213 - 1231
  • [2] Long-read sequencing data analysis for yeasts
    Jia-Xing Yue
    Gianni Liti
    Nature Protocols, 2018, 13 : 1213 - 1231
  • [3] Opportunities and challenges in long-read sequencing data analysis
    Shanika L. Amarasinghe
    Shian Su
    Xueyi Dong
    Luke Zappia
    Matthew E. Ritchie
    Quentin Gouil
    Genome Biology, 21
  • [4] Opportunities and challenges in long-read sequencing data analysis
    Amarasinghe, Shanika L.
    Su, Shian
    Dong, Xueyi
    Zappia, Luke
    Ritchie, Matthew E.
    Gouil, Quentin
    GENOME BIOLOGY, 2020, 21 (01)
  • [5] The long and the short of it: unlocking nanopore long-read RNA sequencing data with short-read differential expression analysis tools
    Dong, Xueyi
    Tian, Luyi
    Gouil, Quentin
    Kariyawasam, Hasaru
    Su, Shian
    De Paoli-Iseppi, Ricardo
    Prawer, Yair David Joseph
    Clark, Michael B.
    Breslin, Kelsey
    Iminitoff, Megan
    Blewitt, Marnie E.
    Law, Charity W.
    Ritchie, Matthew E.
    NAR GENOMICS AND BIOINFORMATICS, 2021, 3 (02)
  • [6] NanoGalaxy: Nanopore long-read sequencing data analysis in Galaxy
    de Koning, Willem
    Miladi, Milad
    Hiltemann, Saskia
    Heikema, Astrid
    Hays, John P.
    Flemming, Stephan
    van den Beek, Marius
    Mustafa, Dana A.
    Backofen, Rolf
    Gruening, Bjoern
    Stubbs, Andrew P.
    GIGASCIENCE, 2020, 9 (10):
  • [7] LONG-READ SEQUENCING FOR THE METAGENOMIC ANALYSIS OF MICROBIOMES
    Free, Tristan
    BIOTECHNIQUES, 2023, 74 (04) : 153 - 155
  • [8] NanoPack: visualizing and processing long-read sequencing data
    De Coster, Wouter
    D'Hert, Svenn
    Schultz, Darrin T.
    Cruts, Marc
    Van Broeckhoven, Christine
    BIOINFORMATICS, 2018, 34 (15) : 2666 - 2669
  • [9] Evaluating long-read RNA-sequencing analysis tools with in silico mixtures
    Dong, Xueyi
    Ritchie, Matthew E.
    NATURE METHODS, 2023, 20 (11) : 1643 - 1644