Parallel and Memory-Efficient Reads Indexing for Genome Assembly

被引:0
|
作者
Chapuis, Guillaume [1 ]
Chikhi, Rayan [1 ]
Lavenier, Dominique [1 ]
机构
[1] ENS Cachan IRISA, Dept Comp Sci, F-35042 Rennes, France
关键词
ALGORITHMS;
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
As genomes, transcriptomes and meta-genomes are being sequenced at a faster pace than ever, there is a pressing need for efficient genome assembly methods. Two practical issues in assembly are heavy memory usage and long execution time during the read indexing phase. In this article, a parallel and memory-efficient method is proposed for reads indexing prior to assembly. Specifically, a hash-based structure that stores a reduced amount of read information is designed. Erroneous entries are filtered on the fly during index construction. A prototype implementation has been designed and applied to actual Illumina short reads. Benchmark evaluation shows that this indexing method requires significantly less memory than those from popular assemblers.
引用
收藏
页码:272 / 280
页数:9
相关论文
共 50 条
  • [1] Parallel and Memory-efficient Preprocessing for Metagenome Assembly
    Rengasamy, Vasudevan
    Medvedev, Paul
    Madduri, Kamesh
    2017 IEEE INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS (IPDPSW), 2017, : 283 - 292
  • [2] HapCol: accurate and memory-efficient haplotype assembly from long reads
    Pirola, Yuri
    Zaccaria, Simone
    Dondi, Riccardo
    Klau, Gunnar W.
    Pisanti, Nadia
    Bonizzoni, Paola
    BIOINFORMATICS, 2016, 32 (11) : 1610 - 1617
  • [3] Time- and memory-efficient genome assembly with Raven
    Vaser, Robert
    Sikic, Mile
    NATURE COMPUTATIONAL SCIENCE, 2021, 1 (05): : 332 - 336
  • [4] LightAssembler: fast and memory-efficient assembly algorithm for high-throughput sequencing reads
    El-Metwally, Sara
    Zakaria, Magdi
    Hamza, Taher
    BIOINFORMATICS, 2016, 32 (21) : 3215 - 3223
  • [5] Memory-Efficient Assembly Using Flye
    Freire, Borja
    Ladra, Susana
    Parama, Jose R.
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2022, 19 (06) : 3564 - 3577
  • [6] Memory-efficient Parallel Tensor Decompositions
    Baskaran, Muthu
    Henretty, Tom
    Pradelle, Benoit
    Langston, M. Harper
    Bruns-Smith, David
    Ezick, James
    Lethin, Richard
    2017 IEEE HIGH PERFORMANCE EXTREME COMPUTING CONFERENCE (HPEC), 2017,
  • [7] Parallel Memory-Efficient Processing of BCI Data
    Alexander, Trevor
    Kuh, Anthony
    Hamada, Katsuhiko
    Mori, Hiromu
    Shinoda, Hiroyuki
    Rutkowski, Tomasz
    2014 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA), 2014,
  • [8] A scalable memory-efficient architecture for parallel shared memory switches
    Matthews, Brad
    Elhanany, Itamar
    2007 WORKSHOP ON HIGH PERFORMANCE SWITCHING AND ROUTING, 2007, : 74 - +
  • [9] Memory-Efficient Pipeline-Parallel DNN Training
    Narayanan, Deepak
    Phanishayee, Amar
    Shi, Kaiyu
    Chen, Xie
    Zaharia, Matei
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139
  • [10] Multiplexer and Memory-Efficient Circuits for Parallel Bit Reversal
    Garrido, Mario
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS, 2019, 66 (04) : 657 - 661