Afann: bias adjustment for alignment-free sequence comparison based on sequencing data using neural network regression

被引:14
|
作者
Tang, Kujin [1 ]
Ren, Jie [1 ]
Sun, Fengzhu [1 ]
机构
[1] Univ Southern Calif, Quantitat & Computat Biol Program, Dept Biol Sci, Los Angeles, CA 90007 USA
基金
美国国家卫生研究院; 美国国家科学基金会;
关键词
Alignment-free; Neural network regression; kmer; d(2)*; d(2)(s); NGS; Bias adjustment; DISSIMILARITY MEASURES;
D O I
10.1186/s13059-019-1872-3
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
Alignment-free methods, more time and memory efficient than alignment-based methods, have been widely used for comparing genome sequences or raw sequencing samples without assembly. However, in this study, we show that alignment-free dissimilarity calculated based on sequencing samples can be overestimated compared with the dissimilarity calculated based on their genomes, and this bias can significantly decrease the performance of the alignment-free analysis. Here, we introduce a new alignment-free tool, Alignment-Free methods Adjusted by Neural Network (Afann) that successfully adjusts this bias and achieves excellent performance on various independent datasets. Afann is freely available at https://github.com/GeniusTang/Afann.
引用
收藏
页数:17
相关论文
共 50 条
  • [1] Afann: bias adjustment for alignment-free sequence comparison based on sequencing data using neural network regression
    Kujin Tang
    Jie Ren
    Fengzhu Sun
    Genome Biology, 20
  • [2] SENSE: Siamese neural network for sequence embedding and alignment-free comparison
    Zheng, Wei
    Yang, Le
    Genco, Robert J.
    Wactawski-Wende, Jean
    Buck, Michael
    Sun, Yijun
    BIOINFORMATICS, 2019, 35 (11) : 1820 - 1828
  • [3] Alignment-Free Sequence Comparison Based on Next-Generation Sequencing Reads
    Song, Kai
    Ren, Jie
    Zhai, Zhiyuan
    Liu, Xuemei
    Deng, Minghua
    Sun, Fengzhu
    JOURNAL OF COMPUTATIONAL BIOLOGY, 2013, 20 (02) : 64 - 79
  • [4] Alignment-free sequence comparison using absent words
    Charalampopoulos, Panagiotis
    Crochemore, Maxime
    Fici, Gabriele
    Mercas, Robert
    Pissis, Solon R.
    INFORMATION AND COMPUTATION, 2018, 262 : 57 - 68
  • [5] An Algorithm for Alignment-free Sequence Comparison using Logical Match
    Shanker, Sanil
    Austin, Jim
    Sherly, Elizabeth
    2010 2ND INTERNATIONAL CONFERENCE ON COMPUTER AND AUTOMATION ENGINEERING (ICCAE 2010), VOL 3, 2010, : 536 - 538
  • [6] Alignment-free genomic sequence comparison using FCGR and signal processing
    Lichtblau, Daniel
    BMC BIOINFORMATICS, 2019, 20 (01)
  • [7] Alignment-free genomic sequence comparison using FCGR and signal processing
    Daniel Lichtblau
    BMC Bioinformatics, 20
  • [8] Weighted measures based on maximizing deviation for alignment-free sequence comparison
    Qian, Kun
    Luan, Yihui
    PHYSICA A-STATISTICAL MECHANICS AND ITS APPLICATIONS, 2017, 481 : 235 - 242
  • [9] New developments of alignment-free sequence comparison: measures, statistics and next-generation sequencing
    Song, Kai
    Ren, Jie
    Reinert, Gesine
    Deng, Minghua
    Waterman, Michael S.
    Sun, Fengzhu
    BRIEFINGS IN BIOINFORMATICS, 2014, 15 (03) : 343 - 353
  • [10] Fast alignment-free sequence comparison using spaced-word frequencies
    Leimeister, Chris-Andre
    Boden, Marcus
    Horwege, Sebastian
    Lindner, Sebastian
    Morgenstern, Burkhard
    BIOINFORMATICS, 2014, 30 (14) : 1991 - 1999