Afann: bias adjustment for alignment-free sequence comparison based on sequencing data using neural network regression

Cited by: 14
Authors
Tang, Kujin [1]
Ren, Jie [1]
Sun, Fengzhu [1]
Affiliations
[1] Univ Southern Calif, Quantitat & Computat Biol Program, Dept Biol Sci, Los Angeles, CA 90007 USA
Funding
National Institutes of Health (USA); National Science Foundation (USA)
Keywords
Alignment-free; Neural network regression; k-mer; d2*; d2S; NGS; Bias adjustment; Dissimilarity measures
DOI
10.1186/s13059-019-1872-3
Chinese Library Classification (CLC)
Q81 [Bioengineering (biotechnology)]; Q93 [Microbiology]
Discipline classification codes
071005; 0836; 090102; 100705
Abstract
Alignment-free methods, more time and memory efficient than alignment-based methods, have been widely used for comparing genome sequences or raw sequencing samples without assembly. However, in this study, we show that alignment-free dissimilarity calculated based on sequencing samples can be overestimated compared with the dissimilarity calculated based on their genomes, and this bias can significantly decrease the performance of the alignment-free analysis. Here, we introduce a new alignment-free tool, Alignment-Free methods Adjusted by Neural Network (Afann) that successfully adjusts this bias and achieves excellent performance on various independent datasets. Afann is freely available at https://github.com/GeniusTang/Afann.
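To make the idea of a k-mer-based alignment-free dissimilarity concrete, here is a minimal sketch in Python. It is not Afann's implementation and does not include the neural network bias adjustment. It assumes an i.i.d. (order-0) background model for the expected k-mer counts, and it computes a d2*-style value as half of one minus the cosine between centered, standardized k-mer count vectors; the exact normalization used in the paper may differ. The function names `kmer_counts`, `centered_standardized`, and `d2star_like` are hypothetical.

```python
from collections import Counter
from itertools import product
from math import sqrt

ALPHABET = "ACGT"


def kmer_counts(seq, k):
    """Count overlapping k-mers that contain only A, C, G, T."""
    seq = seq.upper()
    counts = Counter()
    for i in range(len(seq) - k + 1):
        word = seq[i:i + k]
        if all(base in ALPHABET for base in word):
            counts[word] += 1
    return counts


def centered_standardized(seq, k):
    """Map each k-mer w to (X_w - E[X_w]) / sqrt(E[X_w]).

    Assumption: E[X_w] comes from an i.i.d. base model estimated
    from the sequence itself, not from a fitted Markov chain.
    """
    counts = kmer_counts(seq, k)
    total = sum(counts.values())
    base_counts = Counter(b for b in seq.upper() if b in ALPHABET)
    n_bases = sum(base_counts.values())
    if n_bases == 0:
        return {}
    freqs = {b: base_counts[b] / n_bases for b in ALPHABET}
    vec = {}
    for letters in product(ALPHABET, repeat=k):
        word = "".join(letters)
        prob = 1.0
        for base in word:
            prob *= freqs[base]
        expected = total * prob
        if expected > 0:  # skip k-mers that cannot occur under the model
            vec[word] = (counts[word] - expected) / sqrt(expected)
    return vec


def d2star_like(seq_a, seq_b, k=3):
    """d2*-style dissimilarity in [0, 1]; 0 means identical k-mer profiles."""
    x = centered_standardized(seq_a, k)
    y = centered_standardized(seq_b, k)
    dot = sum(x[w] * y[w] for w in x.keys() & y.keys())
    norm_x = sqrt(sum(v * v for v in x.values()))
    norm_y = sqrt(sum(v * v for v in y.values()))
    if norm_x == 0 or norm_y == 0:
        return 0.5  # correlation undefined; treat as uncorrelated
    return 0.5 * (1 - dot / (norm_x * norm_y))
```

For example, `d2star_like(genome_a, genome_b, k=3)` could be applied to two assembled genomes, or to k-mer counts pooled over reads. The abstract's point is that the read-based value tends to be larger than the genome-based one, which is the bias Afann is designed to adjust.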
Pages: 17