Sequence-specific error profile of Illumina sequencers

被引:396
|
作者
Nakamura, Kensuke [1 ]
Oshima, Taku [2 ]
Morimoto, Takuya [2 ,3 ]
Ikeda, Shun [1 ]
Yoshikawa, Hirofumi [4 ,5 ]
Shiwa, Yuh [5 ]
Ishikawa, Shu [2 ]
Linak, Margaret C. [6 ]
Hirai, Aki [1 ]
Takahashi, Hiroki [1 ]
Altaf-Ul-Amin, Md. [1 ]
Ogasawara, Naotake [2 ]
Kanaya, Shigehiko [1 ]
机构
[1] Nara Inst Sci & Technol, Grad Sch Informat Sci, Nara 6300192, Japan
[2] Nara Inst Sci & Technol, Grad Sch Biol Sci, Nara 6300192, Japan
[3] Kao Corp, Biol Sci Labs, Haga, Tochigi 3213497, Japan
[4] Tokyo Univ Agr, Dept Biosci, Setagaya Ku, Tokyo 1568502, Japan
[5] Tokyo Univ Agr, Genome Res Ctr, NODAI Res Inst, Setagaya Ku, Tokyo 1568502, Japan
[6] Univ Minnesota, Dept Chem Engn & Mat Sci, Minneapolis, MN 55455 USA
关键词
READ ALIGNMENT; RNA-SEQ; GENOME; GENERATION; TOOL; SEARCH; SANGER; BLAST;
D O I
10.1093/nar/gkr344
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
We identified the sequence-specific starting positions of consecutive miscalls in the mapping of reads obtained from the Illumina Genome Analyser (GA). Detailed analysis of the miscall pattern indicated that the underlying mechanism involves sequence-specific interference of the base elongation process during sequencing. The two major sequence patterns that trigger this sequence-specific error (SSE) are: (i) inverted repeats and (ii) GGC sequences. We speculate that these sequences favor dephasing by inhibiting single-base elongation, by: (i) folding single-stranded DNA and (ii) altering enzyme preference. This phenomenon is a major cause of sequence coverage variability and of the unfavorable bias observed for population-targeted methods such as RNA-seq and ChIP-seq. Moreover, SSE is a potential cause of false single-nucleotide polymorphism (SNP) calls and also significantly hinders de novo assembly. This article highlights the importance of recognizing SSE and its underlying mechanisms in the hope of enhancing the potential usefulness of the Illumina sequencers.
引用
收藏
页数:13
相关论文
共 50 条
  • [31] Thermodynamics of sequence-specific binding of PNA to DNA
    Ratilainen, T
    Holmén, A
    Tuite, E
    Nielsen, PE
    Nordén, B
    BIOCHEMISTRY, 2000, 39 (26) : 7781 - 7791
  • [32] HOMEOBOX PROTEINS AS SEQUENCE-SPECIFIC TRANSCRIPTION FACTORS
    LEVINE, M
    HOEY, T
    CELL, 1988, 55 (04) : 537 - 540
  • [33] RNA recognition and cleavage by sequence-specific endoribonucleases
    Saida, Fakhri
    Odaert, Benoit
    PROTEIN AND PEPTIDE LETTERS, 2007, 14 (02): : 103 - 111
  • [34] Sequence-specific DNA modification in Acetobacter xylinum
    Petroni, EA
    Bocca, SN
    Ielpi, L
    CELLULAR AND MOLECULAR BIOLOGY, 1996, 42 (05) : 759 - 767
  • [35] STUDIES ON THE BIOCHEMISTRY OF SEQUENCE-SPECIFIC RECOMBINATION MECHANISMS
    SCHMUCKER, R
    STERN, B
    RITTHALER, W
    KAHMANN, R
    KAMP, D
    HOPPE-SEYLERS ZEITSCHRIFT FUR PHYSIOLOGISCHE CHEMIE, 1983, 364 (09): : 1211 - 1212
  • [36] Designed sequence-specific minor groove ligands
    Wemmer, DE
    ANNUAL REVIEW OF BIOPHYSICS AND BIOMOLECULAR STRUCTURE, 2000, 29 : 439 - 461
  • [37] Scaling Theory of Neutral Sequence-Specific Polyampholytes
    Rumyantsev, Artem M.
    Jackson, Nicholas E.
    Johner, Albert
    de Pablo, Juan J.
    MACROMOLECULES, 2021, 54 (07) : 3232 - 3246
  • [38] Self-assembly of sequence-specific polypeptoids
    Segalman, Rachel A.
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2011, 241
  • [39] Sequence-Specific DNA Recognition with Designed Peptides
    Boga, Sonia
    Bouzada, David
    Garcia Pena, Diego
    Vazquez Lopez, Miguel
    Eugenio Vazquez, M.
    EUROPEAN JOURNAL OF ORGANIC CHEMISTRY, 2018, 2018 (03) : 249 - 261
  • [40] BLOOMS OF SEQUENCE-SPECIFIC CULTURABLE BACTERIA IN THE SEA
    REHNSTAM, AS
    BACKMAN, S
    SMITH, DC
    AZAM, F
    HAGSTROM, A
    FEMS MICROBIOLOGY ECOLOGY, 1993, 102 (3-4) : 161 - 166