A detailed analysis of second and third-generation sequencing approaches for accurate length determination of short tandem repeats and homopolymers

被引:0
|
作者
Jeanjean, Sophie, I [1 ]
Shen, Yimin [2 ]
Hardy, Lise M. [1 ]
Daunay, Antoine [1 ]
Delepine, Marc [3 ]
Gerber, Zuzana [3 ]
Alberdi, Antonio [4 ]
Tubacher, Emmanuel [2 ]
Deleuze, Jean-Francois [1 ,2 ,3 ]
How-Kit, Alexandre [1 ]
机构
[1] Fdn Jean Dausset CEPH, Lab Genom, F-75010 Paris, France
[2] Fdn Jean Dausset, Lab Bioinformat, Paris, France
[3] Inst Francois Jacob, Ctr Natl Rech Genomique Humaine CNRGH, CEA, F-91000 Evry, France
[4] Univ Paris, Inst Rech St Louis, St Louis Hosp, Paris, France
关键词
MICROSATELLITE INSTABILITY; YEAST DEPENDENCE; MUTATION-RATES; RARE MUTATIONS; DNA; MARKERS; ALLELES; MIXTURE; LOCI; PCR;
D O I
10.1093/nar/gkaf131
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Microsatellites are short tandem repeats (STRs) of a motif of 1-6 nucleotides that are ubiquitous in almost all genomes and widely used in many biomedical applications. However, despite the development of next-generation sequencing (NGS) over the past two decades with new technologies coming to the market, accurately sequencing and genotyping STRs, particularly homopolymers, remain very challenging today due to several technical limitations. This leads in many cases to erroneous allele calls and difficulty in correctly identifying the genuine allele distribution in a sample. Here, we assessed several second and third-generation sequencing approaches in their capability to correctly determine the length of microsatellites using plasmids containing A/T homopolymers, AC/TG or AT/TA dinucleotide STRs of variable length. Standard polymerase chain reaction (PCR)-free and PCR-containing, single Unique Molecular Indentifier (UMI) and dual UMI 'duplex sequencing' protocols were evaluated using Illumina short-read sequencing, and two PCR-free protocols using PacBio and Oxford Nanopore Technologies long-read sequencing. Several bioinformatics algorithms were developed to correctly identify microsatellite alleles from sequencing data, including four and two modes for generating standard and combined consensus alleles, respectively. We provided a detailed analysis and comparison of these approaches and made several recommendations for the accurate determination of microsatellite allele length.
引用
收藏
页数:20
相关论文
共 8 条
  • [1] Analysis of RNA Modifications by Second- and Third-Generation Deep Sequencing: 2020 Update
    Motorin, Yuri
    Marchand, Virginie
    GENES, 2021, 12 (02) : 1 - 20
  • [2] Best practices for germline variant and DNA methylation analysis of second- and third-generation sequencing data
    Bonfiglio, Ferdinando
    Legati, Andrea
    Lasorsa, Vito Alessandro
    Palombo, Flavia
    De Riso, Giulia
    Isidori, Federica
    Russo, Silvia
    Furini, Simone
    Merla, Giuseppe
    Coppede, Fabio
    Tartaglia, Marco
    Bruselles, Alessandro
    Pippucci, Tommaso
    Ciolfi, Andrea
    Pinelli, Michele
    Capasso, Mario
    HUMAN GENOMICS, 2024, 18 (01)
  • [3] Comprehensive analysis of thalassemia alleles (CATSA) based on third-generation sequencing is a comprehensive and accurate approach for neonatal thalassemia screening
    Long, Ju
    Yu, Chunhui
    Sun, Lei
    Peng, Mingkui
    Song, Chuanlu
    Mao, Aiping
    Zhan, Jiahan
    Liu, Enqi
    CLINICA CHIMICA ACTA, 2024, 560
  • [4] A Novel Method for Identifying Length Variations of Short Tandem Repeats Based on Next Generation Sequencing and Its Application in Human Genetic Disease Research
    Yan Zhang-Ming
    Wang Yao
    Liu Ke
    Xiang Shu-Nian
    Sun Zhi-Rong
    PROGRESS IN BIOCHEMISTRY AND BIOPHYSICS, 2016, 43 (08) : 768 - 777
  • [5] Comparative and comprehensive analysis on bacterial communities of two full-scale wastewater treatment plants by second and third-generation sequencing
    Ji B.
    Wang S.
    Guo D.
    Pang H.
    Bioresource Technology Reports, 2020, 11
  • [6] Combined Analysis of Second- and Third-Generation Transcriptome Sequencing for Gene Characteristics and Identification of Key Splicing Variants in Wound Healing of Ganxi Goat Skin
    Yang, Xue
    Zheng, Lucheng
    Huo, Junhong
    Hu, Wei
    Liu, Ben
    Fan, Qingcan
    Zheng, Wenya
    Wang, Qianqian
    ANIMALS, 2024, 14 (21):
  • [7] High-throughput and high-sensitivity full-length single-cell RNA-seq analysis on third-generation sequencing platform
    Liao, Yuhan
    Liu, Zhenyu
    Zhang, Yu
    Lu, Ping
    Wen, Lu
    Tang, Fuchou
    CELL DISCOVERY, 2023, 9 (01)
  • [8] High-throughput and high-sensitivity full-length single-cell RNA-seq analysis on third-generation sequencing platform
    Yuhan Liao
    Zhenyu Liu
    Yu Zhang
    Ping Lu
    Lu Wen
    Fuchou Tang
    Cell Discovery, 9