SNP-Based and Kmer-Based eQTL Analysis Using Transcriptome Data

被引:0
|
作者
Ge, Mei [1 ]
Li, Chenyu [1 ]
Zhang, Zhiyan [1 ]
机构
[1] Jiangxi Agr Univ, Natl Key Lab Swine Genet Improvement & Germplasm I, Nanchang 330045, Peoples R China
来源
ANIMALS | 2024年 / 14卷 / 20期
关键词
RNA-seq; eQTL; SNP; kmer; haplotype block; ALIGNMENT; LOCI;
D O I
10.3390/ani14202941
中图分类号
S8 [畜牧、 动物医学、狩猎、蚕、蜂];
学科分类号
0905 ;
摘要
Simple Summary Expression quantitative trait locus (eQTL) analysis is crucial in revealing the genetic basis of complex traits, advancing the study of human diseases, optimizing the breeding of agricultural plants and animals, and gaining a deeper understanding of specific biological processes. The conventional analysis involves correlating genetic variants from whole-genome sequencing (WGS) data and gene expression, but to improve the power of eQTL detection, it is often necessary to expand the sample size as far as possible, ranging from hundreds to thousands. Large sample sizes for WGS are extremely costly, and this bottleneck is particularly evident in economically important agricultural animals. In contrast, the advantages of eQTL analyses from transcriptome data are obvious: cost-effective, simultaneous acquisition of variants and expression information. We propose to use transcriptome data alone for SNP calling and kmer generation, and then association analysis with gene expression. Here, 87 SNP-based and 35 kmer-based association results were obtained. Subsequently, comparison and validation of these two results revealed that they each have their own strengths and can complement each other, which promotes in-depth exploration of the regulatory relationship between genetic variants and gene expression.Abstract Traditional expression quantitative trait locus (eQTL) mapping associates single nucleotide polymorphisms (SNPs) with gene expression, where the SNPs are derived from large-scale whole-genome sequencing (WGS) data or transcriptome data. While WGS provides a high SNP density, it also incurs substantial sequencing costs. In contrast, RNA-seq data, which are more accessible and less expensive, can simultaneously yield gene expressions and SNPs. Thus, eQTL analysis based on RNA-seq offers significant potential applications. Two primary strategies were employed for eQTL in this study. The first involved analyzing expression levels in relation to variant sites detected between populations from RNA-seq data. The second approach utilized kmers, which are sequences of length k derived from RNA-seq reads, to represent variant sites and associated these kmer genotypes with gene expression. We discovered 87 significant association signals involving eGene on the basis of the SNP-based eQTL analysis. These genes include DYNLT1, NMNAT1, and MRLC2, which are closely related to neurological functions such as motor coordination and homeostasis, play a role in cellular energy metabolism, and function in regulating calcium-dependent signaling in muscle contraction, respectively. This study compared the results obtained from eQTL mapping using RNA-seq identified SNPs and gene expression with those derived from kmers. We found that the vast majority (23/30) of the association signals overlapping the two methods could be verified by haplotype block analysis. This comparison elucidates the strengths and limitations of each method, providing insights into their relative efficacy for eQTL identification.
引用
收藏
页数:14
相关论文
共 50 条
  • [11] A SNP-based phylogenetic analysis of Corynebacterium diphtheriae in Malaysia
    Hii S.Y.F.
    Ahmad N.
    Hashim R.
    Liow Y.L.
    Abd Wahab M.A.
    Mohd Khalid M.K.N.
    BMC Research Notes, 11 (1)
  • [12] Interpretation of SNP-based NIPS data in the context of heteropaternal twins
    Gong, Ping
    Hsu, Melissa
    DiNonno, Wendy
    Hashimoto, Katelyn
    Kijacic, Dusan
    Xu, Wenbo
    GENETICS IN MEDICINE, 2022, 24 (03) : S242 - S243
  • [13] SNP-based analysis of genetic substructure in the German population
    Steffens, Michael
    Lamina, Claudia
    Illig, Thomas
    Bettecken, Thomas
    Vogler, Rainer
    Entz, Patricia
    Suk, Eun-Kyung
    Toliat, Mohammad Reza
    Klopp, Norman
    Caliebe, Amke
    Koenig, Inke R.
    Koehler, Karola
    Luedemann, Jan
    Lacava, Amalia Diaz
    Fimmers, Rolf
    Lichtner, Peter
    Ziegler, Andreas
    Wolf, Andreas
    Krawczak, Michael
    Nuernberg, Peter
    Hampe, Jochen
    Schreiber, Stefan
    Meitinger, Thomas
    Wichmann, H. -Erich
    Roeder, Kathryn
    Wienker, Thomas F.
    Baur, Max P.
    HUMAN HEREDITY, 2006, 62 (01) : 20 - 29
  • [14] Response to Lee et al.: SNP-Based Heritability Analysis with Dense Data
    Speed, Doug
    Hemani, Gibran
    Johnson, Michael R.
    Balding, David J.
    AMERICAN JOURNAL OF HUMAN GENETICS, 2013, 93 (06) : 1155 - 1157
  • [15] eSNPO: An eQTL-based SNP Ontology and SNP functional enrichment analysis platform
    Li, Jin
    Wang, Limei
    Jiang, Tao
    Wang, Jizhe
    Li, Xue
    Liu, Xiaoyan
    Wang, Chunyu
    Teng, Zhixia
    Zhang, Ruijie
    Lv, Hongchao
    Guo, Maozu
    SCIENTIFIC REPORTS, 2016, 6
  • [16] Transcriptome profiling, sequence characterization, and SNP-based chromosomal assignment of the EXPANSIN genes in cotton
    Chuanfu An
    Sukumar Saha
    Johnie N. Jenkins
    Brian E. Scheffler
    Thea A. Wilkins
    David M. Stelly
    Molecular Genetics and Genomics, 2007, 278 : 539 - 553
  • [17] eSNPO: An eQTL-based SNP Ontology and SNP functional enrichment analysis platform
    Jin Li
    Limei Wang
    Tao Jiang
    Jizhe Wang
    Xue Li
    Xiaoyan Liu
    Chunyu Wang
    Zhixia Teng
    Ruijie Zhang
    Hongchao Lv
    Maozu Guo
    Scientific Reports, 6
  • [18] A classifier for the SNP-Based inference of ancestry
    Frudakis, T
    Venkateswarlu, K
    Thomas, MJ
    Gaskin, Z
    Ginjupalli, S
    Gunturi, S
    Ponnuswamy, V
    Natarajan, S
    Nachimuthu, PK
    JOURNAL OF FORENSIC SCIENCES, 2003, 48 (04) : 771 - 782
  • [19] A comprehensive SNP-based genetic analysis of inbred mouse strains
    Tsang, S
    Sun, ZH
    Luke, B
    Stewart, C
    Lum, N
    Gregory, M
    Wu, XL
    Subleski, M
    Jenkins, NA
    Copeland, NG
    Munroe, DJ
    MAMMALIAN GENOME, 2005, 16 (07) : 476 - 480
  • [20] Transcriptome profiling, sequence characterization, and SNP-based chromosomal assignment of the EXPANSIN genes in cotton
    An, Chuanfu
    Saha, Sukumar
    Jenkins, Johnie N.
    Scheffler, Brian E.
    Wilkins, Thea A.
    Stelly, David M.
    MOLECULAR GENETICS AND GENOMICS, 2007, 278 (05) : 539 - 553