Enhanced bovine genome annotation through integration of transcriptomics and epi-transcriptomics datasets facilitates genomic biology

被引:1
|
作者
Beiki, Hamid [1 ]
Murdoch, Brenda M. [2 ]
Park, Carissa A. [1 ]
Kern, Chandlar [3 ]
Kontechy, Denise [2 ]
Becker, Gabrielle [2 ]
Rincon, Gonzalo [4 ]
Jiang, Honglin [5 ]
Zhou, Huaijun [6 ]
Thorne, Jacob [2 ]
Koltes, James E. [1 ]
Michal, Jennifer J. [7 ]
Davenport, Kimberly [2 ]
Rijnkels, Monique [8 ]
Ross, Pablo J. [6 ]
Hu, Rui [5 ]
Corum, Sarah [4 ]
McKay, Stephanie [9 ]
Smith, Timothy P. L. [10 ]
Liu, Wansheng [3 ]
Ma, Wenzhi [3 ]
Zhang, Xiaohui [7 ]
Xu, Xiaoqing [6 ]
Han, Xuelei [7 ]
Jiang, Zhihua [7 ]
Hu, Zhi-Liang [1 ]
Reecy, James M. [1 ]
机构
[1] Iowa State Univ, Dept Anim Sci, Ames, IA 50011 USA
[2] Univ Idaho, Dept Anim Vet & Food Sci, Moscow, ID 83844 USA
[3] Penn State Univ, Dept Anim Sci, State Coll, PA 16802 USA
[4] Zoetis, Parsippany Troy Hills, NJ 07054 USA
[5] Virginia Tech, Dept Anim & Poultry Sci, Blacksburg, VA 24060 USA
[6] Univ Calif Davis, Dept Anim Sci, Davis, CA 95616 USA
[7] Washington State Univ, Dept Anim Sci, Pullman, WA 99164 USA
[8] Texas A&M Univ, Dept Vet Integrat Biosci, College Stn, TX 77843 USA
[9] Univ Missouri, Columbia, MO 65211 USA
[10] USDA ARS, USMARC, Washington, DC 68933 USA
来源
GIGASCIENCE | 2024年 / 13卷
基金
美国食品与农业研究所;
关键词
functional genomics; transcriptomics; epi-genetics; multi-omics integration; trait-similarity network; QTL; RECEPTOR RNA ACTIVATOR; RESIDUAL FEED-INTAKE; LONG NONCODING RNA; NF-KAPPA-B; CLINICAL MASTITIS; MESSENGER-RNA; MILK-YIELD; PROTEIN; EXPRESSION; LACTATION;
D O I
10.1093/gigascience/giae019
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Background The accurate identification of the functional elements in the bovine genome is a fundamental requirement for high-quality analysis of data informing both genome biology and genomic selection. Functional annotation of the bovine genome was performed to identify a more complete catalog of transcript isoforms across bovine tissues.Results A total of 160,820 unique transcripts (50% protein coding) representing 34,882 unique genes (60% protein coding) were identified across tissues. Among them, 118,563 transcripts (73% of the total) were structurally validated by independent datasets (PacBio isoform sequencing data, Oxford Nanopore Technologies sequencing data, de novo assembled transcripts from RNA sequencing data) and comparison with Ensembl and NCBI gene sets. In addition, all transcripts were supported by extensive data from different technologies such as whole transcriptome termini site sequencing, RNA Annotation and Mapping of Promoters for the Analysis of Gene Expression, chromatin immunoprecipitation sequencing, and assay for transposase-accessible chromatin using sequencing. A large proportion of identified transcripts (69%) were unannotated, of which 86% were produced by annotated genes and 14% by unannotated genes. A median of two 5 ' untranslated regions were expressed per gene. Around 50% of protein-coding genes in each tissue were bifunctional and transcribed both coding and noncoding isoforms. Furthermore, we identified 3,744 genes that functioned as noncoding genes in fetal tissues but as protein-coding genes in adult tissues. Our new bovine genome annotation extended more than 11,000 annotated gene borders compared to Ensembl or NCBI annotations. The resulting bovine transcriptome was integrated with publicly available quantitative trait loci data to study tissue-tissue interconnection involved in different traits and construct the first bovine trait similarity network.Conclusions These validated results show significant improvement over current bovine genome annotations.
引用
收藏
页数:22
相关论文
共 6 条
  • [1] Enhanced bovine genome annotation through integration of transcriptomics and epi-transcriptomics datasets facilitates genomic biology
    Beiki, Hamid
    Murdoch, Brenda M.
    Park, Carissa A.
    Kern, Chandlar
    Kontechy, Denise
    Becker, Gabrielle
    Rincon, Gonzalo
    Jiang, Honglin
    Zhou, Huaijun
    Thorne, Jacob
    Koltes, James E.
    Michal, Jennifer J.
    Davenport, Kimberly
    Rijnkels, Monique
    Ross, Pablo J.
    Hu, Rui
    Corum, Sarah
    McKay, Stephanie
    Smith, Timothy P. L.
    Liu, Wansheng
    Ma, Wenzhi
    Zhang, Xiaohui
    Xu, Xiaoqing
    Han, Xuelei
    Jiang, Zhihua
    Hu, Zhi-Liang
    Reecy, James M.
    GIGASCIENCE, 2024, 13
  • [2] Identification of genomic enhancers through spatial integration of single-cell transcriptomics and epigenomics
    Gonzalez-Blas, Carmen Bravo
    Quan, Xiao-Jiang
    Duran-Romana, Ramon
    Taskiran, Ibrahim Ihsan
    Koldere, Duygu
    Davie, Kristofer
    Christiaens, Valerie
    Makhzami, Samira
    Hulselmans, Gert
    de Waegeneer, Maxime
    Mauduit, David
    Poovathingal, Suresh
    Aibar, Sara
    Aerts, Stein
    MOLECULAR SYSTEMS BIOLOGY, 2020, 16 (05)
  • [3] Universal annotation of the human genome through integration of over a thousand epigenomic datasets
    Vu, Ha
    Ernst, Jason
    GENOME BIOLOGY, 2022, 23 (01)
  • [4] Universal annotation of the human genome through integration of over a thousand epigenomic datasets
    Ha Vu
    Jason Ernst
    Genome Biology, 23
  • [5] Enhanced whole genome sequence and annotation of Clostridium stercorarium DSM8532T using RNA-seq transcriptomics and high-throughput proteomics
    John J Schellenberg
    Tobin J Verbeke
    Peter McQueen
    Oleg V Krokhin
    Xiangli Zhang
    Graham Alvare
    Brian Fristensky
    Gerhard G Thallinger
    Bernard Henrissat
    John A Wilkins
    David B Levin
    Richard Sparling
    BMC Genomics, 15
  • [6] Enhanced whole genome sequence and annotation of Clostridium stercorarium DSM8532T using RNA-seq transcriptomics and high-throughput proteomics
    Schellenberg, John J.
    Verbeke, Tobin J.
    McQueen, Peter
    Krokhin, Oleg V.
    Zhang, Xiangli
    Alvare, Graham
    Fristensky, Brian
    Thallinger, Gerhard G.
    Henrissat, Bernard
    Wilkins, John A.
    Levin, David B.
    Sparling, Richard
    BMC GENOMICS, 2014, 15