CrusTome: a transcriptome database resource for large-scale analyses across Crustacea

被引:7
|
作者
Perez-Moreno, Jorge L. [1 ]
Kozma, Mihika T. [1 ]
DeLeo, Danielle M. [2 ]
Bracken-Grissom, Heather D. [2 ,3 ,4 ]
Durica, David S. [5 ]
Mykles, Donald L. [1 ]
机构
[1] Colorado State Univ, Dept Biol, Ft Collins, CO 80523 USA
[2] Smithsonian Inst, Natl Museum Nat Hist, Dept Invertebrate Zool, Washington, DC 20560 USA
[3] Florida Int Univ, Dept Biol Sci, North Miami, FL 33181 USA
[4] Florida Int Univ, Inst Environm, North Miami, FL 33181 USA
[5] Univ Oklahoma, Dept Biol, Norman, OK 73019 USA
来源
G3-GENES GENOMES GENETICS | 2023年 / 13卷 / 07期
基金
美国国家科学基金会; 美国海洋和大气管理局;
关键词
Arthropoda; bioinformatics; BLAST; crustaceans; cryptochrome; phylogenetics; RNA-seq; RNA-SEQ; GRAND CHALLENGES; CRYPTOCHROME; ANNOTATION; PHOTOLYASE; GENOME; MODEL; RECOMMENDATIONS; APPROXIMATION; ALGORITHM;
D O I
10.1093/g3journal/jkad098
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Transcriptomes from nontraditional model organisms often harbor a wealth of unexplored data. Examining these data sets can lead to clarity and novel insights in traditional systems, as well as to discoveries across a multitude of fields. Despite significant advances in DNA sequencing technologies and in their adoption, access to genomic and transcriptomic resources for nontraditional model organisms remains limited. Crustaceans, for example, being among the most numerous, diverse, and widely distributed taxa on the planet, often serve as excellent systems to address ecological, evolutionary, and organismal questions. While they are ubiquitously present across environments, and of economic and food security importance, they remain severely underrepresented in publicly available sequence databases. Here, we present CrusTome, a multispecies, multitissue, transcriptome database of 201 assembled mRNA transcriptomes (189 crustaceans, 30 of which were previously unpublished, and 12 ecdysozoans for phylogenetic context) as an evolving and publicly available resource. This database is suitable for evolutionary, ecological, and functional studies that employ genomic/transcriptomic techniques and data sets. CrusTome is presented in BLAST and DIAMOND formats, providing robust data sets for sequence similarity searches, orthology assignments, phylogenetic inference, etc. and thus allowing for straightforward incorporation into existing custom pipelines for high-throughput analyses. In addition, to illustrate the use and potential of CrusTome, we conducted phylogenetic analyses elucidating the identity and evolution of the cryptochrome/photolyase family of proteins across crustaceans.
引用
收藏
页数:12
相关论文
共 50 条
  • [41] The Optimal Allocation of Large-scale Water Resource
    Xu Chen-guang
    Li Yue-peng
    PROGRESS IN CIVIL ENGINEERING, PTS 1-4, 2012, 170-173 : 1883 - 1886
  • [42] RISK ALLOCATION IN LARGE-SCALE RESOURCE VENTURES
    SIEBERT, H
    KYKLOS, 1987, 40 (04) : 476 - 495
  • [43] Resource Elasticity for Large-Scale Machine Learning
    Huang, Botong
    Boehm, Matthias
    Tian, Yuanyuan
    Reinwald, Berthold
    Tatikonda, Shirish
    Reiss, Frederick R.
    SIGMOD'15: PROCEEDINGS OF THE 2015 ACM SIGMOD INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2015, : 137 - 152
  • [44] TRANSLIT: A Large-scale Name Transliteration Resource
    Benites, Fernando
    Duivesteijn, Gilbert Francois
    von Daeniken, Pius
    Cieliebak, Mark
    PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2020), 2020, : 3265 - 3271
  • [45] Large-scale transcriptome sequencing and gene analyses in the crab-eating macaque (Macaca fascicularis) for biomedical research
    Huh, Jae-Won
    Kim, Young-Hyun
    Park, Sang-Je
    Kim, Dae-Soo
    Lee, Sang-Rae
    Kim, Kyoung-Min
    Jung, Kang-Jin
    Kim, Ji-Su
    Song, Bong-Seok
    Sim, Bo-Woong
    Kim, Sun-Uk
    Kim, Sang-Hyun
    Chang, Kyu-Tae
    BMC GENOMICS, 2012, 13
  • [46] A Large-scale Database for Less Cooperative Iris Recognition
    Hu, Junxing
    Wang, Leyuan
    Luo, Zhengquan
    Wang, Yunlong
    Sun, Zhenan
    2021 INTERNATIONAL JOINT CONFERENCE ON BIOMETRICS (IJCB 2021), 2021,
  • [47] Large-scale biological meta-database management
    Pedersen, Edvard
    Bongo, Lars Ailo
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2017, 67 : 481 - 489
  • [48] Wikidata: A large-scale collaborative ontological medical database
    Turki, Houcemeddine
    Shafee, Thomas
    Taieb, Mohamed Ali Hadj
    Ben Aouicha, Mohamed
    Vrandecic, Denny
    Das, Diptanshu
    Hamdi, Helmi
    JOURNAL OF BIOMEDICAL INFORMATICS, 2019, 99
  • [49] PeroxiBase: a database for large-scale evolutionary analysis of peroxidases
    Fawal, Nizar
    Li, Qiang
    Savelli, Bruno
    Brette, Marie
    Passaia, Gisele
    Fabre, Maxime
    Mathe, Catherine
    Dunand, Christophe
    NUCLEIC ACIDS RESEARCH, 2013, 41 (D1) : D441 - D444
  • [50] AVA: A Large-Scale Database for Aesthetic Visual Analysis
    Murray, Naila
    Marchesotti, Luca
    Perronnin, Florent
    2012 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2012, : 2408 - 2415