Comprehensive comparison of large-scale tissue expression datasets

被引:63
|
作者
Santos, Alberto [1 ]
Tsafou, Kalliopi [1 ]
Stolte, Christian [2 ]
Pletscher-Frankild, Sune [1 ]
O'Donoghue, Sean I. [2 ,3 ]
Jensen, Lars Juhl [1 ]
机构
[1] Univ Copenhagen, Fac Hlth & Med Sci, Novo Nordisk Fdn Ctr Prot Res, Copenhagen, Denmark
[2] CSIRO, Sydney, NSW, Australia
[3] Garvan Inst Med Res, Sydney, NSW, Australia
来源
PEERJ | 2015年 / 3卷
基金
美国国家卫生研究院;
关键词
Immunohistochemistry; RNA sequencing; Tissue expression; Mass spectrometry; Microarrays; Databases; Tissue-specificity; GENE-EXPRESSION; MASS-SPECTROMETRY; HOUSEKEEPING GENES; RNA-SEQ; ATLAS; SPECIFICITY; MICROARRAY; DATABASE; DRAFT;
D O I
10.7717/peerj.1054
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
For tissues to carry out their functions, they rely on the right proteins to be present. Several high-throughput technologies have been used to map out which proteins are expressed in which tissues; however, the data have not previously been systematically compared and integrated. We present a comprehensive evaluation of tissue expression data from a variety of experimental techniques and show that these agree surprisingly well with each other and with results from literature curation and text mining. We further found that most datasets support the assumed but not demonstrated distinction between tissue-specific and ubiquitous expression. By developing comparable confidence scores for all types of evidence, we show that it is possible to improve both quality and coverage by combining the datasets. To facilitate use and visualization of our work, we have developed the TISSUES resource (http://tissues.jensenlab.org), which makes all the scored and integrated data available through a single user-friendly web interface.
引用
收藏
页数:23
相关论文
共 50 条
  • [21] Will Large-scale Generative Models Corrupt Future Datasets?
    Hataya, Ryuichiro
    Bao, Han
    Arai, Hiromi
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 20498 - 20508
  • [22] Large-scale palm vein recognition on synthetic datasets
    Hernandez-Garcia, Ruber
    Santamaria, Jose, I
    Barrientos, Ricardo J.
    Salazar Jurado, Edwin H.
    Manuel Castro, Francisco
    Ramos-Cozar, Julian
    Guil, Nicolas
    2021 40TH INTERNATIONAL CONFERENCE OF THE CHILEAN COMPUTER SCIENCE SOCIETY (SCCC), 2021,
  • [23] Scalable Iterative Classification for Sanitizing Large-Scale Datasets
    Li, Bo
    Vorobeychik, Yevgeniy
    Li, Muqun
    Malin, Bradley
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2017, 29 (03) : 698 - 711
  • [24] TIPP: Parallel Delaunay Triangulation for Large-Scale Datasets
    Nguyen, Cuong
    Rhodes, Philip J.
    30TH INTERNATIONAL CONFERENCE ON SCIENTIFIC AND STATISTICAL DATABASE MANAGEMENT (SSDBM 2018), 2018,
  • [25] Exploring Large-scale Public Medical Image Datasets
    Oakden-Rayner, Luke
    ACADEMIC RADIOLOGY, 2020, 27 (01) : 106 - 112
  • [26] Distributed Sketched Subspace Clustering for Large-scale Datasets
    Traganitis, Panagiotis A.
    Giannakis, Georgios B.
    2017 IEEE 7TH INTERNATIONAL WORKSHOP ON COMPUTATIONAL ADVANCES IN MULTI-SENSOR ADAPTIVE PROCESSING (CAMSAP), 2017,
  • [27] Generative models and abstractions for large-scale neuroanatomy datasets
    Rolnick, David
    Dyer, Eva L.
    CURRENT OPINION IN NEUROBIOLOGY, 2019, 55 : 112 - 120
  • [28] LARGE-SCALE DATASETS FOR GOING DEEPER IN IMAGE UNDERSTANDING
    Wu, Jiahong
    Zheng, He
    Zhao, Bo
    Li, Yixin
    Yan, Baoming
    Liang, Rui
    Wang, Wenjia
    Zhou, Shipei
    Lin, Guosen
    Fu, Yanwei
    Wang, Yizhou
    Wang, Yonggang
    2019 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2019, : 1480 - 1485
  • [29] A fast fuzzy clustering algorithm for large-scale datasets
    Shi, LK
    He, PL
    ADVANCED DATA MINING AND APPLICATIONS, PROCEEDINGS, 2005, 3584 : 203 - 208
  • [30] Understanding Data Similarity in Large-Scale Scientific Datasets
    Linton, Payton
    Melodia, William
    Lazar, Alina
    Agarwal, Deborah
    Bianchi, Ludovico
    Ghoshal, Devarshi
    Pastorello, Gilbert
    Ramakrishnan, Lavanya
    Wu, Kesheng
    2019 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2019, : 4525 - 4531