Towards linked open gene mutations data

被引:8
|
作者
Zappa, Achille [1 ,2 ]
Splendiani, Andrea [3 ,4 ]
Romano, Paolo [1 ]
机构
[1] IRCCS AOU San Martino IST Natl Canc Res Inst, I-16132 Genoa, Italy
[2] Univ Genoa, Dept Informat Syst & Telemat, I-16145 Genoa, Italy
[3] Rothamsted Res, West Common, Harpenden AL5 2JQ, Herts, England
[4] Natl Univ Ireland Galway, Digital Enterprise Res Inst, Galway, Ireland
来源
BMC BIOINFORMATICS | 2012年 / 13卷
基金
英国生物技术与生命科学研究理事会;
关键词
DISEASE; IMPACT; MODEL;
D O I
10.1186/1471-2105-13-S4-S7
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: With the advent of high-throughput technologies, a great wealth of variation data is being produced. Such information may constitute the basis for correlation analyses between genotypes and phenotypes and, in the future, for personalized medicine. Several databases on gene variation exist, but this kind of information is still scarce in the Semantic Web framework. In this paper, we discuss issues related to the integration of mutation data in the Linked Open Data infrastructure, part of the Semantic Web framework. We present the development of a mapping from the IARC TP53 Mutation database to RDF and the implementation of servers publishing this data. Methods: A version of the IARC TP53 Mutation database implemented in a relational database was used as first test set. Automatic mappings to RDF were first created by using D2RQ and later manually refined by introducing concepts and properties from domain vocabularies and ontologies, as well as links to Linked Open Data implementations of various systems of biomedical interest. Since D2RQ query performances are lower than those that can be achieved by using an RDF archive, generated data was also loaded into a dedicated system based on tools from the Jena software suite. Results: We have implemented a D2RQ Server for TP53 mutation data, providing data on a subset of the IARC database, including gene variations, somatic mutations, and bibliographic references. The server allows to browse the RDF graph by using links both between classes and to external systems. An alternative interface offers improved performances for SPARQL queries. The resulting data can be explored by using any Semantic Web browser or application. Conclusions: This has been the first case of a mutation database exposed as Linked Data. A revised version of our prototype, including further concepts and IARC TP53 Mutation database data sets, is under development. The publication of variation information as Linked Data opens new perspectives: the exploitation of SPARQL searches on mutation data and other biological databases may support data retrieval which is presently not possible. Moreover, reasoning on integrated variation data may support discoveries towards personalized medicine.
引用
收藏
页数:13
相关论文
共 50 条
  • [1] Towards linked open gene mutations data
    Achille Zappa
    Andrea Splendiani
    Paolo Romano
    BMC Bioinformatics, 13
  • [2] Towards the Linked Open Data Geographic
    Abrescia, Alfredo
    GEOMEDIA, 2012, 16 (04) : 28 - 29
  • [3] Towards linked open government data in Canada
    Rajabi, Enayat
    International Journal of Metadata, Semantics and Ontologies, 2020, 14 (03) : 209 - 217
  • [4] Towards a Linked Open Data Edition of Sumerian Corpora
    Chiarcos, Christian
    Page-Perron, Emilie
    Khait, Ilya
    Schenk, Niko
    Reckling, Lucas
    PROCEEDINGS OF THE ELEVENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2018), 2018, : 2437 - 2444
  • [5] Towards a Linguistic Linked Open Data cloud: The Open Linguistics Working Group
    Chiarcos, Christian
    Hellmann, Sebastian
    Nordhoff, Sebastian
    TRAITEMENT AUTOMATIQUE DES LANGUES, 2011, 52 (03): : 245 - 275
  • [6] Towards a Linked Open Data Representation of a Grammar Terms Index
    Jettka, Daniel
    Kuropka, Karim
    Vertan, Cristina
    Zinsmeister, Heike
    LREC 2014 - NINTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2014,
  • [7] Using Open Information Extraction and Linked Open Data towards Ontology Enrichment and Alignment
    Koukourikos, Antonis
    Karampiperis, Pythagoras
    Vouros, George
    Karkaletsis, Vangelis
    ADVANCED INFORMATION SYSTEMS ENGINEERING WORKSHOPS, CAISE 2012, 2012, 112 : 117 - 122
  • [8] Towards a Linked Open Code
    Djebri, Ahmed El Amine
    Ettorre, Antonia
    Mortara, Johann
    SEMANTIC WEB, ESWC 2021, 2021, 12731 : 497 - 505
  • [9] Towards an Internet of Agents model based on Linked Open Data approach
    Pablo Pico-Valencia
    Juan A. Holgado-Terriza
    J. A. Senso
    Autonomous Agents and Multi-Agent Systems, 2019, 33 : 84 - 131
  • [10] Towards an Internet of Agents model based on Linked Open Data approach
    Pico-Valencia, Pablo
    Holgado-Terriza, Juan A.
    Senso, J. A.
    AUTONOMOUS AGENTS AND MULTI-AGENT SYSTEMS, 2019, 33 (1-2) : 84 - 131