LinkHub: a Semantic Web system that facilitates cross-database queries and information retrieval in proteomics

被引:23
|
作者
Smith, Andrew K.
Cheung, Kei-Hoi
Yip, Kevin Y.
Schultz, Martin
Gerstein, Mark K.
机构
[1] Yale Univ, Dept Mol Biophys & Biochem, New Haven, CT 06520 USA
[2] Yale Univ, Dept Comp Sci, New Haven, CT 06520 USA
[3] Yale Univ, Ctr Med Informat, New Haven, CT 06520 USA
[4] Yale Univ, Dept Genet, New Haven, CT 06504 USA
[5] Yale Univ, Program Computat Biol & Bioinformat, New Haven, CT 06511 USA
[6] Yale Univ, Dept Anesthesiol, New Haven, CT 06520 USA
基金
美国国家科学基金会;
关键词
D O I
10.1186/1471-2105-8-S3-S5
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: A key abstraction in representing proteomics knowledge is the notion of unique identifiers for individual entities ( e. g. proteins) and the massive graph of relationships among them. These relationships are sometimes simple ( e. g. synonyms) but are often more complex ( e. g. one-to-many relationships in protein family membership). Results: We have built a software system called LinkHub using Semantic Web RDF that manages the graph of identifier relationships and allows exploration with a variety of interfaces. For efficiency, we also provide relational-database access and translation between the relational and RDF versions. LinkHub is practically useful in creating small, local hubs on common topics and then connecting these to major portals in a federated architecture; we have used LinkHub to establish such a relationship between UniProt and the North East Structural Genomics Consortium. LinkHub also facilitates queries and access to information and documents related to identifiers spread across multiple databases, acting as "connecting glue" between different identifier spaces. We demonstrate this with example queries discovering "interologs" of yeast protein interactions in the worm and exploring the relationship between gene essentiality and pseudogene content. We also show how "protein family based" retrieval of documents can be achieved. LinkHub is available at hub.gersteinlab.org and hub.nesg.org with supplement, database models and full-source code. Conclusion: LinkHub leverages Semantic Web standards-based integrated data to provide novel information retrieval to identifier-related documents through relational graph queries, simplifies and manages connections to major hubs such as UniProt, and provides useful interactive and query interfaces for exploring the integrated data.
引用
收藏
页数:12
相关论文
共 50 条
  • [1] LinkHub: a Semantic Web system that facilitates cross-database queries and information retrieval in proteomics
    Andrew K Smith
    Kei-Hoi Cheung
    Kevin Y Yip
    Martin Schultz
    Mark B Gerstein
    BMC Bioinformatics, 8
  • [2] Leveraging the structure of the semantic web to enhance information retrieval for proteomics
    Smith, Andrew
    Cheung, Kei
    Krauthammer, Michael
    Schultz, Martin
    Gerstein, Mark
    BIOINFORMATICS, 2007, 23 (22) : 3073 - 3079
  • [3] Discovering biomedical semantic relations in PubMed queries for information retrieval and database curation
    Huang, Chung-Chi
    Lu, Zhiyong
    DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION, 2016,
  • [4] Information Retrieval from Database Queries
    Catao, Vladimir Soares
    Sampaio, Marcus Costa
    Schiel, Ulrich
    2014 IEEE/ACS 11TH INTERNATIONAL CONFERENCE ON COMPUTER SYSTEMS AND APPLICATIONS (AICCSA), 2014, : 507 - 514
  • [5] Heterogeneous Learning Resources Integration and Cross-Database Retrieval
    Liu, Jiamin
    Du, Xu
    Li, Hao
    Yang, Juan
    2017 6TH INTERNATIONAL CONFERENCE OF EDUCATIONAL INNOVATION THROUGH TECHNOLOGY (EITT), 2017, : 345 - 346
  • [6] An architecture for personal semantic web information retrieval system
    Yu, H. (yu@al.is.kyushu-u.ac.jp), 1600, et al.; Fuji Xerox Co., Ltd.; Hitachi, Ltd.; NEC; World Wide Web Consortium (W3C); Yahoo (Association for Computing Machinery (ACM)):
  • [7] Universal information retrieval system in semantic Web environment
    Yoo, JM
    Myaeng, SH
    Jin, Y
    Lee, MH
    Proceedings of the 2005 IEEE International Conference on Natural Language Processing and Knowledge Engineering (IEEE NLP-KE'05), 2005, : 348 - 353
  • [8] Semantic information retrieval on the web
    Sezer, Ebru
    Yazici, Adnan
    Yarimagan, Unal
    ADVANCES IN INFORMATION SYSTEMS, PROCEEDINGS, 2006, 4243 : 158 - 167
  • [9] An Integrated Cross-Database Search System for microRNA
    Chen, Yijun
    PROCEEDINGS OF THE 2009 2ND INTERNATIONAL CONFERENCE ON BIOMEDICAL ENGINEERING AND INFORMATICS, VOLS 1-4, 2009, : 1716 - 1719
  • [10] Using the Web corpus to translate the queries in cross-lingual information retrieval
    Zhang, JL
    Sun, L
    Min, JM
    PROCEEDINGS OF THE 2005 IEEE INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE PROCESSING AND KNOWLEDGE ENGINEERING (IEEE NLP-KE'05), 2005, : 493 - 498