Genomic Sequence Variation Markup Language (GSVML)

被引:7
|
作者
Nakaya, Jun [1 ]
Kimura, Michio [2 ]
Hiroi, Kaei
Ido, Keisuke
Yang, Woosung [3 ]
Tanaka, Hiroshi
机构
[1] Tokyo Med & Dent Univ, Informat Ctr Med Sci, Bunkyo Ku, Tokyo 1138549, Japan
[2] Hamamatsu Univ Sch Med, Hamamatsu, Shizuoka 4312102, Japan
[3] Kobe Univ, Grad Sch Med, Kobe, Hyogo 657, Japan
关键词
Clinical genomics; Sequence variation; Markup Language; Global interoperability; Data interchange; Information model; IDENTIFICATION; DATABASE; SNPS;
D O I
10.1016/j.ijmedinf.2009.11.003
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Objective: With the aim of making good use of internationally accumulated genomic sequence variation data, which is increasing rapidly due to the explosive amount of genomic research at present, the development of an interoperable data exchange format and its international standardization are necessary. Genomic Sequence Variation Markup Language (GSVML) will focus on genomic sequence variation data and human health applications, such as gene based medicine or pharmacogenomics. Design and method: We developed GSVML through eight steps, based on case analysis and domain investigations. By focusing on the design scope to human health applications and genomic sequence variation, we attempted to eliminate ambiguity and to ensure practicability. We intended to satisfy the requirements derived from the use case analysis of human-based clinical genomic applications. Based on database investigations, we attempted to minimize the redundancy of the data format, while maximizing the data covering range. We also attempted to ensure communication and interface ability with other Markup Languages, for exchange of omics data among various omics researchers or facilities. The interface ability with developing clinical standards, such as the Health Level Seven Genotype Information model, was analyzed. Results: We developed the human health-oriented GSVML comprising variation data, direct annotation, and indirect annotation categories; the variation data category is required, while the direct and indirect annotation categories are optional. The annotation categories contain omics and clinical information, and have internal relationships. For designing, we examined 6 cases for three criteria as human health application and 15 data elements for three criteria as data formats for genomic sequence variation data exchange. The data format of five international SNP databases and six Markup Languages and the interface ability to the Health Level Seven Genotype Model in terms of 317 items were investigated. Conclusion: GSVML was developed as a potential data exchanging format for genomic sequence variation data exchange focusing on human health applications. The international standardization of GSVML is necessary, and is currently underway. GSVML can be applied to enhance the utilization of genomic sequence variation data worldwide by providing a communicable platform between clinical and research applications. (C) 2009 Elsevier Ireland Ltd. All rights reserved.
引用
收藏
页码:130 / 142
页数:13
相关论文
共 50 条
  • [1] The carbohydrate sequence markup language (CabosML): A XML description of carbohydrate structures
    Kikuchi, Norihiro
    Kameyama, Akihiko
    Nakaya, Shuuichi
    Ito, Hiromi
    Sato, Takashi
    Shikanai, Toshihide
    Takahashi, Yoriko
    Narimatsul, Hisashi
    MOLECULAR & CELLULAR PROTEOMICS, 2004, 3 (10) : S102 - S102
  • [2] The carbohydrate sequence markup language (CabosML): a XML description of carbohydrate structures
    Kikuchi, N
    Kameyama, A
    Nakaya, S
    Ito, H
    Sato, T
    Shikanai, T
    Takahashi, Y
    Narimatsu, H
    GLYCOBIOLOGY, 2004, 14 (11) : 1184 - 1184
  • [3] The carbohydrate sequence markup language (CabosML): an XML description of carbohydrate structures
    Kikuchi, N
    Kameyama, A
    Nakaya, S
    Ito, H
    Sato, T
    Shikanai, T
    Takahashi, Y
    Narimatsu, H
    BIOINFORMATICS, 2005, 21 (08) : 1717 - 1718
  • [4] Extensible markup language
    Bray, T
    SperbergMcQueen, CM
    SGML '96 CONFERENCE PROCEEDINGS - CELEBRATING A DECADE OF SGML, 1996, : 399 - 404
  • [5] Extensible markup language
    Udell, J
    BYTE, 1998, 23 (01): : 80 - 80
  • [6] The chemical markup language
    Liao, YM
    Ghanadan, H
    ANALYTICAL CHEMISTRY, 2002, 74 (13) : 389A - 390A
  • [7] CHEMICAL MARKUP LANGUAGE
    MURRAYRUST, P
    LEACH, C
    RZEPA, HS
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 1995, 210 : 40 - COMP
  • [8] The biopolymer markup language
    Fenyö, D
    BIOINFORMATICS, 1999, 15 (04) : 339 - 340
  • [9] Intensional markup language
    Wadge, WW
    DISTRIBUTED COMMUNITIES ON THE WEB, PROCEEDINGS, 2000, 1830 : 82 - 89
  • [10] Incorporating Units Markup Language (UnitsML) into AnIML (Analytical Information Markup Language)
    Jopp, Ronny
    Roth, Alexander
    Linstrom, Peter J.
    Kramer, Gary W.
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2006, 231