The Software Ontology (SWO): a resource for reproducibility in biomedical data analysis, curation and digital preservation

被引:55
|
作者
Malone, James [1 ]
Brown, Andy [2 ]
Lister, Allyson L. [2 ]
Ison, Jon [1 ]
Hull, Duncan [2 ]
Parkinson, Helen [1 ]
Stevens, Robert [2 ]
机构
[1] EBI, EMBL, Cambridge CB10 1SD, England
[2] Univ Manchester, Sch Comp Sci, Manchester M13 9PL, Lancs, England
来源
基金
英国工程与自然科学研究理事会;
关键词
WEB SERVICES; BIOINFORMATICS;
D O I
10.1186/2041-1480-5-25
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Motivation: Biomedical ontologists to date have concentrated on ontological descriptions of biomedical entities such as gene products and their attributes, phenotypes and so on. Recently, effort has diversified to descriptions of the laboratory investigations by which these entities were produced. However, much biological insight is gained from the analysis of the data produced from these investigations, and there is a lack of adequate descriptions of the wide range of software that are central to bioinformatics. We need to describe how data are analyzed for discovery, audit trails, provenance and reproducibility. Results: The Software Ontology (SWO) is a description of software used to store, manage and analyze data. Input to the SWO has come from beyond the life sciences, but its main focus is the life sciences. We used agile techniques to gather input for the SWO and keep engagement with our users. The result is an ontology that meets the needs of a broad range of users by describing software, its information processing tasks, data inputs and outputs, data formats versions and so on. Recently, the SWO has incorporated EDAM, a vocabulary for describing data and related concepts in bioinformatics. The SWO is currently being used to describe software used in multiple biomedical applications. Conclusion: The SWO is another element of the biomedical ontology landscape that is necessary for the description of biomedical entities and how they were discovered. An ontology of software used to analyze data produced by investigations in the life sciences can be made in such a way that it covers the important features requested and prioritized by its users. The SWO thus fits into the landscape of biomedical ontologies and is produced using techniques designed to keep it in line with user's needs. Availability: The Software Ontology is available under an Apache 2.0 license at http://theswo.sourceforge.net/; the Software Ontology blog can be read at http://softwareontology.wordpress.com.
引用
收藏
页数:13
相关论文
共 50 条
  • [1] The Software Ontology (SWO): a resource for reproducibility in biomedical data analysis, curation and digital preservation
    James Malone
    Andy Brown
    Allyson L Lister
    Jon Ison
    Duncan Hull
    Helen Parkinson
    Robert Stevens
    Journal of Biomedical Semantics, 5
  • [2] Ontology for mapping the technological dependence of digital objects in the context of digital curation and preservation
    Yamaoka, Eloi Juniti
    ATOZ-NOVAS PRATICAS EM INFORMACAO E CONHECIMENTO, 2012, 1 (02): : 65 - 78
  • [3] Video shot analysis for digital curation and preservation of historical films
    Helm, D.
    Kampel, M.
    GCH 2019 - Eurographics Workshop on Graphics and Cultural Heritage, 2019, : 25 - 28
  • [4] DIGITAL CURATION IN QUALITATIVE ANALYSIS SOFTWARE: ANALYSIS OF THE TRAINING PROCESS OF RESEARCHERS
    Silva, Katia Alexandra de Godoi E.
    Costa, Antonio Pedro
    Pinto, Sandro Teixeira
    CADERNOS EDUCACAO TECNOLOGIA E SOCIEDADE, 2023, 16 : 123 - 133
  • [5] Digital data preservation and curation: A collaboration among libraries, publishers, and the virtual observatory
    Hanisch, R. J.
    Steffen, J.
    Choudhury, S.
    DiLauro, T.
    Szalay, A.
    Vishniac, E.
    Ehling, T.
    Milkey, R.
    Plante, R.
    LIBRARY AND INFORMATION SERVICES IN ASTRONOMY V: COMMON CHALLENGES, UNCOMMON SOLUTIONS, 2007, 377 : 29 - +
  • [6] Live Demonstration: Enhancing Biomedical Research Precision, Productivity and Reproducibility via Autonomous Data Acquisition and Robust Data Curation
    Gtat, Yousef
    Mason, Andrew J.
    2017 IEEE BIOMEDICAL CIRCUITS AND SYSTEMS CONFERENCE (BIOCAS), 2017,
  • [7] Community Stories and Institutional Stewardship: Digital Curation's Dual Roles of Story Creation and Resource Preservation
    Kunda, Sue
    Anderson-Wilk, Mark
    PORTAL-LIBRARIES AND THE ACADEMY, 2011, 11 (04) : 895 - 914
  • [8] Live Demonstration: Automated Data Acquisition and Digital Curation Platform for Enhancing Research Precision Productivity and Reproducibility
    Gtat, Yousef
    Parsnejad, Sina
    Mason, Andrew J.
    2017 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS), 2017,
  • [9] Discrimination of software quality in a biomedical data analysis system
    Pizzi, NJ
    Demko, A
    Vivanco, R
    JOINT 9TH IFSA WORLD CONGRESS AND 20TH NAFIPS INTERNATIONAL CONFERENCE, PROCEEDINGS, VOLS. 1-5, 2001, : 1702 - 1707
  • [10] The storage and preservation of files related to digital bank credit notes: ensuring their reproducibility amidst software obsolescence
    Dios, Marcelle Mourelle Perez
    Rocha, Maria Gabryelle Dantas
    ACERVO, 2024, 37 (03): : 36 - 36