The Software Ontology (SWO): a resource for reproducibility in biomedical data analysis, curation and digital preservation

被引:55
|
作者
Malone, James [1 ]
Brown, Andy [2 ]
Lister, Allyson L. [2 ]
Ison, Jon [1 ]
Hull, Duncan [2 ]
Parkinson, Helen [1 ]
Stevens, Robert [2 ]
机构
[1] EBI, EMBL, Cambridge CB10 1SD, England
[2] Univ Manchester, Sch Comp Sci, Manchester M13 9PL, Lancs, England
来源
基金
英国工程与自然科学研究理事会;
关键词
WEB SERVICES; BIOINFORMATICS;
D O I
10.1186/2041-1480-5-25
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Motivation: Biomedical ontologists to date have concentrated on ontological descriptions of biomedical entities such as gene products and their attributes, phenotypes and so on. Recently, effort has diversified to descriptions of the laboratory investigations by which these entities were produced. However, much biological insight is gained from the analysis of the data produced from these investigations, and there is a lack of adequate descriptions of the wide range of software that are central to bioinformatics. We need to describe how data are analyzed for discovery, audit trails, provenance and reproducibility. Results: The Software Ontology (SWO) is a description of software used to store, manage and analyze data. Input to the SWO has come from beyond the life sciences, but its main focus is the life sciences. We used agile techniques to gather input for the SWO and keep engagement with our users. The result is an ontology that meets the needs of a broad range of users by describing software, its information processing tasks, data inputs and outputs, data formats versions and so on. Recently, the SWO has incorporated EDAM, a vocabulary for describing data and related concepts in bioinformatics. The SWO is currently being used to describe software used in multiple biomedical applications. Conclusion: The SWO is another element of the biomedical ontology landscape that is necessary for the description of biomedical entities and how they were discovered. An ontology of software used to analyze data produced by investigations in the life sciences can be made in such a way that it covers the important features requested and prioritized by its users. The SWO thus fits into the landscape of biomedical ontologies and is produced using techniques designed to keep it in line with user's needs. Availability: The Software Ontology is available under an Apache 2.0 license at http://theswo.sourceforge.net/; the Software Ontology blog can be read at http://softwareontology.wordpress.com.
引用
收藏
页数:13
相关论文
共 50 条
  • [11] Study on Contexts and Stages of Digital Content Curation Models: Guidelines for Use in Qualitative Analysis Software
    de Godoi e Silva, Katia Alexandra
    Costa, Antonio Pedro
    QUALITATIVE REPORT, 2023, 28 (10): : 2980 - 2994
  • [12] MicroScope-an integrated microbial resource for the curation and comparative analysis of genomic and metabolic data
    Vallenet, David
    Belda, Eugeni
    Calteau, Alexandra
    Cruveiller, Stephane
    Engelen, Stefan
    Lajus, Aurelie
    Le Fevre, Francois
    Longin, Cyrille
    Mornico, Damien
    Roche, David
    Rouy, Zoe
    Salvignol, Gregory
    Scarpelli, Claude
    Smith, Adam Alexander Thil
    Weiman, Marion
    Medigue, Claudine
    NUCLEIC ACIDS RESEARCH, 2013, 41 (D1) : E636 - E647
  • [13] Software Quality Assessment of a Web Application for Biomedical Data Analysis
    Lietz, Kristina
    Wiese, Ingmar
    Wiese, Lena
    IDEAS 2021: 25TH INTERNATIONAL DATABASE ENGINEERING & APPLICATIONS SYMPOSIUM, 2021, : 84 - 93
  • [14] Preparing experiments' software for long term analysis and data preservation
    Kemp, Yves
    Ozerov, Dmitry
    INTERNATIONAL CONFERENCE ON COMPUTING IN HIGH ENERGY AND NUCLEAR PHYSICS 2012 (CHEP2012), PTS 1-6, 2012, 396
  • [15] USING CAD SOFTWARE TO REDUCE THE AMOUNT OF DATA IN CASE OF DIGITAL PRESERVATION OF THE CULTURAL HERITAGE
    Badiu, I.
    Popescu, D.
    Cenusa, A.
    Buna, Z.
    Comes, R.
    2014 INTERNATIONAL CONFERENCE ON PRODUCTION RESEARCH - REGIONAL CONFERENCE AFRICA, EUROPE AND THE MIDDLE EAST AND 3RD INTERNATIONAL CONFERENCE ON QUALITY AND INNOVATION IN ENGINEERING AND MANAGEMENT (ICPR-AEM 2014), 2014, : 12 - 16
  • [16] Multilayer Perceptron discrimination of software quality in a biomedical data analysis system
    Alexiuk, MD
    Pizzi, NJ
    IEEE CCEC 2002: CANADIAN CONFERENCE ON ELECTRCIAL AND COMPUTER ENGINEERING, VOLS 1-3, CONFERENCE PROCEEDINGS, 2002, : 770 - 775
  • [17] A Domain-driven Approach to Digital Curation and Preservation of 3D Architectural Data: Stakeholder Identification and Alignment in the DURAARK Project
    Lindlar, Michelle
    Tamke, Martin
    ARCHIVING 2014, FINAL PROGRAM AND PROCEEDINGS, 2014, : 204 - 209
  • [18] 'Everyone has their reasons for curating the data they have decided to keep': a thematic analysis of data hoarding as digital curation practice
    Maemura, Emily
    Wagner, Travis L.
    INFORMATION RESEARCH-AN INTERNATIONAL ELECTRONIC JOURNAL, 2025, 30 : 789 - 797
  • [19] RCSB Protein Data Bank: Sustaining A Living Digital Data Resource That Enables Breakthroughs In Scientific Research And Biomedical Education
    Burley, Stephen K.
    Berman, Helen M.
    Christie, Cole
    Duarte, Jose M.
    Feng, Zukang
    Westbrook, John
    Young, Jasmine
    Zardecki, Christine
    FASEB JOURNAL, 2018, 32 (01):
  • [20] Rcsb Protein Data Bank: Sustaining a Living Digital Data Resource that Enables Breakthroughs in Scientific Research and Biomedical Education
    Burley, Stephen K.
    BIOPHYSICAL JOURNAL, 2019, 116 (03) : 329A - 329A