DAMS: A Distributed Analytics Metadata Schema

被引:0
|
作者
Sascha Welten [1 ]
Laurenz Neumann [1 ]
Yeliz Ucer Yediel [2 ]
Luiz Olavo Bonino da Silva Santos [3 ,4 ]
Stefan Decker [1 ,2 ]
Oya Beyan [2 ,5 ]
机构
[1] Chair Informatik ,RWTH Aachen University
[2] Fraunhofer Institute for Applied Information Techniques (FIT)
[3] Faculty of Electrical Engineering,Mathematics and Computer Science,University of Twente
[4] Department of Human Genetics,Leiden University Medical Centre
[5] Institute of Medical Information,Faculty of Medicine & University Hospital Cologne,University of
关键词
D O I
暂无
中图分类号
TP311.13 [];
学科分类号
1201 ;
摘要
In recent years, implementations enabling Distributed Analytics(DA) have gained considerable attention due to their ability to perform complex analysis tasks on decentralised data by bringing the analysis to the data. These concepts propose privacy-enhancing alternatives to data centralisation approaches, which have restricted applicability in case of sensitive data due to ethical, legal or social aspects. Nevertheless, the immanent problem of DA-enabling architectures is the black-box-alike behaviour of the highly distributed components originating from the lack of semantically enriched descriptions, particularly the absence of basic metadata for data sets or analysis tasks. To approach the mentioned problems, we propose a metadata schema for DA infrastructures, which provides a vocabulary to enrich the involved entities with descriptive semantics. We initially perform a requirement analysis with domain experts to reveal necessary metadata items, which represents the foundation of our schema. Afterwards, we transform the obtained domain expert knowledge into user stories and derive the most significant semantic content. In the final step, we enable machine-readability via RDF(S) and SHACL serialisations. We deploy our schema in a proof-of-concept monitoring dashboard to validate its contribution to the transparency of DA architectures. Additionally, we evaluate the schema's compliance with the FAIR principles. The evaluation shows that the schema succeeds in increasing transparency while being compliant with most of the FAIR principles. Because a common metadata model is critical for enhancing the compatibility between multiple DA infrastructures, our work lowers data access and analysis barriers. It represents an initial and infrastructure-independent foundation for the FAIRification of DA and the underlying scientific data management.
引用
收藏
页码:528 / 547
页数:20
相关论文
共 50 条
  • [21] A metadata schema for data objects in clinical research
    Canham, Steve
    Ohmann, Christian
    TRIALS, 2016, 17
  • [22] Designing metadata schema for a human library: a prototype
    Jana, Anupta
    Rout, Rosalien
    DIGITAL LIBRARY PERSPECTIVES, 2022, 38 (03) : 346 - 361
  • [23] ProMetaS - A Metadata Schema for Process Engineering and Industry
    Sherpa, Lincoln
    Mueller-Pfefferkorn, Ralph
    Tolksdorf, Gregor
    Khaydarov, Valentin
    Wiedau, Michael
    Urbas, Leon
    CHEMIE INGENIEUR TECHNIK, 2023, 95 (07) : 1041 - 1048
  • [24] A metadata schema for data objects in clinical research
    Steve Canham
    Christian Ohmann
    Trials, 17
  • [25] An Open Metadata Schema for Clinical Pathway (openCP) in China
    Xu, Wei
    Zhu, Yanxin
    Wang, Xia
    MEDINFO 2017: PRECISION HEALTHCARE THROUGH INFORMATICS, 2017, 245 : 1344 - 1344
  • [26] On-demand partial schema delivery for multimedia metadata
    Davis, Stephen J.
    Burnett, Ian S.
    2006 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO - ICME 2006, VOLS 1-5, PROCEEDINGS, 2006, : 1513 - +
  • [27] XML Schema Mappings: Data Exchange and Metadata Management
    Amano, Shun'ichi
    David, Claire
    Libkin, Leonid
    Murlak, Filip
    JOURNAL OF THE ACM, 2014, 61 (02)
  • [28] Schema exchange: Generic mappings for transforming data and metadata
    Papotti, Paolo
    Torlone, Riccardo
    DATA & KNOWLEDGE ENGINEERING, 2009, 68 (07) : 665 - 682
  • [29] Towards an Entity-based Scientific Metadata Schema
    Xu, Hao
    APPLIED MATERIALS AND TECHNOLOGIES FOR MODERN MANUFACTURING, PTS 1-4, 2013, 423-426 : 2751 - 2754
  • [30] A metadata classification schema for semantic content analysis of videos
    Shotton, DM
    Rodríguez, A
    Guil, N
    Trelles, O
    JOURNAL OF MICROSCOPY, 2002, 205 : 33 - 42