DAMS: A Distributed Analytics Metadata Schema

Authors
Sascha Welten [1]
Laurenz Neumann [1]
Yeliz Ucer Yediel [2]
Luiz Olavo Bonino da Silva Santos [3,4]
Stefan Decker [1,2]
Oya Beyan [2,5]
Affiliations
[1] Chair Informatik, RWTH Aachen University
[2] Fraunhofer Institute for Applied Information Techniques (FIT)
[3] Faculty of Electrical Engineering, Mathematics and Computer Science, University of Twente
[4] Department of Human Genetics, Leiden University Medical Centre
[5] Institute of Medical Information, Faculty of Medicine & University Hospital Cologne, University of
DOI
Not available
CLC classification number
TP311.13
Subject classification code
1201
Abstract
In recent years, implementations enabling Distributed Analytics (DA) have gained considerable attention due to their ability to perform complex analysis tasks on decentralised data by bringing the analysis to the data. These concepts offer privacy-enhancing alternatives to data centralisation approaches, which have restricted applicability in the case of sensitive data due to ethical, legal or social aspects. Nevertheless, an inherent problem of DA-enabling architectures is the black-box-like behaviour of the highly distributed components, which originates from the lack of semantically enriched descriptions, particularly the absence of basic metadata for data sets or analysis tasks. To address these problems, we propose a metadata schema for DA infrastructures, which provides a vocabulary to enrich the involved entities with descriptive semantics. We first perform a requirement analysis with domain experts to reveal the necessary metadata items, which form the foundation of our schema. Afterwards, we transform the obtained domain expert knowledge into user stories and derive the most significant semantic content. In the final step, we enable machine-readability via RDF(S) and SHACL serialisations. We deploy our schema in a proof-of-concept monitoring dashboard to validate its contribution to the transparency of DA architectures. Additionally, we evaluate the schema's compliance with the FAIR principles. The evaluation shows that the schema succeeds in increasing transparency while being compliant with most of the FAIR principles. Because a common metadata model is critical for enhancing the compatibility between multiple DA infrastructures, our work lowers data access and analysis barriers. It represents an initial and infrastructure-independent foundation for the FAIRification of DA and the underlying scientific data management.
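As a rough illustration of what machine-readable metadata plus shape-style validation can buy, the following minimal sketch stores metadata as subject/predicate/object triples and checks that each typed entity carries a set of required properties. It uses only the Python standard library and is a toy stand-in for the idea behind RDF(S) and SHACL, not either standard itself. All `ex:` terms, the `SHAPES` table and the `validate` helper are hypothetical and are not taken from the DAMS schema.

```python
from typing import NamedTuple


class Triple(NamedTuple):
    """One statement: subject, predicate, object (all plain strings here)."""
    subject: str
    predicate: str
    obj: str


RDF_TYPE = "rdf:type"

# Hypothetical "shapes": required properties per class.
# These names are invented for illustration, not part of DAMS.
SHAPES = {
    "ex:AnalysisTask": ["ex:title", "ex:creator"],
    "ex:DataSet": ["ex:title", "ex:format"],
}


def validate(triples, shapes=SHAPES):
    """Return a sorted list of (node, missing_property) violations."""
    # Index which predicates are present on each subject.
    present = {}
    for t in triples:
        present.setdefault(t.subject, set()).add(t.predicate)

    violations = []
    for t in triples:
        if t.predicate != RDF_TYPE:
            continue
        # Check every property the node's class requires.
        for required in shapes.get(t.obj, []):
            if required not in present[t.subject]:
                violations.append((t.subject, required))
    return sorted(violations)


# Example graph: the analysis task lacks a creator, the data set is complete.
graph = [
    Triple("ex:task1", RDF_TYPE, "ex:AnalysisTask"),
    Triple("ex:task1", "ex:title", "Cohort size count"),
    Triple("ex:data1", RDF_TYPE, "ex:DataSet"),
    Triple("ex:data1", "ex:title", "Station A records"),
    Triple("ex:data1", "ex:format", "text/csv"),
]

report = validate(graph)
print(report)  # [('ex:task1', 'ex:creator')]
```

A real deployment would presumably express the same constraints as SHACL shapes and run them through a SHACL processor; the point of the sketch is only that a missing metadata item becomes a detectable, reportable condition rather than a silent gap.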
Pages: 528-547
Page count: 20
Source: DATA INTELLIGENCE, 2021, 3 (04): 528-547