DAMS: A Distributed Analytics Metadata Schema

Authors
Sascha Welten [1]
Laurenz Neumann [1]
Yeliz Ucer Yediel [2]
Luiz Olavo Bonino da Silva Santos [3, 4]
Stefan Decker [1, 2]
Oya Beyan [2, 5]
Affiliations
[1] Chair Informatik, RWTH Aachen University
[2] Fraunhofer Institute for Applied Information Techniques (FIT)
[3] Faculty of Electrical Engineering, Mathematics and Computer Science, University of Twente
[4] Department of Human Genetics, Leiden University Medical Centre
[5] Institute of Medical Information, Faculty of Medicine & University Hospital Cologne, University of
Keywords
DOI
Not available
Chinese Library Classification number
TP311.13
Discipline classification code
1201
Abstract
In recent years, implementations enabling Distributed Analytics (DA) have gained considerable attention due to their ability to perform complex analysis tasks on decentralised data by bringing the analysis to the data. These concepts offer privacy-enhancing alternatives to data centralisation approaches, which have restricted applicability in the case of sensitive data due to ethical, legal or social aspects. Nevertheless, an inherent problem of DA-enabling architectures is the black-box-like behaviour of the highly distributed components, which originates from the lack of semantically enriched descriptions, particularly the absence of basic metadata for data sets or analysis tasks. To address these problems, we propose a metadata schema for DA infrastructures, which provides a vocabulary to enrich the involved entities with descriptive semantics. We first perform a requirement analysis with domain experts to reveal necessary metadata items, which form the foundation of our schema. Afterwards, we transform the obtained domain expert knowledge into user stories and derive the most significant semantic content. In the final step, we enable machine-readability via RDF(S) and SHACL serialisations. We deploy our schema in a proof-of-concept monitoring dashboard to validate its contribution to the transparency of DA architectures. Additionally, we evaluate the schema's compliance with the FAIR principles. The evaluation shows that the schema succeeds in increasing transparency while being compliant with most of the FAIR principles. Because a common metadata model is critical for enhancing the compatibility between multiple DA infrastructures, our work lowers data access and analysis barriers. It represents an initial and infrastructure-independent foundation for the FAIRification of DA and the underlying scientific data management.
Pages: 528-547
Page count: 20