Making data platforms smarter with MOSES

被引:11
|
作者
Francia, Matteo [1 ]
Gallinucci, Enrico [1 ,2 ]
Golfarelli, Matteo [1 ,2 ]
Leoni, Anna Giulia [1 ,2 ]
Rizzi, Stefano [1 ,2 ]
Santolini, Nicola [1 ]
机构
[1] Univ Bologna, DISI, Via Univ 50, I-47522 Cesena, Italy
[2] Univ Bologna, CIRI ICT, Via Univ 50, I-47522 Cesena, Italy
关键词
Data lake; Metadata; Big data; Data platform; DATA LAKE; PROVENANCE; MANAGEMENT;
D O I
10.1016/j.future.2021.06.031
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
The rise of data platforms has enabled the collection and processing of huge volumes of data, but has opened to the risk of losing their control. Collecting proper metadata about raw data and transformations can significantly reduce this risk. In this paper we propose MOSES, a technology-agnostic, extensible, and customizable framework for metadata handling in big data platforms. The framework hinges on a metadata repository that stores information about the objects in the big data platform and the processes that transform them. MOSES provides a wide range of functionalities to different types of users of the platform. Differently from previous high-level proposals, MOSES is fully implemented and it was not conceived for a specific technology. Besides discussing the rationale and the features of MOSES, in this paper we describe its implementation and we test it on a real case study. The ultimate goal is to take a significant step forward towards proving that metadata handling in big data platforms is feasible and beneficial. (C) 2021 Elsevier B.V. All rights reserved.
引用
收藏
页码:299 / 313
页数:15
相关论文
共 50 条
  • [1] Big Data: Making Cities Smarter
    Ni, Lionel M.
    2016 17TH INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED COMPUTING, APPLICATIONS AND TECHNOLOGIES (PDCAT), 2016, : XX - XX
  • [2] Smarter Together: Progressing Smart Data Platforms in Lyon, Munich, and Vienna
    Morishita-Steffen, Naomi
    Alberola, Remi
    Mougeot, Baptiste
    Vignali, Etienne
    Wikstrom, Camilla
    Montag, Uwe
    Gastaud, Emmanuel
    Lutz, Brigitte
    Hartmann, Gerhard
    Pfaffenbichler, Franz Xaver
    Hainoun, Ali
    Gaiddon, Bruno
    Marvuglia, Antonino
    Andreucci, Maria Beatrice
    ENERGIES, 2021, 14 (04)
  • [3] Smarter consolidation into Hadoop platforms
    Kobielus, James
    IBM Data Management Magazine, 2012, (04):
  • [4] big data analytics making the smart grid smarter
    Hong, Tao
    IEEE POWER & ENERGY MAGAZINE, 2018, 16 (03): : 12 - 16
  • [5] Smarter Cities’ Attractiveness. Testing New Criteria or Facets: “Data Scientists” and “Data Platforms”
    Maurice Baslé
    Journal of the Knowledge Economy, 2021, 12 : 268 - 278
  • [6] Smarter Cities' Attractiveness. Testing New Criteria or Facets: "Data Scientists" and "Data Platforms"
    Basle, Maurice
    JOURNAL OF THE KNOWLEDGE ECONOMY, 2021, 12 (01) : 268 - 278
  • [7] Crop-Planning, Making Smarter Agriculture With Climate Data
    Pajarito Grajales, Diego Fabian
    Asprilla Mosquera, Geidy Jhoana
    Mejia, Fabian
    Cardona Piedrahita, Leonardo
    2015 FOURTH INTERNATIONAL CONFERENCE ON AGRO-GEOINFORMATICS, 2015,
  • [8] Collaboration platforms in smarter water management
    Hidaka, C. E.
    Jasperse, J.
    Kolar, H. R.
    Williams, R. P.
    IBM JOURNAL OF RESEARCH AND DEVELOPMENT, 2011, 55 (1-2)
  • [9] Making Clothing Smarter
    Chandler, David L.
    IEEE PULSE, 2017, 8 (06) : 54 - 57
  • [10] Making rooms smarter
    Tucker, P
    FUTURIST, 2005, 39 (05) : 6 - 6