Data Provenance Analysis and Description for ETL based on PROV

被引:0
|
作者
Zhang Ran [1 ]
Dai Chao-fan [1 ]
Zeng Sai-hong [1 ]
机构
[1] Natl Univ Sci Technol, Dept Sci & Technol, Informat Syst Engn Lab, Changsha 410074, Hunan, Peoples R China
关键词
PROV ETL; Data provenance; Resource Description;
D O I
暂无
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Data provenance, also calling it data lineage or pedigree, is related information of data about the process from its generation to present situation. W3C workshop proposes PROV standards that rule what vocabularies/ ontologies/rules were used to generate data. It is the uniform standard for data provenance, which strengthens interoperations between different provenance information. ETL, which Extract-Transform-Load abbreviates to, is a description for the change process from data source to end, including extraction, transformation an d loading. In this paper, what we do is to analyze and design a system that can trace data and process correctly and effectively,and we focus on reverse rules and tracing method. As a result, we will do research on data provenance, which will be based on ETL and use PROV standards can make the tracing process better. What's more,we will give an introduction about provenan cc tree that is graphical representation of data tracing process
引用
收藏
页码:1651 / 1656
页数:6
相关论文
共 50 条
  • [21] PROV-TE: A Provenance-Driven Diagnostic Framework for Task Eviction in Data Centers
    Albatli, Abdulaziz
    McKee, David
    Townend, Paul
    Lau, Lydia
    Xu, Jie
    2017 THIRD IEEE INTERNATIONAL CONFERENCE ON BIG DATA COMPUTING SERVICE AND APPLICATIONS (IEEE BIGDATASERVICE 2017), 2017, : 233 - 242
  • [22] Experiencing PROV-Wf for Provenance Interoperability in SWfMSs
    Oliveira, Wellington
    De Oliveira, Daniel
    Braganholo, Vanessa
    PROVENANCE AND ANNOTATION OF DATA AND PROCESSES (IPAW 2014), 2015, 8628 : 294 - 296
  • [23] Prov-Replay: A Qualitative Analysis Framework for Gameplay Sessions Using Provenance and Replay
    Thurler, Leonardo
    Melo, Sidney
    Clua, Esteban
    Kohwalter, Troy
    ENTERTAINMENT COMPUTING, ICEC 2023, 2023, 14455 : 31 - 40
  • [24] SC-PROV: A Provenance Vocabulary for Social Computation
    Markovic, Milan
    Edwards, Peter
    Corsar, David
    PROVENANCE AND ANNOTATION OF DATA AND PROCESSES (IPAW 2014), 2015, 8628 : 285 - 287
  • [25] Nano-PROV: FAIRification workflow for generating nanopublications based on provenance and semantic enrichment
    Feijoó M.P.P.
    Jardim R.
    da Cruz S.M.S.
    Campos M.L.M.
    International Journal of Metadata, Semantics and Ontologies, 2023, 16 (02) : 138 - 151
  • [26] PROV-IO+: A Cross-Platform Provenance Framework for Scientific Data on HPC Systems
    Han, Runzhou
    Zheng, Mai
    Byna, Suren
    Tang, Houjun
    Dong, Bin
    Dai, Dong
    Chen, Yong
    Kim, Dongkyun
    Hassoun, Joseph
    Thorsley, David
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2024, 35 (05) : 844 - 861
  • [27] Fine-Grained Provenance for Matching & ETL
    Zheng, Nan
    Alawini, Abdussalam
    Ives, Zachary G.
    2019 IEEE 35TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2019), 2019, : 184 - 195
  • [28] Constructing data supply chain based on layered PROV
    Peng Li
    Tin-Yu Wu
    Xin-Ming Li
    Hong Luo
    Mohammad S. Obaidat
    The Journal of Supercomputing, 2017, 73 : 1509 - 1531
  • [29] Study of Constructing Data Supply Chain Based on PROV
    Lan, Jiewei
    Liu, Xiyun
    Luo, Hong
    Li, Peng
    BIG DATA COMPUTING AND COMMUNICATIONS, 2015, 9196 : 69 - 78
  • [30] Constructing data supply chain based on layered PROV
    Li, Peng
    Wu, Tin-Yu
    Li, Xin-Ming
    Luo, Hong
    Obaidat, Mohammad S.
    JOURNAL OF SUPERCOMPUTING, 2017, 73 (04): : 1509 - 1531