Data Provenance Analysis and Description for ETL based on PROV

被引:0
|
作者
Zhang Ran [1 ]
Dai Chao-fan [1 ]
Zeng Sai-hong [1 ]
机构
[1] Natl Univ Sci Technol, Dept Sci & Technol, Informat Syst Engn Lab, Changsha 410074, Hunan, Peoples R China
关键词
PROV ETL; Data provenance; Resource Description;
D O I
暂无
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Data provenance, also calling it data lineage or pedigree, is related information of data about the process from its generation to present situation. W3C workshop proposes PROV standards that rule what vocabularies/ ontologies/rules were used to generate data. It is the uniform standard for data provenance, which strengthens interoperations between different provenance information. ETL, which Extract-Transform-Load abbreviates to, is a description for the change process from data source to end, including extraction, transformation an d loading. In this paper, what we do is to analyze and design a system that can trace data and process correctly and effectively,and we focus on reverse rules and tracing method. As a result, we will do research on data provenance, which will be based on ETL and use PROV standards can make the tracing process better. What's more,we will give an introduction about provenan cc tree that is graphical representation of data tracing process
引用
收藏
页码:1651 / 1656
页数:6
相关论文
共 50 条
  • [41] UML2PROV: Automating Provenance Capture in Software Engineering
    Saenz-Adan, Carlos
    Perez, Beatriz
    Trung Dong Huynh
    Moreau, Luc
    SOFSEM 2018: THEORY AND PRACTICE OF COMPUTER SCIENCE, 2018, 10706 : 667 - 681
  • [42] Provenance-based analysis of data-centric processes
    Daniel Deutch
    Yuval Moskovitch
    Val Tannen
    The VLDB Journal, 2015, 24 : 583 - 607
  • [43] Designing a Provenance-Based Climate Data Analysis Application
    Santos, Emanuele
    Koop, David
    Maxwell, Thomas
    Doutriaux, Charles
    Ellqvist, Tommy
    Potter, Gerald
    Freire, Juliana
    Williams, Dean
    Silva, Claudio T.
    PROVENANCE AND ANNOTATION OF DATA AND PROCESSES, IPAW 2012, 2012, 7525 : 214 - 219
  • [44] Provenance-based analysis of data-centric processes
    Deutch, Daniel
    Moskovitch, Yuval
    Tannen, Val
    VLDB JOURNAL, 2015, 24 (04): : 583 - 607
  • [45] A Data Provenance based Architecture to Enhance the Reliability of Data Analysis for Industry 4.0
    Li, Peng
    Niggemann, Oliver
    2018 IEEE 23RD INTERNATIONAL CONFERENCE ON EMERGING TECHNOLOGIES AND FACTORY AUTOMATION (ETFA), 2018, : 1375 - 1382
  • [46] Automating Provenance Capture in Software Engineering with UML2PROV
    Saenz-Adan, Carlos
    Moreau, Luc
    Perez, Beatriz
    Miles, Simon
    Garcia-Izquierdo, Francisco J.
    PROVENANCE AND ANNOTATION OF DATA AND PROCESSES, IPAW 2018, 2018, 11017 : 58 - 70
  • [47] Extending Abstract Notation to Ontology Provenance using PROV-ASN
    Pandey, Mrinal
    Pandey, Rajiv
    2014 INTERNATIONAL CONFERENCE ON PARALLEL, DISTRIBUTED AND GRID COMPUTING (PDGC), 2014, : 374 - 379
  • [48] Versioned-PROV: A PROV Extension to Support Mutable Data Entities
    Pimentel, Joao Felipe N.
    Missier, Paolo
    Murta, Leonardo
    Braganholo, Vanessa
    PROVENANCE AND ANNOTATION OF DATA AND PROCESSES, IPAW 2018, 2018, 11017 : 87 - 100
  • [49] Research on Data Integration of Credit Cooperative Based on ETL
    Yang, Bin
    Wang, Lei
    2010 INTERNATIONAL CONFERENCE ON MANAGEMENT SCIENCE AND ENGINEERING (MSE 2010), VOL 3, 2010, : 290 - 293
  • [50] ETL-based interoperable data management system
    Wongoo Lee
    Minho Lee
    Yunsoo Choi
    Donghoon Choi
    Minhee Cho
    Sa-kwang Song
    Hanmin Jung
    DongHwi Lee
    Hwamook Yoon
    Multimedia Tools and Applications, 2014, 71 : 799 - 812