Data Provenance Analysis and Description for ETL based on PROV

Authors
Zhang Ran [1]
Dai Chao-fan [1]
Zeng Sai-hong [1]
Affiliation
[1] Natl Univ Sci Technol, Dept Sci & Technol, Informat Syst Engn Lab, Changsha 410074, Hunan, Peoples R China
Keywords
PROV; ETL; Data provenance; Resource description
DOI
Not available
Chinese Library Classification
T [Industrial Technology]
Subject classification code
08
Abstract
Data provenance, also called data lineage or pedigree, is information about the process a piece of data has gone through from its generation to its present state. The W3C has proposed the PROV standards, which specify the vocabularies, ontologies, and rules used to describe how data were generated. PROV serves as a uniform standard for data provenance and strengthens interoperability between different sources of provenance information. ETL (Extract-Transform-Load) describes the process that moves data from source to destination, comprising extraction, transformation, and loading. In this paper, we analyze and design a system that traces data and processes correctly and efficiently, focusing on reverse rules and the tracing method. Our study of data provenance is based on ETL and uses the PROV standards to improve the tracing process. We also introduce the provenance tree, a graphical representation of the data tracing process.
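To illustrate the idea of tracing data backwards through an ETL run, here is a minimal Python sketch. It is not the authors' system: it assumes provenance has been recorded with two relations from the W3C PROV data model, `wasGeneratedBy` (entity to activity) and `used` (activity to entities), and every identifier in it (`source_db`, `staging_rows`, `trace_back`, and so on) is hypothetical. The nested dictionary it returns is one possible reading of a "provenance tree".

```python
# PROV-style relations recorded during a toy ETL run.
# All entity and activity names are hypothetical, chosen for illustration.
# was_generated_by: entity -> activity that produced it
# used:             activity -> entities it consumed
was_generated_by = {
    "staging_rows": "extract",
    "clean_rows": "transform",
    "warehouse_table": "load",
}
used = {
    "extract": ["source_db"],
    "transform": ["staging_rows", "lookup_table"],
    "load": ["clean_rows"],
}


def trace_back(entity):
    """Build a nested provenance tree for `entity` by walking the relations backwards."""
    activity = was_generated_by.get(entity)
    if activity is None:
        # No recorded generating activity: treat the entity as an original source.
        return {"entity": entity, "generated_by": None, "inputs": []}
    return {
        "entity": entity,
        "generated_by": activity,
        "inputs": [trace_back(e) for e in used.get(activity, [])],
    }


def sources(tree):
    """Collect the leaf entities (original sources) of a provenance tree."""
    if not tree["inputs"]:
        return [tree["entity"]]
    result = []
    for child in tree["inputs"]:
        result.extend(sources(child))
    return result


tree = trace_back("warehouse_table")
print(sources(tree))  # ['source_db', 'lookup_table']
```

Walking from the loaded table back through `load`, `transform`, and `extract` reaches the two entities that no recorded activity produced, which is the kind of reverse tracing the abstract refers to.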
Pages: 1651-1656
Page count: 6