SubZero: A Fine-Grained Lineage System for Scientific Databases

Cited by: 0
Authors
Wu, Eugene [1]
Madden, Samuel [1]
Stonebraker, Michael [1]
Affiliation
[1] MIT, CSAIL, Cambridge, MA 02139 USA
Keywords
DOI
Not available
Chinese Library Classification number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
Data lineage is a key component of provenance that helps scientists track and query relationships between input and output data. While current systems readily support lineage relationships at the file or data array level, finer-grained support at an array-cell level is impractical due to the lack of support for user-defined operators and the high runtime and storage overhead to store such lineage. We interviewed scientists in several domains to identify a set of common semantics that can be leveraged to efficiently store fine-grained lineage. We use these insights to define lineage representations that efficiently capture common locality properties in the lineage data, and a set of APIs so operator developers can easily export lineage information from user-defined operators. Finally, we introduce two benchmarks derived from astronomy and genomics, and show that our techniques can reduce lineage query costs by up to 10x while incurring substantially less impact on workflow runtime and storage.
Pages: 865 - 876
Page count: 12
Related papers
  • [21] Multi-Analyst Differential Privacy with Fine-Grained Provenance for Databases
    He, Xi
    SIGMOD RECORD, 2024, 53 (04)
  • [22] Squall: Fine-Grained Live Reconfiguration for Partitioned Main Memory Databases
    Elmore, Aaron J.
    Arora, Vaibhav
    Taft, Rebecca
    Pavlo, Andrew
    Agrawal, Divyakant
    El Abbadi, Amr
    SIGMOD'15: PROCEEDINGS OF THE 2015 ACM SIGMOD INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2015, : 299 - 313
  • [23] Fine-Grained Access Control in Hybrid Relational-XML Databases
    Sasaki, Taketo
    Fukushima, Takuya
    Park, Daeil
    Toyama, Motomichi
    2008 THIRD INTERNATIONAL CONFERENCE ON DIGITAL INFORMATION MANAGEMENT, VOLS 1 AND 2, 2008, : 611 - +
  • [24] A Distributed System for The Management of Fine-grained Provenance
    Sultana, Salmin
    Bertino, Elisa
    JOURNAL OF DATABASE MANAGEMENT, 2015, 26 (02) : 32 - 47
  • [25] A fine-grained social network recommender system
    Aivazoglou, Markos
    Roussos, Antonios O.
    Margaris, Dionisis
    Vassilakis, Costas
    Ioannidis, Sotiris
    Polakis, Jason
    Spiliotopoulos, Dimitris
    SOCIAL NETWORK ANALYSIS AND MINING, 2019, 10 (01)
  • [26] FIFS: Fine-grained Indoor Fingerprinting System
    Xiao, Jiang
    Wu, Kaishun
    Yi, Youwen
    Ni, Lionel M.
    2012 21ST INTERNATIONAL CONFERENCE ON COMPUTER COMMUNICATIONS AND NETWORKS (ICCCN), 2012,
  • [27] A Fine-Grained Metric System for the Completeness of Metadata
    Margaritopoulos, Thomas
    Margaritopoulos, Merkourios
    Mavridis, Ioannis
    Manitsaris, Athanasios
    METADATA AND SEMANTIC RESEARCH, PROCEEDINGS, 2009, 46 : 83 - 94
  • [28] A fine-grained social network recommender system
    Markos Aivazoglou
    Antonios O. Roussos
    Dionisis Margaris
    Costas Vassilakis
    Sotiris Ioannidis
    Jason Polakis
    Dimitris Spiliotopoulos
    Social Network Analysis and Mining, 2020, 10
  • [29] Improve Fine-Grained Feature Learning in Fine-Grained DataSet GAI
    Wang, Hai Peng
    Geng, Zhi Qing
    IEEE ACCESS, 2025, 13 : 12777 - 12788
  • [30] Operating system protection for fine-grained programs
    Jaeger, T
    Liedtke, J
    Islam, N
    PROCEEDINGS OF THE SEVENTH USENIX SECURITY SYMPOSIUM, 1998, : 143 - 157