Compliant Geo-distributed Data Processing in Action

被引:3
|
作者
Beedkar, Kaustubh [1 ]
Brekardin, David [1 ]
Quiane-Ruiz, Jorge-Anulfo [1 ,2 ]
Markl, Volker [1 ,2 ]
机构
[1] TU Berlin, Berlin, Germany
[2] DFKI, Kaiserslautern, Germany
来源
PROCEEDINGS OF THE VLDB ENDOWMENT | 2021年 / 14卷 / 12期
关键词
D O I
10.14778/3476311.3476359
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper we present our work on compliant geo distributed data processing. Our work focuses on the new dimension of dataflow constraints that regulate the movement of data across geographical or institutional borders. For example, European directives may regulate transferring only certain information fields (such as non personal information) or aggregated data. Thus, it is crucial for distributed data processing frameworks to consider compliance with respect to dataflow constraints derived from these regulations. We have developed a compliance-based data processing framework, which (i) allows for the declarative specification of dataflow constraints, (ii) determines if a query can be translated into a compliant distributed query execution plan, and (iii) executes the compliant plan over distributed SQL databases. We demonstrate our framework using a geo-distributed adaptation of the TPC-H benchmark data. Our framework provides an interactive dashboard, which allows users to specify dataflow constraints, and analyze and execute compliant distributed query execution plans.
引用
收藏
页码:2843 / 2846
页数:4
相关论文
共 50 条
  • [21] Holistic Management of Sustainable Geo-Distributed Data Centers
    Abbasi, Zahra
    Gupta, Sandeep K. S.
    2015 IEEE 22ND INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING (HIPC), 2015, : 426 - 435
  • [22] Bohr: Similarity Aware Geo-Distributed Data Analytics
    Li, Hangyu
    Xu, Hong
    Nutanong, Sarana
    CONEXT'18: PROCEEDINGS OF THE 14TH INTERNATIONAL CONFERENCE ON EMERGING NETWORKING EXPERIMENTS AND TECHNOLOGIES, 2018, : 267 - 279
  • [23] GDSim: Benchmarking Geo-Distributed Data Center Schedulers
    Alves, Daniel
    Obraczka, Katia
    Kabbani, Abdul
    2021 IEEE 10TH INTERNATIONAL CONFERENCE ON CLOUD NETWORKING (IEEE CLOUDNET), 2021, : 148 - 156
  • [24] Joint Data Purchasing and Data Placement in a Geo-Distributed Data Market
    Ren, Xiaoqi
    London, Palma
    Ziani, Juba
    Wierman, Adam
    SIGMETRICS/PERFORMANCE 2016: PROCEEDINGS OF THE SIGMETRICS/PERFORMANCE JOINT INTERNATIONAL CONFERENCE ON MEASUREMENT AND MODELING OF COMPUTER SCIENCE, 2016, : 383 - 384
  • [25] Efficient Graph Query Processing over Geo-Distributed Datacenters
    Yuan, Ye
    Ma, Delong
    Wen, Zhenyu
    Ma, Yuliang
    Wang, Guoren
    Chen, Lei
    PROCEEDINGS OF THE 43RD INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR '20), 2020, : 619 - 628
  • [27] Dynamic Data Replication Across Geo-Distributed Cloud Data Centres
    Jayalakshmi, D. S.
    Ranjana, T. P. Rashmi
    Ramaswamy, Srinivasan
    DISTRIBUTED COMPUTING AND INTERNET TECHNOLOGY (ICDCIT 2016), 2016, 9581 : 182 - 187
  • [28] Demonstration of Geo-Distributed Data Processing and Aggregation in MEC-empowered Metro Optical Networks
    Zhang, Jiawei
    Cui, Lu
    Liu, Zhen
    Ji, Yuefeng
    2020 OPTICAL FIBER COMMUNICATIONS CONFERENCE AND EXPOSITION (OFC), 2020,
  • [29] Doctoral Symposium: Self-Adaptive Data Stream Processing in Geo-Distributed Computing Environments
    Russo, Gabriele Russo
    DEBS'19: PROCEEDINGS OF THE 13TH ACM INTERNATIONAL CONFERENCE ON DISTRIBUTED AND EVENT-BASED SYSTEMS, 2019, : 276 - 279
  • [30] Elastic, Geo-Distributed RAFT
    Xu, Zichen
    Stewart, Christopher
    Huang, Jiacheng
    PROCEEDINGS OF THE IEEE/ACM INTERNATIONAL SYMPOSIUM ON QUALITY OF SERVICE (IWQOS 2019), 2019,