Compliant Geo-distributed Data Processing in Action

Cited by: 3
Authors
Beedkar, Kaustubh [1]
Brekardin, David [1]
Quiané-Ruiz, Jorge-Arnulfo [1, 2]
Markl, Volker [1, 2]
Affiliations
[1] TU Berlin, Berlin, Germany
[2] DFKI, Kaiserslautern, Germany
Source
PROCEEDINGS OF THE VLDB ENDOWMENT | 2021 / Vol. 14 / No. 12
DOI
10.14778/3476311.3476359
Chinese Library Classification
TP [Automation technology, computer technology]
Subject classification code
0812
Abstract
In this paper, we present our work on compliant geo-distributed data processing. Our work focuses on the new dimension of dataflow constraints that regulate the movement of data across geographical or institutional borders. For example, European directives may allow only certain information fields (such as non-personal information) or aggregated data to be transferred. Thus, it is crucial for distributed data processing frameworks to consider compliance with the dataflow constraints derived from these regulations. We have developed a compliance-based data processing framework, which (i) allows for the declarative specification of dataflow constraints, (ii) determines whether a query can be translated into a compliant distributed query execution plan, and (iii) executes the compliant plan over distributed SQL databases. We demonstrate our framework using a geo-distributed adaptation of the TPC-H benchmark data. Our framework provides an interactive dashboard, which allows users to specify dataflow constraints and to analyze and execute compliant distributed query execution plans.
Pages: 2843-2846
Page count: 4