Efficient Graph Query Processing over Geo-Distributed Datacenters

被引:8
|
作者
Yuan, Ye [1 ]
Ma, Delong [2 ]
Wen, Zhenyu [3 ]
Ma, Yuliang [2 ]
Wang, Guoren [1 ]
Chen, Lei [4 ]
机构
[1] Beijing Inst Technol, Beijing, Peoples R China
[2] Northeastern Univ, Shenyang, Peoples R China
[3] Newcastle Univ, Newcastle Upon Tyne, Tyne & Wear, England
[4] Hong Kong Univ Sci & Technol, Hong Kong, Peoples R China
关键词
Graph search; Geo-distributed; Datacenters; MAPREDUCE;
D O I
10.1145/3397271.3401157
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Graph queries have emerged as one of the fundamental techniques to support modern search services, such as PageRank web search, social networking search and knowledge graph search. As such graphs are maintained globally and very huge (e.g., billions of nodes), we need to efficiently process graph queries across multiple geographically distributed datacenters, running geo-distributed graph queries. Existing graph computing frameworks may not work well for geographically distributed datacenters, because they implement a Bulk Synchronous Parallel model that requires excessive inter-datacenter transfers, thereby introducing extremely large latency for query processing. In this paper, we propose GeoGraph-a universal framework to support efficient geo-distributed graph query processing based on clustering datacenters and meta-graph, while reducing the inter-datacenter communication. Our new framework can be applied to many types of graph algorithms without any modification. The framework is developed on the top of Apache Giraph. The experiments were conducted by applying four important graph queries, i.e., shortest path, graph keyword search, subgraph isomorphism and PageRank. The evaluation results show that our proposed framework can achieve up to 82% faster convergence, 42% lower WAN bandwidth usage, and 45% less total monetary cost for the four graph queries, with input graphs stored across ten geo-distributed datacenters.
引用
收藏
页码:619 / 628
页数:10
相关论文
共 50 条
  • [31] Leveraging Endpoint Flexibility When Scheduling Coflows across Geo-distributed Datacenters
    Li, Wenxin
    Yuan, Xu
    Li, Keqiu
    Qi, Heng
    Zhou, Xiaobo
    IEEE CONFERENCE ON COMPUTER COMMUNICATIONS (IEEE INFOCOM 2018), 2018, : 873 - 881
  • [32] Scheduling Jobs across Geo-Distributed Datacenters with Max-Min Fairness
    Chen, Li
    Liu, Shuhao
    Li, Baochun
    Li, Bo
    IEEE INFOCOM 2017 - IEEE CONFERENCE ON COMPUTER COMMUNICATIONS, 2017,
  • [33] Cost Optimization for Time-Bounded Request Scheduling in Geo-Distributed Datacenters
    Wei, Xiaohui
    Li, Lanxin
    Wang, Xingwang
    Liu, Yuanyuan
    CLOUD COMPUTING AND SECURITY, PT II, 2017, 10603 : 601 - 610
  • [34] A Framework of Hypergraph-Based Data Placement Among Geo-Distributed Datacenters
    Yu, Boyang
    Pan, Jianping
    IEEE TRANSACTIONS ON SERVICES COMPUTING, 2020, 13 (03) : 395 - 409
  • [35] MAST: Global Scheduling of ML Training across Geo-Distributed Datacenters at Hyperscale
    Choudhury, Arnab
    Wang, Yang
    Pelkonen, Tuomas
    Srinivasan, Kutta
    Jain, Abha
    Lin, Shenghao
    David, Delia
    Soleimanifard, Siavash
    Chen, Michael
    Yadav, Abhishek
    Tijoriwala, Ritesh
    Samoylov, Denis
    Tang, Chunqiang
    PROCEEDINGS OF THE 18TH USENIX SYMPOSIUM ON OPERATING SYSTEMS DESIGN AND IMPLEMENTATION, OSDI 2024, 2024, : 563 - 580
  • [36] Bellini: Ferrying Application Traffic Flows through Geo-distributed Datacenters in the Cloud
    Liu, Zimu
    Feng, Yuan
    Li, Baochun
    2013 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2013, : 1753 - 1759
  • [37] Optimizing Network Transfers for Data Analytic Jobs Across Geo-Distributed Datacenters
    Chen, Li
    Liu, Shuhao
    Li, Baochun
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2022, 33 (02) : 403 - 414
  • [38] Geo-distributed efficient deployment of containers with Kubernetes
    Rossi, Fabiana
    Cardellini, Valeria
    Lo Presti, Francesco
    Nardelli, Matteo
    COMPUTER COMMUNICATIONS, 2020, 159 : 161 - 174
  • [39] Accelerating Geo-Distributed Transaction Processing with Fast Logging
    Ogura, Takuto
    Akita, Yoshiki
    Miyazawa, Yuki
    Kawashima, Hideyuki
    2021 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2021, : 2390 - 2399
  • [40] Truthful auction mechanisms for VNF chain provisioning and allocation across geo-distributed datacenters
    Wang, Xueyi
    Wang, Xingwei
    Wu, Dongkuo
    Ma, Lianbo
    Huang, Min
    COMPUTER NETWORKS, 2022, 217