Cluster-Based Joins for Federated SPARQL Queries

被引:0
|
作者
Yang, Fan [1 ]
Crainiceanu, Adina [2 ]
Chen, Zhiyuan [1 ]
Needham, Don [2 ]
机构
[1] Univ Maryland, Baltimore, MD 21250 USA
[2] United States Naval Acad, Annapolis, MD 21402 USA
关键词
Clustering algorithms; Resource description framework; Costs; Distributed databases; Seaports; Pattern matching; Marine vehicles; RDF; SPARQL; federated queries; join; cluster; SYSTEM; RDF;
D O I
10.1109/TKDE.2021.3135507
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Federated RDF systems allow users to retrieve data from multiple independent sources without needing to have all the data in the same triple store. The performance of these systems can be poor for large and geographically distributed RDF data where network transfer costs are high. This article introduces CBTP-OL and CBTP-Nhop, two novel join algorithms that take advantage of network topology to decrease the cost of processing Basic Graph Pattern (BGP) SPARQL queries in a geographically distributed environment. Federation members are grouped in clusters, based on the network communication cost between the members, and the bulk of the join processing is pushed to the clusters. Our CBTP-OL and CBTL-Nhop algorithms use an overlap list and, respectively, an N-hop overlap list, to efficiently compute join results from triples in different clusters. We implement our algorithms in the OpenRDF Sesame federated framework and use Apache Rya triple store instances as federation members. Experimental evaluation results show the advantages of our approach over existing techniques.
引用
收藏
页码:3525 / 3539
页数:15
相关论文
共 50 条
  • [31] Reverse Engineering SPARQL Queries
    Arenas, Marcelo
    Diaz, Gonzalo I.
    Kostylev, Egor V.
    PROCEEDINGS OF THE 25TH INTERNATIONAL CONFERENCE ON WORLD WIDE WEB (WWW'16), 2016, : 239 - 249
  • [32] Computing Recursive SPARQL Queries
    Atzori, Maurizio
    2014 IEEE INTERNATIONAL CONFERENCE ON SEMANTIC COMPUTING (ICSC), 2014, : 258 - 259
  • [33] Explaining similarity for SPARQL queries
    Wang, Meng
    Chen, Kefei
    Xiao, Gang
    Zhang, Xinyue
    Chen, Hongxu
    Wang, Sen
    WORLD WIDE WEB-INTERNET AND WEB INFORMATION SYSTEMS, 2021, 24 (05): : 1813 - 1835
  • [34] Optimizing SPARQL Queries with SHACL
    Thapa, Ratan Bahadur
    Giese, Martin
    SEMANTIC WEB, ISWC 2023, PART I, 2023, 14265 : 41 - 60
  • [35] Canonicalisation of SPARQL 1.1 Queries
    Salas, Jaime
    COMPANION PROCEEDINGS OF THE WEB CONFERENCE 2022, WWW 2022 COMPANION, 2022, : 318 - 323
  • [36] On the Expressivity of ASK Queries in SPARQL
    Zhang, Xiaowang
    Van den Bussche, Jan
    Wang, Kewen
    Zhang, Heng
    Yang, Xuanxing
    Feng, Zhiyong
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 3057 - 3064
  • [37] Explaining similarity for SPARQL queries
    Meng Wang
    Kefei Chen
    Gang Xiao
    Xinyue Zhang
    Hongxu Chen
    Sen Wang
    World Wide Web, 2021, 24 : 1813 - 1835
  • [38] For the DISTINCT Clause of SPARQL Queries
    Atre, Medha
    PROCEEDINGS OF THE 25TH INTERNATIONAL CONFERENCE ON WORLD WIDE WEB (WWW'16 COMPANION), 2016, : 7 - 8
  • [39] CB-FL: Cluster-Based Federated Learning applied to Quality of Experience modelling
    Porcu, Simone
    Floris, Alessandro
    Atzori, Luigi
    2022 16TH INTERNATIONAL CONFERENCE ON SIGNAL-IMAGE TECHNOLOGY & INTERNET-BASED SYSTEMS, SITIS, 2022, : 585 - 591
  • [40] Blockchain-enabled Edge Computing Framework for Hierarchic Cluster-based Federated Learning
    Huang, Xiaoge
    Wu, Yuhang
    Chen, Zhi
    Chen, Qianbin
    Zhang, Jie
    2022 14TH INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS AND SIGNAL PROCESSING, WCSP, 2022, : 33 - 37