Cluster-Based Joins for Federated SPARQL Queries

被引:0
|
作者
Yang, Fan [1 ]
Crainiceanu, Adina [2 ]
Chen, Zhiyuan [1 ]
Needham, Don [2 ]
机构
[1] Univ Maryland, Baltimore, MD 21250 USA
[2] United States Naval Acad, Annapolis, MD 21402 USA
关键词
Clustering algorithms; Resource description framework; Costs; Distributed databases; Seaports; Pattern matching; Marine vehicles; RDF; SPARQL; federated queries; join; cluster; SYSTEM; RDF;
D O I
10.1109/TKDE.2021.3135507
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Federated RDF systems allow users to retrieve data from multiple independent sources without needing to have all the data in the same triple store. The performance of these systems can be poor for large and geographically distributed RDF data where network transfer costs are high. This article introduces CBTP-OL and CBTP-Nhop, two novel join algorithms that take advantage of network topology to decrease the cost of processing Basic Graph Pattern (BGP) SPARQL queries in a geographically distributed environment. Federation members are grouped in clusters, based on the network communication cost between the members, and the bulk of the join processing is pushed to the clusters. Our CBTP-OL and CBTL-Nhop algorithms use an overlap list and, respectively, an N-hop overlap list, to efficiently compute join results from triples in different clusters. We implement our algorithms in the OpenRDF Sesame federated framework and use Apache Rya triple store instances as federation members. Experimental evaluation results show the advantages of our approach over existing techniques.
引用
收藏
页码:3525 / 3539
页数:15
相关论文
共 50 条
  • [21] Extending SPARQL with Similarity Joins
    Ferrada, Sebastian
    Bustos, Benjamin
    Hogan, Aidan
    SEMANTIC WEB - ISWC 2020, PT I, 2020, 12506 : 201 - 217
  • [22] Similarity joins and clustering for SPARQL
    Ferrada, Sebastian
    Bustos, Benjamin
    Hogan, Aidan
    SEMANTIC WEB, 2024, 15 (05) : 1701 - 1732
  • [23] Extended Adaptive Join Operator with Bind-Bloom Join for Federated SPARQL Queries
    Oguz, Damla
    Yin, Shaoyi
    Ergenc, Belgin
    Hameurlain, Abdelkader
    Dikenelli, Oguz
    INTERNATIONAL JOURNAL OF DATA WAREHOUSING AND MINING, 2017, 13 (03) : 47 - 72
  • [24] Energy-efficient filtering for skyline queries in cluster-based sensor networks
    Yin, Bo
    Lin, Yaping
    Yu, Jianping
    Luo, Qing
    COMPUTERS & ELECTRICAL ENGINEERING, 2014, 40 (02) : 350 - 366
  • [25] Adaptive cluster-based browsing using incrementally expanded queries and its effects
    Eguchi, K
    SIGIR'99: PROCEEDINGS OF 22ND INTERNATIONAL CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 1999, : 265 - 266
  • [26] Energy-Aware Edge Association for Cluster-Based Personalized Federated Learning
    Li, Yixuan
    Qin, Xiaoqi
    Chen, Hao
    Han, Kaifeng
    Zhang, Ping
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2022, 71 (06) : 6756 - 6761
  • [27] An Embedding-based Approach to Recommending SPARQL Queries
    Zhang, Lijing
    Zhang, Xiaowang
    Feng, Zhiyong
    2018 IEEE 30TH INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI), 2018, : 991 - 998
  • [28] SPARQL2NL-Verbalizing SPARQL queries
    Ngomo, Axel-Cyrille Ngonga
    Buehmann, Lorenz
    Unger, Christina
    Lehmann, Jens
    Gerber, Daniel
    PROCEEDINGS OF THE 22ND INTERNATIONAL CONFERENCE ON WORLD WIDE WEB (WWW'13 COMPANION), 2013, : 329 - 332
  • [29] Beyond Classical SERVICE Clause in Federated SPARQL Queries: Leveraging the Full Potential of URI Parameters
    Corby, Olivier
    Faron, Catherine
    Gandon, Fabien
    Graux, Damien
    Michel, Franck
    PROCEEDINGS OF THE 17TH INTERNATIONAL CONFERENCE ON WEB INFORMATION SYSTEMS AND TECHNOLOGIES (WEBIST), 2021, : 65 - 76
  • [30] On the formulation of performant SPARQL queries
    Loizou, Antonis
    Angles, Renzo
    Groth, Paul
    JOURNAL OF WEB SEMANTICS, 2015, 31 : 1 - 26