Federated SPARQL Queries Processing with Replicated Fragments

Cited by: 13
Authors
Montoya, Gabriela [1, 2]
Skaf-Molli, Hala [1]
Molli, Pascal [1]
Vidal, Maria-Esther [3]
Affiliations
[1] Univ Nantes, LINA, Nantes, France
[2] CNRS, Unit UMR6241, Nantes, France
[3] Univ Simon Bolivar, Caracas, Venezuela
Source
Keywords
Linked data; Federated query processing; Source selection; Fragment replication
DOI
10.1007/978-3-319-25007-6_3
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Federated query engines provide a unified query interface to federations of SPARQL endpoints. Replicating data fragments from different Linked Data sources facilitates data re-organization to better fit the federated query processing needs of data consumers. However, existing federated query engines are not designed to support replication, and replicated data can negatively impact their performance. In this paper, we formulate the source selection problem with fragment replication (SSP-FR). For a given set of endpoints with replicated fragments and a SPARQL query, the problem is to select the endpoints that minimize the number of tuples to be transferred. We devise the FEDRA source selection algorithm that approximates SSP-FR. We implement FEDRA in the state-of-the-art federated query engines FedX and ANAPSID, and empirically evaluate their performance. Experimental results suggest that FEDRA efficiently solves SSP-FR, reducing the number of selected SPARQL endpoints as well as the size of query intermediate results.
Pages: 36 - 51
Number of pages: 16