VEDAS: an efficient GPU alternative for store and query of large RDF data sets

被引:3
|
作者
Makpaisit, Pisit [1 ]
Chantrapornchai, Chantana [1 ]
机构
[1] Kasetsart Univ, Dept Comp Engn, Bangkok, Thailand
关键词
Query processing; Parallel processing; Graphic Processing Units; Resource Description Framework; SPARQL; SPARQL QUERIES;
D O I
10.1186/s40537-021-00513-y
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Resource Description Framework (RDF) is commonly used as a standard for data interchange on the web. The collection of RDF data sets can form a large graph which consumes time to query. It is known that modern Graphic Processing Units (GPUs) can be employed to execute parallel programs in order to speedup the running time. In this paper, we propose a novel RDF data representation along with the query processing algorithm that is suitable for GPU processing. Since the main challenges of GPU architecture are the limited memory sizes, the memory transfer latency, and the vast number of GPU cores. Our system is designed to strengthen the use of GPU cores and reduce the effect of memory transfer. We propose a representation consists of indices and column-based RDF ID data that can reduce the GPU memory requirement. The indexing and pre-upload filtering techniques are then applied to reduce the data transfer between the host and GPU memory. We add the index swapping process to facilitate the sorting and joining data process based on the given variable and add the pre-upload step to reduce the size of results' storage, and the data transfer time. The experimental results show that our representation is about 35% smaller than the traditional NT format and 40% less compared to that of gStore. The query processing time can be speedup ranging from 1.95 to 397.03 when compared with RDF3X and gStore processing time with WatDiv test suite. It achieves speedup 578.57 and 62.97 for LUBM benchmark when compared to RDF-3X and gStore. The analysis shows the query cases which can gain benefits from our approach.
引用
收藏
页数:34
相关论文
共 50 条
  • [11] Domain-Driven Visual Query Formulation over RDF Data Sets
    Balis, Bartosz
    Grabiec, Tomasz
    Bubak, Marian
    PARALLEL PROCESSING AND APPLIED MATHEMATICS (PPAM 2013), PT I, 2014, 8384 : 293 - 301
  • [12] Efficient and Adaptable Query Workload-Aware Management for RDF Data
    MahmoudiNasab, Hooran
    Sakr, Sherif
    WEB INFORMATION SYSTEM ENGINEERING-WISE 2010, 2010, 6488 : 390 - +
  • [13] Efficient Continuous Skyline Query Processing Scheme over Large Dynamic Data Sets
    Li, He
    Yoo, Jaesoo
    ETRI JOURNAL, 2016, 38 (06) : 1197 - 1206
  • [14] Compact Representation of Large RDF Data Sets for Publishing and Exchange
    Fernandez, Javier D.
    Martinez-Prieto, Miguel A.
    Gutierrez, Claudio
    SEMANTIC WEB-ISWC 2010, PT I, 2010, 6496 : 193 - +
  • [15] Xlight, An Efficient Relational Schema To Store And Query XML Data
    Zafari, Hasan
    Hasani, Keramat
    Shiri, M. Ebrahim
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON DATA STORAGE AND DATA ENGINEERING (DSDE 2010), 2010, : 254 - 257
  • [16] Alternative input devices for efficient navigation of large CT angiography data sets
    Sherboncly, A
    Holmlund, D
    Rubin, GD
    Schraedley, PK
    Winograd, T
    Napel, S
    RADIOLOGY, 2005, 234 (02) : 391 - 398
  • [17] Query-driven visualization of large data sets
    Stockinger, K
    Shalf, J
    Wu, KS
    Bethel, EW
    IEEE VISUALIZATION 2005, PROCEEDINGS, 2005, : 167 - 174
  • [18] Efficient Query Analysis and Performance Evaluation of the Nosql Data Store for BigData
    Gupta, Sangeeta
    Narsimha, G.
    PROCEEDINGS OF THE FIRST INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND INFORMATICS, ICCII 2016, 2017, 507 : 549 - 558
  • [19] Efficient Spatio-temporal RDF Query Processing in Large Dynamic Knowledge Bases
    Vlachou, Akrivi
    Doulkeridis, Christos
    Glenis, Apostolos
    Santipantakis, Georgios M.
    Vouros, George A.
    SAC '19: PROCEEDINGS OF THE 34TH ACM/SIGAPP SYMPOSIUM ON APPLIED COMPUTING, 2019, : 439 - 447
  • [20] Efficient Query Construction for Large Scale Data
    Demidova, Elena
    Zhou, Xuan
    Nejdl, Wolfgang
    SIGIR'13: THE PROCEEDINGS OF THE 36TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH & DEVELOPMENT IN INFORMATION RETRIEVAL, 2013, : 573 - 582