VEDAS: an efficient GPU alternative for store and query of large RDF data sets

被引:3
|
作者
Makpaisit, Pisit [1 ]
Chantrapornchai, Chantana [1 ]
机构
[1] Kasetsart Univ, Dept Comp Engn, Bangkok, Thailand
关键词
Query processing; Parallel processing; Graphic Processing Units; Resource Description Framework; SPARQL; SPARQL QUERIES;
D O I
10.1186/s40537-021-00513-y
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Resource Description Framework (RDF) is commonly used as a standard for data interchange on the web. The collection of RDF data sets can form a large graph which consumes time to query. It is known that modern Graphic Processing Units (GPUs) can be employed to execute parallel programs in order to speedup the running time. In this paper, we propose a novel RDF data representation along with the query processing algorithm that is suitable for GPU processing. Since the main challenges of GPU architecture are the limited memory sizes, the memory transfer latency, and the vast number of GPU cores. Our system is designed to strengthen the use of GPU cores and reduce the effect of memory transfer. We propose a representation consists of indices and column-based RDF ID data that can reduce the GPU memory requirement. The indexing and pre-upload filtering techniques are then applied to reduce the data transfer between the host and GPU memory. We add the index swapping process to facilitate the sorting and joining data process based on the given variable and add the pre-upload step to reduce the size of results' storage, and the data transfer time. The experimental results show that our representation is about 35% smaller than the traditional NT format and 40% less compared to that of gStore. The query processing time can be speedup ranging from 1.95 to 397.03 when compared with RDF3X and gStore processing time with WatDiv test suite. It achieves speedup 578.57 and 62.97 for LUBM benchmark when compared to RDF-3X and gStore. The analysis shows the query cases which can gain benefits from our approach.
引用
收藏
页数:34
相关论文
共 50 条
  • [1] VEDAS: an efficient GPU alternative for store and query of large RDF data sets
    Pisit Makpaisit
    Chantana Chantrapornchai
    Journal of Big Data, 8
  • [2] Entailment Processing for Large RDF Data Sets Using GPU
    Chantrapornchai, Chantana
    Choksuchat, Chidchanok
    Haidl, Michael
    Gorlatch, Sergei
    NEW TRENDS IN SOFTWARE METHODOLOGIES, TOOLS AND TECHNIQUES, 2016, 286 : 333 - 345
  • [3] Query Execution for RDF Data on Row and Column Store
    Padiya, Trupti
    Bhise, Minal
    Vasani, Sandeep
    Pandey, Mohit
    DISTRIBUTED COMPUTING AND INTERNET TECHNOLOGY, ICDCIT 2015, 2015, 8956 : 403 - 408
  • [4] Efficient Distributed Query Processing on Large Scale RDF Graph Data
    Wang X.
    Xu Q.
    Chai L.-L.
    Yang Y.-J.
    Chai Y.-P.
    Ruan Jian Xue Bao/Journal of Software, 2019, 30 (03): : 498 - 514
  • [5] Efficient indexing RDF query algorithm for big data
    Zeng, Yiqun
    Wang, Jingbin
    MACHINERY ELECTRONICS AND CONTROL ENGINEERING III, 2014, 441 : 691 - 694
  • [6] Towards Efficient SPARQL Query Processing on RDF Data
    刘畅
    王昊奋
    俞勇
    徐林昊
    TsinghuaScienceandTechnology, 2010, 15 (06) : 613 - 622
  • [7] Research on Efficient SPARQL Query Processing for RDF Data
    Zhang, Yi
    PROCEEDINGS OF THE 2015 2ND INTERNATIONAL WORKSHOP ON MATERIALS ENGINEERING AND COMPUTER SCIENCES (IWMECS 2015), 2015, 33 : 476 - 482
  • [8] Towards efficient SPARQL query processing on RDF data
    Liu C.
    Wang H.
    Yu Y.
    Xu L.
    Tsinghua Science and Technology, 2010, 15 (06) : 613 - 622
  • [9] Efficient GPU processing method to analyze large GWAS data sets
    Agapito, Giuseppe
    Guardasole, Gaetano
    Cannataro, Mario
    2024 32ND EUROMICRO INTERNATIONAL CONFERENCE ON PARALLEL, DISTRIBUTED AND NETWORK-BASED PROCESSING, PDP 2024, 2024, : 260 - 265
  • [10] Grace: An Efficient Parallel SPARQL Query System over Large-Scale RDF Data
    Kang, Xiang
    Zhao, Yuying
    Yuan, Pingpeng
    Jin, Hai
    PROCEEDINGS OF THE 2021 IEEE 24TH INTERNATIONAL CONFERENCE ON COMPUTER SUPPORTED COOPERATIVE WORK IN DESIGN (CSCWD), 2021, : 769 - 774