A Graph-Based Indexing Technique to Enhance the Performance of Boolean AND Queries in Big Data Systems

被引:1
|
作者
Mohideen, Abdulla Kalandar [1 ]
Majumdar, Shikharesh [1 ]
St-Hilaire, Marc [1 ]
El-Haraki, A. [2 ]
机构
[1] Carleton Univ, Dept Syst & Comp Engn, Ottawa, ON, Canada
[2] Telus, Ottawa, ON, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
text indexing; keyword search; Boolean queries; graph-based index;
D O I
10.1109/CCGrid49817.2020.00-24
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper introduces a new graph-based indexing (GBI) technique for big data systems. It uses a directed graph structure that effectively captures the simultaneous occurrence of multiple keywords in the same document. The objective is to use the relationship between the search keywords captured in the graph structure to effectively retrieve all results of Boolean AND queries at once. The performance of the proposed technique is compared with the conventional inverted index-based technique. This paper highlights that, irrespective of the intersection algorithm used to evaluate Boolean AND queries, GBI always returns Boolean AND search results faster than the inverted index. This is due to the fact that GBI always performs a smaller number of intersection operations and avoids intersection if search keywords do not have a common document. A preliminary performance analysis is performed through prototyping and measurement on a system subjected to a synthetic workload. The analysis shows that GBI improves search latency when executing Boolean AND queries by an average of 69% to 99.9% in comparison to the inverted index.
引用
收藏
页码:677 / 680
页数:4
相关论文
共 50 条
  • [1] Graph-based Indexing Method for Searching in RDF Data
    Kyu, Khin Myat
    Oo, Aung Nway
    2019 INTERNATIONAL CONFERENCE ON ADVANCED INFORMATION TECHNOLOGIES (ICAIT), 2019, : 96 - 101
  • [2] A graph-based approach for modeling and indexing video data
    Lee, Jeongkyu
    ISM 2006: EIGHTH IEEE INTERNATIONAL SYMPOSIUM ON MULTIMEDIA, PROCEEDINGS, 2006, : 348 - 355
  • [3] Graph-based shape indexing
    Demirci, M. Fatih
    MACHINE VISION AND APPLICATIONS, 2012, 23 (03) : 541 - 555
  • [4] Graph-based shape indexing
    M. Fatih Demirci
    Machine Vision and Applications, 2012, 23 : 541 - 555
  • [5] A Graph-based Model for Big Data Warehouses Governance
    Costa, Ines
    Galvao, Joao
    Magalhaes, Fernando
    Yasmina Santos, Maribel
    EDUCATION EXCELLENCE AND INNOVATION MANAGEMENT: A 2025 VISION TO SUSTAIN ECONOMIC DEVELOPMENT DURING GLOBAL CHALLENGES, 2020, : 5816 - 5829
  • [6] Graph-Based Feature Crossing to Enhance Recommender Systems
    Cai, Congyu
    Chen, Hong
    Liu, Yunxuan
    Chen, Daoquan
    Zhou, Xiuze
    Lin, Yuanguo
    MATHEMATICS, 2025, 13 (02)
  • [7] Biospytial: spatial graph-based computing for ecological Big Data
    Molgora, Juan M. Escamilla
    Sedda, Luigi
    Atkinson, Peter M.
    GIGASCIENCE, 2020, 9 (05):
  • [8] GRAPH-BASED ALGORITHMS FOR BOOLEAN FUNCTION MANIPULATION
    BRYANT, RE
    IEEE TRANSACTIONS ON COMPUTERS, 1986, 35 (08) : 677 - 691
  • [9] Universal designated verifier transitive signatures for graph-based big data
    Hou, Shuquan
    Huang, Xinyi
    Liu, Joseph K.
    Li, Jin
    Xu, Li
    INFORMATION SCIENCES, 2015, 318 : 144 - 156
  • [10] Collaborative Graph-based Mechanism for Distributed Big Data Leakage Prevention
    Lu, Yunlong
    Huang, Xiaohong
    Li, Dandan
    Zhang, Yan
    2018 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2018,