Improving Distance-Join Query processing with Voronoi-Diagram based partitioning in SpatialHadoop

被引:12
|
作者
Garcia-Garcia, Francisco [1 ]
Corral, Antonio [1 ]
Iribarne, Luis [1 ]
Vassilakopoulos, Michael [2 ]
机构
[1] Univ Almeria, Dept Informat, Almeria, Spain
[2] Univ Thessaly, Dept Elect & Comp Engn, Volos, Greece
关键词
Data partitioning; K nearest neighbors join; K closest pairs; SpatialHadoop; MapReduce; Spatial query evaluation; ALGORITHMS;
D O I
10.1016/j.future.2019.10.037
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
SpatialHadoop is an extended MapReduce framework supporting global indexing techniques that partition spatial datasets across several machines and improve spatial query processing performance compared to traditional Hadoop systems. SpatialHadoop supports several spatial operations (e.g., K Nearest Neighbor search, range query, spatial intersection join, etc.) and seven spatial partitioning techniques (Grid, Quadtree, STR, STR+, k-d tree, Z-curve and Hilbert-curve). Distance-Join Queries (DJQs), like the K Nearest Neighbors Join Query (KNNJQ) and K Closest Pairs Query (KCPQ), are common operations used in numerous spatial applications. DJQs are costly operations, since they combine spatial joins with distance-based search. Data partitioning improves the management of large datasets and speeds up query performance. Therefore, performing DJQs efficiently with new partitioning methods in SpatialHadoop is a challenging task. In this paper, a new data partitioning technique based on Voronoi-Diagrams is designed and implemented in SpatialHadoop. Moreover, improved KNNJQ and KCPQ MapReduce algorithms, using the new partitioning mechanism, are also designed and developed for SpatialHadoop. Finally, the results of an extensive set of experiments with real-world datasets are presented, demonstrating that the new partitioning technique and the improved DR MapReduce algorithms are efficient, scalable and robust in SpatialHadoop. (C) 2019 Elsevier B.V. All rights reserved.
引用
收藏
页码:723 / 740
页数:18
相关论文
共 50 条
  • [41] Sequence similarity query processing technique based on two-partitioning frequency transformation
    Wang, Guo-Ren
    Ge, Jian
    Xu, Heng-Yu
    Zheng, Ruo-Shi
    Ruan Jian Xue Bao/Journal of Software, 2006, 17 (02): : 232 - 241
  • [42] Inverse Distance Weighting Method Based on a Dynamic Voronoi Diagram for Thermal Reconstruction with Limited Sensor Data on Multiprocessors
    Li, Xin
    Rong, Mengtian
    Liu, Tao
    Zhou, Liang
    IEICE TRANSACTIONS ON ELECTRONICS, 2011, E94C (08): : 1295 - 1301
  • [43] SigMR: MapReduce-based SPARQL query processing by signature encoding and multi-way join
    Ahn, Jinhyun
    Im, Dong-Hyuk
    Kim, Hong-Gee
    JOURNAL OF SUPERCOMPUTING, 2015, 71 (10): : 3695 - 3725
  • [44] SigMR: MapReduce-based SPARQL query processing by signature encoding and multi-way join
    Jinhyun Ahn
    Dong-Hyuk Im
    Hong-Gee Kim
    The Journal of Supercomputing, 2015, 71 : 3695 - 3725
  • [45] Mobile agent based self-adaptive join for wide-area distributed query processing
    Arcangeli, JP
    Hameurlain, A
    Migeon, E
    Morvan, F
    JOURNAL OF DATABASE MANAGEMENT, 2004, 15 (04) : 25 - 44
  • [46] Playing Fetch with CAT Composing Cache Partitioning and Prefetching for Task-based Query Processing
    Zeng, Qitian
    Hale, Kyle C.
    Glavic, Boris
    17TH INTERNATIONAL WORKSHOP ON DATA MANAGEMENT ON NEW HARDWARE, DAMON 2021, 2021,
  • [47] Improving the Reliability and Availability of Vehicular Communications using Voronoi Diagram-based Placement of Road Side Units
    Patil, Prithviraj
    Gokhale, Aniruddha
    2012 31ST INTERNATIONAL SYMPOSIUM ON RELIABLE DISTRIBUTED SYSTEMS (SRDS 2012), 2012, : 400 - 401
  • [48] SJCBMQ: A novel spatial join-based algorithm for continuous border monitoring query processing in data streams
    Zhang, Yunyi
    Huang, Chongzheng
    Zhang, Deyun
    2007 2ND INTERNATIONAL CONFERENCE ON PERVASIVE COMPUTING AND APPLICATIONS, VOLS 1 AND 2, 2007, : 291 - +
  • [49] Improving Execution Efficiency of Just-in-time Compilation based Query Processing on GPUs
    Paul, Johns
    He, Bingsheng
    Lu, Shengliang
    Lau, Chiew Tong
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2020, 14 (02): : 202 - 214
  • [50] In-Network Processing of an Iceberg Join Query in Wireless Sensor Networks Based on 2-Way Fragment Semijoins
    Kang, Hyunchul
    SENSORS, 2015, 15 (03): : 6105 - 6132