Improving Distance-Join Query processing with Voronoi-Diagram based partitioning in SpatialHadoop

被引:12
|
作者
Garcia-Garcia, Francisco [1 ]
Corral, Antonio [1 ]
Iribarne, Luis [1 ]
Vassilakopoulos, Michael [2 ]
机构
[1] Univ Almeria, Dept Informat, Almeria, Spain
[2] Univ Thessaly, Dept Elect & Comp Engn, Volos, Greece
关键词
Data partitioning; K nearest neighbors join; K closest pairs; SpatialHadoop; MapReduce; Spatial query evaluation; ALGORITHMS;
D O I
10.1016/j.future.2019.10.037
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
SpatialHadoop is an extended MapReduce framework supporting global indexing techniques that partition spatial datasets across several machines and improve spatial query processing performance compared to traditional Hadoop systems. SpatialHadoop supports several spatial operations (e.g., K Nearest Neighbor search, range query, spatial intersection join, etc.) and seven spatial partitioning techniques (Grid, Quadtree, STR, STR+, k-d tree, Z-curve and Hilbert-curve). Distance-Join Queries (DJQs), like the K Nearest Neighbors Join Query (KNNJQ) and K Closest Pairs Query (KCPQ), are common operations used in numerous spatial applications. DJQs are costly operations, since they combine spatial joins with distance-based search. Data partitioning improves the management of large datasets and speeds up query performance. Therefore, performing DJQs efficiently with new partitioning methods in SpatialHadoop is a challenging task. In this paper, a new data partitioning technique based on Voronoi-Diagrams is designed and implemented in SpatialHadoop. Moreover, improved KNNJQ and KCPQ MapReduce algorithms, using the new partitioning mechanism, are also designed and developed for SpatialHadoop. Finally, the results of an extensive set of experiments with real-world datasets are presented, demonstrating that the new partitioning technique and the improved DR MapReduce algorithms are efficient, scalable and robust in SpatialHadoop. (C) 2019 Elsevier B.V. All rights reserved.
引用
收藏
页码:723 / 740
页数:18
相关论文
共 50 条
  • [21] Multi-robot Coverage Using Voronoi Partitioning Based on Geodesic Distance
    Nair, Vishnu G.
    Guruprasad, K. R.
    CONTROL INSTRUMENTATION SYSTEMS, CISCON 2018, 2020, 581 : 59 - 66
  • [22] Manhattan Distance Based Voronoi Partitioning for Efficient Multi-robot Coverage
    Nair, Vishnu G.
    Guruprasad, K. R.
    CONTROL INSTRUMENTATION SYSTEMS, CISCON 2018, 2020, 581 : 81 - 90
  • [23] Directional nearest neighbor query method for specified geographical direction space based on Voronoi diagram
    李松
    SONG Shuang
    HAO Xiaohong
    ZHANG Liping
    High Technology Letters, 2022, 28 (02) : 122 - 133
  • [24] Directional nearest neighbor query method for specified geographical direction space based on Voronoi diagram
    Li S.
    Song S.
    Hao X.
    Zhang L.
    High Technology Letters, 2022, 28 (02): : 122 - 133
  • [25] Voronoi-based reverse nearest neighbor query processing on spatial networks
    Safar, Maytham
    Ibrahimi, Dariush
    Taniar, David
    MULTIMEDIA SYSTEMS, 2009, 15 (05) : 295 - 308
  • [26] Voronoi-based reverse nearest neighbor query processing on spatial networks
    Maytham Safar
    Dariush Ibrahimi
    David Taniar
    Multimedia Systems, 2009, 15 : 295 - 308
  • [27] Voronoi-based range and continuous range query processing in mobile databases
    Xuan, Kefeng
    Zhao, Geng
    Taniar, David
    Rahayu, Wenny
    Safar, Maytham
    Srinivasan, Bala
    JOURNAL OF COMPUTER AND SYSTEM SCIENCES, 2011, 77 (04) : 637 - 651
  • [28] A parallel query processing system based on graph-based database partitioning
    Nam, Yoon-Min
    Han, Donghyoung
    Kim, Min-Soo
    INFORMATION SCIENCES, 2019, 480 : 237 - 260
  • [29] Probabilistic Threshold k-ANN Query Method Based on Uncertain Voronoi Diagram in Internet of Vehicles
    Li, Song
    Li, Bohan
    Yu, Jiaxi
    Zhang, Liping
    Zhang, Anman
    Cai, Ken
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2021, 22 (06) : 3592 - 3602
  • [30] Improving the performance of pipelined query processing with skipping—and its comparison to document-wise partitioning
    Simon Jonassen
    Svein Erik Bratsberg
    World Wide Web, 2014, 17 : 949 - 967