Improving Distance-Join Query processing with Voronoi-Diagram based partitioning in SpatialHadoop

被引:12
|
作者
Garcia-Garcia, Francisco [1 ]
Corral, Antonio [1 ]
Iribarne, Luis [1 ]
Vassilakopoulos, Michael [2 ]
机构
[1] Univ Almeria, Dept Informat, Almeria, Spain
[2] Univ Thessaly, Dept Elect & Comp Engn, Volos, Greece
关键词
Data partitioning; K nearest neighbors join; K closest pairs; SpatialHadoop; MapReduce; Spatial query evaluation; ALGORITHMS;
D O I
10.1016/j.future.2019.10.037
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
SpatialHadoop is an extended MapReduce framework supporting global indexing techniques that partition spatial datasets across several machines and improve spatial query processing performance compared to traditional Hadoop systems. SpatialHadoop supports several spatial operations (e.g., K Nearest Neighbor search, range query, spatial intersection join, etc.) and seven spatial partitioning techniques (Grid, Quadtree, STR, STR+, k-d tree, Z-curve and Hilbert-curve). Distance-Join Queries (DJQs), like the K Nearest Neighbors Join Query (KNNJQ) and K Closest Pairs Query (KCPQ), are common operations used in numerous spatial applications. DJQs are costly operations, since they combine spatial joins with distance-based search. Data partitioning improves the management of large datasets and speeds up query performance. Therefore, performing DJQs efficiently with new partitioning methods in SpatialHadoop is a challenging task. In this paper, a new data partitioning technique based on Voronoi-Diagrams is designed and implemented in SpatialHadoop. Moreover, improved KNNJQ and KCPQ MapReduce algorithms, using the new partitioning mechanism, are also designed and developed for SpatialHadoop. Finally, the results of an extensive set of experiments with real-world datasets are presented, demonstrating that the new partitioning technique and the improved DR MapReduce algorithms are efficient, scalable and robust in SpatialHadoop. (C) 2019 Elsevier B.V. All rights reserved.
引用
收藏
页码:723 / 740
页数:18
相关论文
共 50 条
  • [1] Voronoi-Diagram Based Partitioning for Distance Join Query Processing in SpatialHadoop
    Garcia-Garcia, Francisco
    Corral, Antonio
    Iribarne, Luis
    Vassilakopoulos, Michael
    MODEL AND DATA ENGINEERING, MEDI 2018, 2018, 11163 : 251 - 267
  • [2] Distance-Join: Pattern Match Query In a Large Graph Database
    Zou, Lei
    Chen, Lei
    Oezsu, M. Tamer
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2009, 2 (01): : 886 - 897
  • [3] Voronoi-diagram based heuristics for the location of Mobile and Unreliable Service Providers
    Szakas, Joseph
    Trefftz, Christian
    PROCEEDINGS OF THE IASTED INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTER SCIENCE AND TECHNOLOGY, 2006, : 73 - +
  • [4] Spatial adjacency query based on Voronoi diagram
    Li, Chengming
    Chen, Jun
    Zhu, Yinghao
    Wuhan Cehui Keji Daxue Xuebao/Journal of Wuhan Technical University of Surveying and Mapping, 23 (02): : 128 - 131
  • [5] Nearest neighbor query processing using the network voronoi diagram
    Wang, Mei-Tzu
    DATA & KNOWLEDGE ENGINEERING, 2016, 103 : 19 - 43
  • [6] Efficient large-scale distance-based join queries in spatialhadoop
    Francisco García-García
    Antonio Corral
    Luis Iribarne
    Michael Vassilakopoulos
    Yannis Manolopoulos
    GeoInformatica, 2018, 22 : 171 - 209
  • [7] Efficient large-scale distance-based join queries in spatialhadoop
    Garcia-Garcia, Francisco
    Corral, Antonio
    Iribarne, Luis
    Vassilakopoulos, Michael
    Manolopoulos, Yannis
    GEOINFORMATICA, 2018, 22 (02) : 171 - 209
  • [8] Cost-efficient Spatial Network Partitioning for Distance-based Query Processing
    Wang, Jiping
    Zheng, Kai
    Jeung, Hoyoung
    Wang, Haozhou
    Zheng, Bolong
    Zhou, Xiaofang
    2014 IEEE 15TH INTERNATIONAL CONFERENCE ON MOBILE DATA MANAGEMENT (MDM), VOL 1, 2014, : 13 - 22
  • [9] RkNN query algorithm based on K-order Voronoi diagram
    1600, Science and Engineering Research Support Society (07):
  • [10] Optimizing join index based join processing: A graph partitioning approach
    Ravada, S
    Shekhar, S
    Lu, CT
    Chawla, S
    SEVENTEENTH IEEE SYMPOSIUM ON RELIABLE DISTRIBUTED SYSTEMS, PROCEEDINGS, 1998, : 302 - 308