Parallel co-location mining with MapReduce and NoSQL systems

被引:16
|
作者
Yoo, Jin Soung [1 ]
Boulware, Douglas [2 ]
Kimmey, David [1 ]
机构
[1] Purdue Univ Ft Wayne, Dept Comp Sci, Ft Wayne, IN 46805 USA
[2] Air Force Res Lab, Rome Res Site, New York, NY USA
关键词
Spatial data mining; Parallel co-location mining; Cloud computing; MapReduce; NoSQL; COLOCATION PATTERNS; DATA SETS; FRAMEWORK;
D O I
10.1007/s10115-019-01381-y
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
With the rapid growth of georeferenced data, large-scale data processing and analysis methods are needed for spatial big data. Spatial co-location pattern mining is an interesting and important issue in spatial data mining area which discovers the subsets of features whose objects are frequently located together in geographic proximity. There are several works for efficiently processing co-location pattern discovery; however, they may be insufficient for large dense spatial data because the mining task takes up a lot of processing time and memory. In this work, we leveraged the power of a modern distributed computing platform, Hadoop, and developed an algorithm (called ParColoc) for parallel co-location mining on the MapReduce framework. This study explored challenge issues in designing the parallel co-location mining algorithm and solved them with adopting a spatial declusteirng technique and a NoSQL system. We conducted an experimental evaluation with real-world data and synthetic data to examine the effectiveness of proposed methods. The experiment result shows that ParColoc is a promising method for parallel co-location mining in cloud computing environment.
引用
收藏
页码:1433 / 1463
页数:31
相关论文
共 50 条
  • [31] A Framework for Co-location Patterns Mining in Big Spatial Data
    Garaeva, A.
    Makhmutova, F.
    Anikin, I.
    Sattler, Kai-Uwe
    PROCEEDINGS OF 2017 XX IEEE INTERNATIONAL CONFERENCE ON SOFT COMPUTING AND MEASUREMENTS (SCM), 2017, : 477 - 480
  • [32] Mining maximal sub-prevalent co-location patterns
    Lizhen Wang
    Xuguang Bao
    Lihua Zhou
    Hongmei Chen
    World Wide Web, 2019, 22 : 1971 - 1997
  • [33] A Framework for Mining Spatial High Utility Co-location Patterns
    Yang, Shisheng
    Wang, Lizhen
    Bao, Xuguang
    Lu, Junli
    2015 12TH INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY (FSKD), 2015, : 595 - 601
  • [34] Mining maximal sub-prevalent co-location patterns
    Wang, Lizhen
    Bao, Xuguang
    Zhou, Lihua
    Chen, Hongmei
    WORLD WIDE WEB-INTERNET AND WEB INFORMATION SYSTEMS, 2019, 22 (05): : 1971 - 1997
  • [35] Enumeration of maximal clique for mining spatial co-location patterns
    Al-Naymat, Ghazi
    2008 IEEE/ACS INTERNATIONAL CONFERENCE ON COMPUTER SYSTEMS AND APPLICATIONS, VOLS 1-3, 2008, : 126 - 133
  • [36] Local Co-location Pattern Mining Based on Regional Embedding
    Zeng, Yumming
    Wang, Lizhen
    Zhou, Lihua
    Chen, Hongmei
    SPATIAL DATA AND INTELLIGENCE, SPATIALDI 2024, 2024, 14619 : 108 - 119
  • [37] On the relationships between clustering and spatial co-location pattern mining
    Huang, Yan
    Zhang, Pusheng
    Zhang, Chengyang
    INTERNATIONAL JOURNAL ON ARTIFICIAL INTELLIGENCE TOOLS, 2008, 17 (01) : 55 - 70
  • [38] Efficient spatial co-location pattern mining on multiple GPUs
    Andrzejewski, W.
    Boinski, P.
    EXPERT SYSTEMS WITH APPLICATIONS, 2018, 93 : 465 - 483
  • [39] A clique-based approach for co-location pattern mining
    Bao, Xuguang
    Wang, Lizhen
    INFORMATION SCIENCES, 2019, 490 : 244 - 264
  • [40] METHODS FOR MINING CO-LOCATION PATTERNS WITH EXTENDED SPATIAL OBJECTS
    Bembenik, Robert
    Jozwicki, Wiktor
    Protaziuk, Grzegorz
    INTERNATIONAL JOURNAL OF APPLIED MATHEMATICS AND COMPUTER SCIENCE, 2017, 27 (04) : 681 - 695