Clustering Algorithms for Spatial Big Data

被引:5
|
作者
Schoier, Gabriella [1 ]
Gregorio, Caterina [1 ]
机构
[1] Univ Trieste, Dept Econ Business Math & Stat Sci Bruno de Finet, DEAMS, Tigor 22, I-34100 Trieste, Italy
关键词
Spatial data mining; Clustering algorithms; DBSCAN; FSDP; K-Means; Arbitrary shape of clusters; Handling noise; Image analysis;
D O I
10.1007/978-3-319-62401-3_41
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
In our time people and devices constantly generate data. User activity generates data about needs and preferences as well as the quality of their experiences in different ways: i. e. streaming a video, looking at the news, searching for a restaurant or a an hotel, playing a game with others, making purchases, driving a car. Even when people put their devices in their pockets, the network is generating location and other data that keeps services running and ready to use. This rapid developments in the availability and access to data and in particular spatially referenced data in a different areas, has induced the need for better analysis techniques to understand the various phenomena. Spatial clustering algorithms, which groups similar spatial objects into classes, can be used for the identification of areas sharing common characteristics. The aim of this paper is to analyze the performance of three different clustering algorithms i. e. the Density-Based Spatial Clustering of Applications with Noise algorithm (DBSCAN), the Fast Search by Density Peak (FSDP) algorithm and the classic K-means algorithm (K-Means) as regards the analysis of spatial big data. We propose a modification of the FSDP algorithm in order to improve its efficiency in large databases. The applications concern both synthetic data sets and satellite images.
引用
收藏
页码:571 / 583
页数:13
相关论文
共 50 条
  • [1] Big Data and Clustering Algorithms
    Ajin, V. W.
    Kumar, Lekshmy D.
    2016 INTERNATIONAL CONFERENCE ON RESEARCH ADVANCES IN INTEGRATED NAVIGATION SYSTEMS (RAINS), 2016,
  • [2] A Review of Clustering Algorithms for Big Data
    Djouzi, Kheyreddine
    Beghdad-Bey, Kadda
    2019 4TH INTERNATIONAL CONFERENCE ON NETWORKING AND ADVANCED SYSTEMS (ICNAS 2019), 2019, : 117 - 122
  • [3] On the Problem of Clustering Spatial Big Data
    Schoier, Gabriella
    Borruso, Giuseppe
    COMPUTATIONAL SCIENCE AND ITS APPLICATIONS - ICCSA 2015, PT III, 2015, 9157 : 688 - 697
  • [4] Iterative big data clustering algorithms: a review
    Mohebi, Amin
    Aghabozorgi, Saeed
    Teh Ying Wah
    Herawan, Tutut
    Yahyapour, Ramin
    SOFTWARE-PRACTICE & EXPERIENCE, 2016, 46 (01): : 107 - 129
  • [5] Analysis of Mahout Big Data Clustering Algorithms
    Sharma, Ishan
    Tiwari, Rajeev
    Rana, Hukam Singh
    Anand, Abhineet
    INTELLIGENT COMMUNICATION, CONTROL AND DEVICES, ICICCD 2017, 2018, 624 : 999 - 1008
  • [6] A survey on parallel clustering algorithms for Big Data
    Zineb Dafir
    Yasmine Lamari
    Said Chah Slaoui
    Artificial Intelligence Review, 2021, 54 : 2411 - 2443
  • [7] The research on clustering algorithms in big data analysis
    Liu, Weigang
    BASIC & CLINICAL PHARMACOLOGY & TOXICOLOGY, 2020, 127 : 75 - 75
  • [8] A survey on parallel clustering algorithms for Big Data
    Dafir, Zineb
    Lamari, Yasmine
    Slaoui, Said Chah
    ARTIFICIAL INTELLIGENCE REVIEW, 2021, 54 (04) : 2411 - 2443
  • [9] Scalable Clustering Algorithms for Big Data: A Review
    Mahdi, Mahmoud A.
    Hosny, Khalid M.
    Elhenawy, Ibrahim
    IEEE ACCESS, 2021, 9 : 80015 - 80027
  • [10] The Modeling and Simulation of Data Clustering Algorithms in Data Mining with Big Data
    Chen, Weiru
    Oliverio, Jared
    Kim, Jin Ho
    Shen, Jiayue
    JOURNAL OF INDUSTRIAL INTEGRATION AND MANAGEMENT-INNOVATION AND ENTREPRENEURSHIP, 2019, 4 (01):