Clustering Algorithms for Spatial Big Data

被引:5
|
作者
Schoier, Gabriella [1 ]
Gregorio, Caterina [1 ]
机构
[1] Univ Trieste, Dept Econ Business Math & Stat Sci Bruno de Finet, DEAMS, Tigor 22, I-34100 Trieste, Italy
关键词
Spatial data mining; Clustering algorithms; DBSCAN; FSDP; K-Means; Arbitrary shape of clusters; Handling noise; Image analysis;
D O I
10.1007/978-3-319-62401-3_41
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
In our time people and devices constantly generate data. User activity generates data about needs and preferences as well as the quality of their experiences in different ways: i. e. streaming a video, looking at the news, searching for a restaurant or a an hotel, playing a game with others, making purchases, driving a car. Even when people put their devices in their pockets, the network is generating location and other data that keeps services running and ready to use. This rapid developments in the availability and access to data and in particular spatially referenced data in a different areas, has induced the need for better analysis techniques to understand the various phenomena. Spatial clustering algorithms, which groups similar spatial objects into classes, can be used for the identification of areas sharing common characteristics. The aim of this paper is to analyze the performance of three different clustering algorithms i. e. the Density-Based Spatial Clustering of Applications with Noise algorithm (DBSCAN), the Fast Search by Density Peak (FSDP) algorithm and the classic K-means algorithm (K-Means) as regards the analysis of spatial big data. We propose a modification of the FSDP algorithm in order to improve its efficiency in large databases. The applications concern both synthetic data sets and satellite images.
引用
收藏
页码:571 / 583
页数:13
相关论文
共 50 条
  • [21] Research on network security defence based on big data clustering algorithms
    Zhao J.
    International Journal of Information and Computer Security, 2021, 15 (04) : 343 - 356
  • [22] A Review on Density-Based Clustering Algorithms for Big Data Analysis
    Reddy, K. Shyam Sunder
    Bindu, C. Shoba
    2017 INTERNATIONAL CONFERENCE ON I-SMAC (IOT IN SOCIAL, MOBILE, ANALYTICS AND CLOUD) (I-SMAC), 2017, : 123 - 130
  • [23] Big Mobility Data Analytics: Algorithms and Techniques for Efficient Trajectory Clustering
    Tampakis, Panagiotis
    2020 21ST IEEE INTERNATIONAL CONFERENCE ON MOBILE DATA MANAGEMENT (MDM 2020), 2020, : 244 - 245
  • [24] Time and memory scalable algorithms for clustering tendency assessment of big data
    Deshpande, Kartik Vishal
    Kumar, Dheeraj
    INFORMATION SCIENCES, 2024, 664
  • [25] Clustering Enabled Wireless Channel Modeling Using Big Data Algorithms
    He, Ruisi
    Ai, Bo
    Molisch, Andreas F.
    Stuber, Gordon L.
    Li, Qingyong
    Zhong, Zhangdui
    Yu, Jian
    IEEE COMMUNICATIONS MAGAZINE, 2018, 56 (05) : 177 - 183
  • [26] Knowledge extraction from maritime spatiotemporal data: An evaluation of clustering algorithms on Big Data
    Spiliopoulos, Giannis
    Chatzikokolakis, Konstantinos
    Zissis, Dimitrios
    Biliri, Evmorfia
    Papaspyros, Dimitrios
    Tsapelas, Giannis
    Mouzakitis, Spyros
    2017 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2017, : 1682 - 1687
  • [27] Storing and Clustering Large Spatial Datasets Using Big Data Technologies
    Cortinas, Alejandro
    Luaces, Miguel R.
    Rodeiro, Tirso V.
    WEB AND WIRELESS GEOGRAPHICAL INFORMATION SYSTEMS, W2GIS 2018, 2018, 10819 : 15 - 24
  • [28] Using Parallel Hierarchical Clustering to Address Spatial Big Data Challenges
    Woodley, Alan
    Tang, Ling-Xiang
    Geva, Shlomo
    Nayak, Richi
    Chappell, Timothy
    2016 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2016, : 2692 - 2698
  • [29] Advances in Meta-Heuristic Optimization Algorithms in Big Data Text Clustering
    Abualigah, Laith
    Gandomi, Amir H.
    Elaziz, Mohamed Abd
    Hamad, Husam Al
    Omari, Mahmoud
    Alshinwan, Mohammad
    Khasawneh, Ahmad M.
    ELECTRONICS, 2021, 10 (02) : 1 - 29
  • [30] A Performance Comparison of Big Data Processing Platform Based on Parallel Clustering Algorithms
    Hai, Mo
    Zhang, Yuejing
    Li, Haifeng
    6TH INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY AND QUANTITATIVE MANAGEMENT, 2018, 139 : 127 - 135