Clustering Algorithms for Spatial Big Data

被引:5
|
作者
Schoier, Gabriella [1 ]
Gregorio, Caterina [1 ]
机构
[1] Univ Trieste, Dept Econ Business Math & Stat Sci Bruno de Finet, DEAMS, Tigor 22, I-34100 Trieste, Italy
关键词
Spatial data mining; Clustering algorithms; DBSCAN; FSDP; K-Means; Arbitrary shape of clusters; Handling noise; Image analysis;
D O I
10.1007/978-3-319-62401-3_41
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
In our time people and devices constantly generate data. User activity generates data about needs and preferences as well as the quality of their experiences in different ways: i. e. streaming a video, looking at the news, searching for a restaurant or a an hotel, playing a game with others, making purchases, driving a car. Even when people put their devices in their pockets, the network is generating location and other data that keeps services running and ready to use. This rapid developments in the availability and access to data and in particular spatially referenced data in a different areas, has induced the need for better analysis techniques to understand the various phenomena. Spatial clustering algorithms, which groups similar spatial objects into classes, can be used for the identification of areas sharing common characteristics. The aim of this paper is to analyze the performance of three different clustering algorithms i. e. the Density-Based Spatial Clustering of Applications with Noise algorithm (DBSCAN), the Fast Search by Density Peak (FSDP) algorithm and the classic K-means algorithm (K-Means) as regards the analysis of spatial big data. We propose a modification of the FSDP algorithm in order to improve its efficiency in large databases. The applications concern both synthetic data sets and satellite images.
引用
收藏
页码:571 / 583
页数:13
相关论文
共 50 条
  • [41] Big Data Landscapes: Improving the Visualization of Machine Learning-based Clustering Algorithms
    Kammer, Dietrich
    Keck, Mandy
    Gruender, Thomas
    Groh, Rainer
    AVI'18: PROCEEDINGS OF THE 2018 INTERNATIONAL CONFERENCE ON ADVANCED VISUAL INTERFACES, 2018,
  • [42] Spatial Clustering Algorithms and Quality Assessment
    Xi, Jingke
    FIRST IITA INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2009, : 105 - 108
  • [43] A Survey and Experimental Review on Data Distribution Strategies for Parallel Spatial Clustering Algorithms
    Challa, Jagat Sesh
    Goyal, Navneet
    Sharma, Amogh
    Sreekumar, Nikhil
    Balasubramaniam, Sundar
    Goyal, Poonam
    JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2024, 39 (03) : 610 - 636
  • [44] Algorithms for clustering clickstream data
    Antonellis, Panagiotis
    Makris, Christos
    Tsirakis, Nikos
    INFORMATION PROCESSING LETTERS, 2009, 109 (08) : 381 - 385
  • [45] High-Performance Computing based Scalable Online Fuzzy Clustering Algorithms for Big Data
    Jha, Preeti
    Tiwari, Aruna
    Bharill, Neha
    Ratnaparkhe, Milind
    Patel, Om Prakash
    Pulakitha, Rapolu
    Chauhan, Aditi
    2022 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (SSCI), 2022, : 1400 - 1407
  • [46] Density-based Algorithms for Big Data Clustering Using MapReduce Framework: A Comprehensive Study
    Khader, Mariam
    Al-Naymat, Ghazi
    ACM COMPUTING SURVEYS, 2020, 53 (05)
  • [47] Big data policing: The use of big data and algorithms by the Netherlands Police
    Schuilenburg, Marc
    Soudijn, Melvin
    POLICING-A JOURNAL OF POLICY AND PRACTICE, 2023, 17
  • [48] Iterative Unified Clustering in Big Data
    Misal, Vasundhara
    Janeja, Vandana P.
    Pallaprolu, Sai C.
    Yesha, Yelena
    Chintalapati, Raghu
    2016 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2016, : 3412 - 3421
  • [49] High Performance Big Data Clustering
    Agrawal, Ankit
    Patwary, Md. Mostofa Ali
    Hendrix, William
    Liao, Wei-keng
    Choudhary, Alok
    CLOUD COMPUTING AND BIG DATA, 2013, 23 : 192 - 211
  • [50] A Hybrid Approach to Clustering in Big Data
    Kumar, Dheeraj
    Bezdek, James C.
    Palaniswami, Marimuthu
    Rajasegarar, Sutharshan
    Leckie, Christopher
    Havens, Timothy Craig
    IEEE TRANSACTIONS ON CYBERNETICS, 2016, 46 (10) : 2372 - 2385