Clustering Algorithms for Spatial Big Data

被引：5

作者：

Schoier, Gabriella ^{[1
]}

Gregorio, Caterina ^{[1
]}

机构：

[1] Univ Trieste, Dept Econ Business Math & Stat Sci Bruno de Finet, DEAMS, Tigor 22, I-34100 Trieste, Italy

来源：

COMPUTATIONAL SCIENCE AND ITS APPLICATIONS - ICCSA 2017, PT IV | 2017年 / 10407卷

关键词：

Spatial data mining; Clustering algorithms; DBSCAN; FSDP; K-Means; Arbitrary shape of clusters; Handling noise; Image analysis;

D O I：

10.1007/978-3-319-62401-3_41

中图分类号：

TP39 [计算机的应用];

学科分类号：

081203 ; 0835 ;

摘要：

In our time people and devices constantly generate data. User activity generates data about needs and preferences as well as the quality of their experiences in different ways: i. e. streaming a video, looking at the news, searching for a restaurant or a an hotel, playing a game with others, making purchases, driving a car. Even when people put their devices in their pockets, the network is generating location and other data that keeps services running and ready to use. This rapid developments in the availability and access to data and in particular spatially referenced data in a different areas, has induced the need for better analysis techniques to understand the various phenomena. Spatial clustering algorithms, which groups similar spatial objects into classes, can be used for the identification of areas sharing common characteristics. The aim of this paper is to analyze the performance of three different clustering algorithms i. e. the Density-Based Spatial Clustering of Applications with Noise algorithm (DBSCAN), the Fast Search by Density Peak (FSDP) algorithm and the classic K-means algorithm (K-Means) as regards the analysis of spatial big data. We propose a modification of the FSDP algorithm in order to improve its efficiency in large databases. The applications concern both synthetic data sets and satellite images.

引用

页码：571 / 583

页数：13

共 50 条

[31] Fuzzy Based Clustering Algorithms to Handle Big Data with Implementation on Apache Spark
Bharill, Neha
Tiwari, Aruna
Malviya, Aayushi
PROCEEDINGS 2016 IEEE SECOND INTERNATIONAL CONFERENCE ON BIG DATA COMPUTING SERVICE AND APPLICATIONS (BIGDATASERVICE 2016), 2016, : 95 - 104
[32] A Quantitative Analysis of Big Data Clustering Algorithms for Market Segmentation in Hospitality Industry
Bose, Avishek
Munir, Arslan
Shabani, Neda
2020 IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS (ICCE), 2020, : 554 - 559
[33] Algorithms for Big Data
Meyer, Ulrich
Abedjan, Ziawasch
IT-INFORMATION TECHNOLOGY, 2020, 62 (3-4): : 117 - 118
[34] Clustering Algorithms for Wireless Sensor Networks Using Spatial Data Correlation
Zhang, Chongqing
Wang, Binguo
Fang, Sheng
Li, Zhe
2008 INTERNATIONAL CONFERENCE ON INFORMATION AND AUTOMATION, VOLS 1-4, 2008, : 53 - 58
[35] Big Data Clustering: A Review
Shirkhorshidi, Ali Seyed
Aghabozorgi, Saeed
Teh, Ying Wah
Herawan, Tutut
COMPUTATIONAL SCIENCE AND ITS APPLICATIONS - ICCSA 2014, PT V, 2014, 8583 : 707 - 720
[36] MapReduce Clustering for Big Data
Ghattas, Badih
Pinto, Antoine
Diao, Sambou
2021 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2021, : 5116 - 5124
[37] Strategies for Big Data Clustering
Kurasova, Olga
Marcinkevicius, Virginijus
Medvedev, Viktor
Rapecka, Aurimas
Stefanovic, Pavel
2014 IEEE 26TH INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI), 2014, : 740 - 747
[38] Consensus Clustering on Big Data
Liu, Hongfu
Cheng, Gong
Wu, Junjie
2015 12TH INTERNATIONAL CONFERENCE ON SERVICE SYSTEMS AND SERVICE MANAGEMENT (ICSSSM), 2015,
[39] Big Data clustering validity
Tlili, Monia
Hamdani, Tarek M.
2014 6TH INTERNATIONAL CONFERENCE OF SOFT COMPUTING AND PATTERN RECOGNITION (SOCPAR), 2014, : 348 - 352
[40] Balancing effort and benefit of K-means clustering algorithms in Big Data realms
Perez-Ortega, Joaquin
Nely Almanza-Ortega, Nelva
Romero, David
PLOS ONE, 2018, 13 (09):

← 1 2 3 4 5 →