AN ALTERNATIVE PARAMETER FREE CLUSTERING ALGORITHM USING DATA POINT POSITIONING ANALYSIS (DPPA) - COMPARISON WITH DBSCAN

被引:0
|
作者
Mustapha, S. M. F. D. Syed [1 ]
机构
[1] Zayed Univ, Coll Technol Innovat, POB 19282, Dubai, U Arab Emirates
关键词
Clustering algorithm; Unsupervised learning; Parameter free clustering al-gorithm; DBSCAN;
D O I
10.24507/ijicic.19.06.1805
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
DBSCAN is one of the most popular clustering algorithms that could handle clusters which have characteristics of arbitrary shape, multiple densities and noises. How-ever, its accuracy depends on the right selection of the two parameters, MinPts and Eps. There have been numerous research works to overcome this issue by developing parameter free clustering algorithm. We propose a clustering algorithm which uses Data Point Po-sitioning Analysis (DPPA) to analyze the relationship of each point to all points based on two nearest neighbor concepts, namely 1-NN and Max-NN. The algorithm is applied on 13 benchmark datasets that have been applied in many clustering algorithms with three-dimensional data and subsequently on higher dimensional data with sixteen attributes. The performance of the algorithm is visually compared with the three-dimensional graph plotting at various angles to determine the actual number of clusters. For the higher dimensional data, Silhouette coefficient is used to measure the performance. For both ex-perimental results, the DPPA algorithm is compared against DBSCAN. The results show that the DPPA algorithm is comparable to the performance of DBSCAN algorithm such that it manages to detect arbitrary cluster shapes, identify the number of clusters and manage the data sets with noises.
引用
收藏
页码:1805 / 1825
页数:21
相关论文
共 50 条
  • [11] Web-Based clustering application using Shiny framework and DBSCAN algorithm for hotspots data in peatland in Sumatra
    Hermawati, Rachma
    Sitanggang, Imas Sukaesih
    2ND INTERNATIONAL SYMPOSIUM ON LAPAN-IPB SATELLITE (LISAT) FOR FOOD SECURITY AND ENVIRONMENTAL MONITORING, 2016, 33 : 317 - 323
  • [12] Nonlinear Data Analysis Using a New Hybrid Data Clustering Algorithm
    Wattanachon, Ureerat
    Suksawatchon, Jakkarin
    Lursinsap, Chidchanok
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PROCEEDINGS, 2009, 5476 : 160 - +
  • [13] Comparative Analysis of Graph Clustering Algorithm using Bloggers Data
    Dehariya, Yogendra Kumar
    Biswas, Bhaskar
    Singh, Ravi Shankar
    PROCEEDINGS OF THE 2014 INTERNATIONAL CONFERENCE ON ISSUES AND CHALLENGES IN INTELLIGENT COMPUTING TECHNIQUES (ICICT), 2014, : 24 - 28
  • [14] Data Mining Using Clustering Algorithm as Tool for Poverty Analysis
    Talingdan, Janelyn A.
    2019 8TH INTERNATIONAL CONFERENCE ON SOFTWARE AND COMPUTER APPLICATIONS (ICSCA 2019), 2019, : 56 - 59
  • [15] OCEAN: A Non-Conventional Parameter Free Clustering Algorithm Using Relative Densities of Categories
    Gheyas, Iffat
    Parkinson, Simon
    Khan, Saad
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2021, 35 (05)
  • [16] An automated approach for wood-leaf separation from terrestrial LIDAR point clouds using the density based clustering algorithm DBSCAN
    Ferrara, Roberto
    Virdis, Salvatore G. P.
    Ventura, Andrea
    Ghisu, Tiziano
    Duce, Pierpaolo
    Pellizzaro, Grazia
    AGRICULTURAL AND FOREST METEOROLOGY, 2018, 262 : 434 - 444
  • [17] Performance Analysis and Architecture of a Clustering Hybrid Algorithm Called FA plus GA-DBSCAN Using Artificial Datasets
    Carlos Perafan-Lopez, Juan
    Lucia Ferrer-Gregory, Valeria
    Nieto-Londono, Cesar
    Sierra-Perez, Julian
    ENTROPY, 2022, 24 (07)
  • [18] Trip end identification based on spatial-temporal clustering algorithm using smartphone positioning data
    Yao, Zhenxing
    Yang, Fei
    Guo, Yudong
    Jin, Peter Jing
    Li, Yan
    EXPERT SYSTEMS WITH APPLICATIONS, 2022, 197
  • [19] Algorithm for Clustering Analysis of Gene Expression Data using MapReduce Framework
    Priya, P. Packia Amutha
    Lawrance, R.
    2016 INTERNATIONAL CONFERENCE ON COMPUTING TECHNOLOGIES AND INTELLIGENT DATA ENGINEERING (ICCTIDE'16), 2016,
  • [20] Landslide susceptibility mapping using the uncertain and parameter free density-based clustering (UPFDBCAN) algorithm
    Mwakapesa, Deborah Simon
    Lan, Xiaoji
    Mao, Yimin
    Nanehkaran, Yaser Ahangari
    Zhang, Maosheng
    INTERNATIONAL JOURNAL OF EARTH SCIENCES, 2024, 113 (02) : 335 - 351