Efficient and Reliable Clustering by Parallel Random Swap Algorithm

Cited by: 2
Authors
Nigro, Libero [1]
Cicirelli, Franco [2]
Franti, Pasi [3]
Affiliations
[1] Univ Calabria, DIMES Dept Informat Modelling Elect & Syst Sci, I-87036 Arcavacata Di Rende, Italy
[2] Natl Res Council Italy, CNR, Inst High Performance Comp & Networking ICAR, I-87036 Arcavacata Di Rende, Italy
[3] Univ Eastern Finland, Sch Comp, Machine Learning Grp, POB 111, Joensuu 80101, Finland
Keywords
Clustering problem; K-Means; Random swap; Parallelism; Streams; Lambda Expressions; Java
DOI
10.1109/DS-RT55542.2022.9932090
Chinese Library Classification number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
Solving large-scale clustering problems requires an efficient algorithm that can also be implemented in parallel. K-means would be suitable, but it can produce an inaccurate clustering result. To overcome this problem, we present a parallel version of the random swap clustering algorithm, which combines the scalability of k-means with high clustering accuracy. The new method is implemented and evaluated on top of Java parallel streams and lambda expressions, which offer execution-time benefits. The method is applied to standard benchmark datasets that vary in population size and distribution of the managed records, in the dimensionality of the data points, and in the number of clusters. The experimental results confirm that parallel random swap delivers high-quality clustering together with high time efficiency.
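The idea summarized in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes the commonly described form of random swap (replace one randomly chosen centroid with a randomly chosen data point, fine-tune with a few k-means iterations, and keep the result only if the sum of squared errors decreases), and it uses Java parallel streams only for the nearest-centroid search. The class name `RandomSwapSketch`, all method names, and the choice of two k-means iterations per swap are hypothetical choices made for this illustration.

```java
import java.util.Arrays;
import java.util.Random;

// Illustrative sketch only; names and parameters are assumptions, not the paper's code.
class RandomSwapSketch {

    // Squared Euclidean distance between two points.
    static double dist2(double[] a, double[] b) {
        double s = 0.0;
        for (int i = 0; i < a.length; i++) {
            double d = a[i] - b[i];
            s += d * d;
        }
        return s;
    }

    // Index of the centroid nearest to point p.
    static int nearest(double[] p, double[][] c) {
        int best = 0;
        double bestD = dist2(p, c[0]);
        for (int j = 1; j < c.length; j++) {
            double d = dist2(p, c[j]);
            if (d < bestD) { bestD = d; best = j; }
        }
        return best;
    }

    // Sum of squared errors, evaluated with a parallel stream over the points.
    static double sse(double[][] x, double[][] c) {
        return Arrays.stream(x).parallel()
                .mapToDouble(p -> dist2(p, c[nearest(p, c)]))
                .sum();
    }

    // One k-means iteration: parallel assignment, then sequential centroid update.
    static double[][] kMeansStep(double[][] x, double[][] c) {
        int[] label = Arrays.stream(x).parallel()
                .mapToInt(p -> nearest(p, c))
                .toArray(); // toArray keeps the encounter order of the points
        int k = c.length, d = x[0].length;
        double[][] sum = new double[k][d];
        int[] count = new int[k];
        for (int i = 0; i < x.length; i++) {
            count[label[i]]++;
            for (int t = 0; t < d; t++) sum[label[i]][t] += x[i][t];
        }
        double[][] next = new double[k][];
        for (int j = 0; j < k; j++) {
            if (count[j] == 0) { next[j] = c[j].clone(); continue; } // empty cluster: keep old centroid
            next[j] = sum[j];
            for (int t = 0; t < d; t++) next[j][t] /= count[j];
        }
        return next;
    }

    // Random swap: try `swaps` trial swaps and accept only those that lower the SSE.
    static double[][] randomSwap(double[][] x, int k, int swaps, long seed) {
        Random rnd = new Random(seed);
        double[][] best = new double[k][];
        for (int j = 0; j < k; j++) best[j] = x[rnd.nextInt(x.length)].clone();
        double bestSse = sse(x, best);
        for (int s = 0; s < swaps; s++) {
            double[][] trial = new double[k][];
            for (int j = 0; j < k; j++) trial[j] = best[j].clone();
            trial[rnd.nextInt(k)] = x[rnd.nextInt(x.length)].clone(); // the swap
            for (int it = 0; it < 2; it++) trial = kMeansStep(x, trial); // local fine-tuning
            double trialSse = sse(x, trial);
            if (trialSse < bestSse) { best = trial; bestSse = trialSse; } // otherwise discard the trial
        }
        return best;
    }
}
```

A call such as `RandomSwapSketch.randomSwap(points, 2, 30, 42L)` returns the accepted centroids. Because a trial is kept only when it lowers the SSE, the final solution is never worse than the random initial one.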
Pages: 4
Related papers
50 in total
  • [21] An efficient algorithm for basket default swap valuation
    Chiang, Mi-Hsiu
    Yueh, Meng-Lan
    Hsieh, Ming-Hua
    JOURNAL OF DERIVATIVES, 2007, 15 (02): : 8 - 19
  • [22] Efficient parallel hierarchical clustering
    Dash, M
    Petrutiu, S
    Scheuermann, P
    EURO-PAR 2004 PARALLEL PROCESSING, PROCEEDINGS, 2004, 3149 : 363 - 371
  • [23] A parallel genetic algorithm for clustering
    Kivijärvi, J
    Lehtinen, J
    Nevalainen, IS
    Recent Advances in Simulated Evolution and Learning, 2004, 2 : 41 - 60
  • [24] A PARALLEL ALGORITHM FOR RECORD CLUSTERING
    OMIECINSKI, E
    SCHEUERMANN, P
    ACM TRANSACTIONS ON DATABASE SYSTEMS, 1990, 15 (04): : 599 - 624
  • [25] A Parallel Clustering Algorithm for Placement
    Momeni, Amir
    Mistry, Perhaad
    Kaeli, David
    PROCEEDINGS OF THE FIFTEENTH INTERNATIONAL SYMPOSIUM ON QUALITY ELECTRONIC DESIGN (ISQED 2014), 2015, : 349 - 356
  • [26] An efficient clustering algorithm
    Zhang, YF
    Mao, JL
    Xiong, ZY
    2003 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-5, PROCEEDINGS, 2003, : 261 - 265
  • [27] EFFICIENT CLUSTERING ALGORITHM
    BHAT, MV
    HAUPT, A
    IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS, 1976, 6 (01): : 61 - 64
  • [28] An efficient clustering algorithm
    Jiang, SY
    Xu, YM
    PROCEEDINGS OF THE 2004 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2004, : 1513 - 1518
  • [29] A Reliable and Efficient Clustering Algorithm for Wireless Sensor Networks Using Fuzzy Petri Nets
    Fu, Xiao
    Yu, Zhenhua
    2010 6TH INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS NETWORKING AND MOBILE COMPUTING (WICOM), 2010
  • [30] Efficient parallel implementation of a density peaks clustering algorithm on graphics processing unit
    Ge, Ke-shi
    Su, Hua-you
    Li, Dong-sheng
    Lu, Xi-cheng
    FRONTIERS OF INFORMATION TECHNOLOGY & ELECTRONIC ENGINEERING, 2017, 18 (07) : 915 - 927