Implementing a Platform to Run Clustering Algorithms Using Distributed Computing

被引:0
|
作者
Borlea, Ioan-Daniel [1 ]
Iercan, Daniel [1 ]
Precup, Radu-Emil [1 ]
Dragan, Florin [1 ]
Borlea, Alexandra-Bianca [1 ]
机构
[1] Politehn Univ Timisoara, AAI Dept, Timisoara, Romania
关键词
clustering algorithms; distributed computing; partitions; platform; FUZZY;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Most of the clustering algorithms are designed to work as a sequential algorithm that requires all data to be present, which limits the actual implementation to run on a single machine and does not support horizontal scalability. This is problematic in today's context when volume of data gets larger each day and the need to process data quickly is essential. Hence, in this paper we propose a platform that allows running clustering algorithms in a distributed manner. This is achieved through splitting the data into smaller and equal partitions, and through redesigning the original clustering algorithms to allow working on a sub-set of the input data without having to interact with the processing of the rest of the input data. At the end the so-called reduce phase aggregates the partial results obtained from processing each partition and it produces the global result.
引用
收藏
页码:217 / 222
页数:6
相关论文
共 50 条
  • [31] Identification of Straggler Node in Distributed Environment Using Soft Computing Algorithms
    Gaykar, Reshma S.
    Nalini, C.
    Joshi, S. D.
    2021 INTERNATIONAL CONFERENCE ON EMERGING SMART COMPUTING AND INFORMATICS (ESCI), 2021, : 1 - 5
  • [32] Framework for task scheduling in heterogeneous distributed computing using genetic algorithms
    Page, AJ
    Naughton, T
    ARTIFICIAL INTELLIGENCE REVIEW, 2005, 24 (3-4) : 415 - 429
  • [33] Framework for Task Scheduling in Heterogeneous Distributed Computing Using Genetic Algorithms
    Andrew J. Page
    Thomas J. Naughton
    Artificial Intelligence Review, 2005, 24 : 415 - 429
  • [34] Using distributed computing platform to solve high computing and huge data processing problems in bioinformatics
    Chen, SN
    Tsai, JJR
    Huang, CW
    Chen, RM
    Lin, RCK
    BIBE 2004: FOURTH IEEE SYMPOSIUM ON BIOINFORMATICS AND BIOENGINEERING, PROCEEDINGS, 2004, : 142 - 148
  • [35] ATLAS Distributed Computing in LHC Run2
    Campana, Simone
    21ST INTERNATIONAL CONFERENCE ON COMPUTING IN HIGH ENERGY AND NUCLEAR PHYSICS (CHEP2015), PARTS 1-9, 2015, 664
  • [36] Towards Developing Fuzzy Neighborhood Based Clustering Algorithms for High Performance Distributed Memory Computing Environments
    Atilgan, Can
    Tezel, Baris Tekin
    Nasibov, Efendi
    2018 3RD INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND ENGINEERING (UBMK), 2018, : 367 - 371
  • [37] Churn Prediction analysis Using Various Clustering Algorithms in KNIME Analytics Platform
    Franciska, I.
    Swaminathan, B.
    2017 IEEE 3RD INTERNATIONAL CONFERENCE ON SENSING, SIGNAL PROCESSING AND SECURITY (ICSSS), 2017, : 166 - 170
  • [38] Towards Graph Clustering for Distributed Computing Environments
    Szufel, Przemyslaw
    MODELLING AND MINING NETWORKS, WAW 2024, 2024, 14671 : 146 - 158
  • [39] IMPLEMENTING LINEAR ALGEBRA ALGORITHMS ON A MEIKO COMPUTING SURFACE
    HOFFMANN, W
    POTMA, K
    APPLIED NUMERICAL MATHEMATICS, 1991, 8 (02) : 127 - 148
  • [40] CLUSTERING THE PREVALENCE OF PEDIATRIC CHRONIC CONDITIONS IN THE UNITED STATES USING DISTRIBUTED COMPUTING
    Zheng, Yuchen
    Serban, Nicoleta
    ANNALS OF APPLIED STATISTICS, 2018, 12 (02): : 915 - 939