PFCA: An influence-based parallel fuzzy clustering algorithm for large complex networks

被引:1
|
作者
Bhatia, Vandana [1 ]
Rani, Rinkle [1 ]
机构
[1] Thapar Univ, Dept Comp Sci & Engn, Patiala 147004, Punjab, India
关键词
big data; complex networks; fuzzy clustering; PageRank; Pregel; COMMUNITY STRUCTURE; C-MEANS; MODULARITY; MAPREDUCE;
D O I
10.1111/exsy.12295
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Clustering helps in understanding the patterns present in networks and thus helps in getting useful insights. In real-world complex networks, analysing the structure of the network plays a vital role in clustering. Most of the existing clustering algorithms identify disjoint clusters, which do not consider the structure of the network. Moreover, the clustering results do not provide consistency and precision. This paper presents an efficient parallel fuzzy clustering algorithm named "PFCA" for large complex networks using Hadoop and Pregel (parallel processing framework for large graphs). The proposed algorithm first selects the candidate cluster heads on the basis of their influence in the network and then determines the number of clusters by analysing the graph structure using PageRank algorithm. The proposed algorithm identifies both disjoint and fuzzy clusters efficiently and finds membership of only those vertices, which are the part of more than one cluster. The performance is validated on 6 real-life networks having up to billions of connections. The experimental results show that the proposed algorithm scales up linearly with the increase in size of network. It is also shown that the proposed algorithm is efficient and has high precision in comparison with the other state-of-art fuzzy clustering algorithms in terms of F score and modularity.
引用
收藏
页数:18
相关论文
共 50 条
  • [1] Fast Influence-based Coarsening for Large Networks
    Purohit, Manish
    Prakash, B. Aditya
    Kang, Chanhyun
    Zhang, Yao
    Subrahmanian, V. S.
    PROCEEDINGS OF THE 20TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING (KDD'14), 2014, : 1296 - 1305
  • [2] Asymmetric influence-based superposed random walk link prediction algorithm in complex networks
    Liu, Shihu
    Feng, Xueli
    Yang, Jin
    INTERNATIONAL JOURNAL OF MODERN PHYSICS C, 2024,
  • [3] POFCM: A Parallel Fuzzy Clustering Algorithm for Large Datasets
    Perez-Ortega, Joaquin
    Rey-Figueroa, Cesar David
    Roblero-Aguilar, Sandra Silvia
    Almanza-Ortega, Nelva Nely
    Zavala-Diaz, Crispin
    Garcia-Paredes, Salomon
    Landero-Najera, Vanesa
    MATHEMATICS, 2023, 11 (08)
  • [4] Adaptive Clustering Algorithm of Complex Network Based on Fuzzy Neural Networks
    Zhang, Zhixun
    Wang, Juan
    Xu, Yanqiang
    Han, Wei
    MOBILE INFORMATION SYSTEMS, 2022, 2022
  • [5] Complex Networks Clustering Algorithm Based On the Core Influence of the Nodes
    Tong, Chao
    Niu, Jianwei
    Dai, Bin
    Peng, Jing
    Fan, Jinyang
    2012 IEEE 31ST INTERNATIONAL PERFORMANCE COMPUTING AND COMMUNICATIONS CONFERENCE (IPCCC), 2012, : 185 - 186
  • [6] A parallel fuzzy clustering algorithm for large graphs using Pregel
    Bhatia, Vandana
    Rani, Rinkle
    EXPERT SYSTEMS WITH APPLICATIONS, 2017, 78 : 135 - 144
  • [7] A Novel Complex Networks Clustering Algorithm Based on the Core Influence of Nodes
    Tong, Chao
    Niu, Jianwei
    Dai, Bin
    Xie, Zhongyu
    SCIENTIFIC WORLD JOURNAL, 2014,
  • [8] A Parallel Local Search Algorithm for Clustering Large Biological Networks
    Coccimiglio G.
    Choudhury S.
    1600, World Scientific (27): : 3 - 4
  • [9] Escape velocity centrality: escape influence-based key nodes identification in complex networks
    Ullah, Aman
    Wang, Bin
    Sheng, JinFang
    Khan, Nasrullah
    APPLIED INTELLIGENCE, 2022, 52 (14) : 16586 - 16604
  • [10] Escape velocity centrality: escape influence-based key nodes identification in complex networks
    Aman Ullah
    Bin Wang
    JinFang Sheng
    Nasrullah Khan
    Applied Intelligence, 2022, 52 : 16586 - 16604