MPRK Algorithm for Clustering the Large Text Datasets

Authors
Thangarasu, M. [1]
Inbarani, H. Hannah [1]
Affiliation
[1] Periyar Univ, Dept Comp Sci, Salem, India
Keywords
Clustering; Text document; Parallel Technique; Rough K-Means; Time complexity; PARALLEL;
DOI
Not available
Chinese Library Classification number
TP301 [Theory, Methods]
Discipline classification code
081202
Abstract
Text document clustering partitions massive collections of text documents into a smaller number of meaningful clusters. Although numerous clustering approaches have been proposed over the last few decades, the reviewed literature reports that partitional clustering algorithms perform well on document clustering. In this research, a Modified Parallel Rough K-means (MPRK) algorithm is proposed for clustering text documents. It is evaluated on datasets, and the results are compared with those of the benchmark algorithms K-means and DPPSOK-means. The experimental analysis shows that the proposed algorithm produces efficient results compared with the existing algorithms.
Pages: 224-229
Number of pages: 6