MPRK Algorithm for Clustering the Large Text Datasets

被引:0
|
作者
Thangarasu, M. [1 ]
Inbarani, H. Hannah [1 ]
机构
[1] Periyar Univ, Dept Comp Sci, Salem, India
来源
2016 IEEE INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTER APPLICATIONS (ICACA) | 2016年
关键词
Clustering; Text document; Parallel Technique; Rough K-Means; Time complexity; PARALLEL;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Text Document clustering is changing the massive collections of text documents into a lesser amount of suitable clusters. While numerous clustering approaches have been projected in the last few decades, the partitioned clustering algorithms are stated performing well on document clustering based on the reviewed papers. In this research, Modified Parallel Rough K-means (MPRK) algorithm is proposed for clustering the text document and it is evaluated on datasets and the results are compared to benchmark algorithms K-means and DPPSOK-means. The experimental analysis shows the proposed algorithm produces efficient result compared to the existing algorithms.
引用
收藏
页码:224 / 229
页数:6
相关论文
共 50 条
  • [41] Clustering Algorithm for Multi-density Datasets
    Fahim, Ahmed
    ROMANIAN JOURNAL OF INFORMATION SCIENCE AND TECHNOLOGY, 2019, 22 (3-4): : 244 - 258
  • [42] A multidisciplinary ensemble algorithm for clustering heterogeneous datasets
    Bryar A. Hassan
    Tarik A. Rashid
    Neural Computing and Applications, 2021, 33 : 10987 - 11010
  • [43] HIREL: An Incremental Clustering Algorithm for Relational Datasets
    Li, Tao
    Anand, Sarabjot S.
    ICDM 2008: EIGHTH IEEE INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS, 2008, : 887 - 892
  • [44] Clustering large datasets in arbitrary metric spaces
    Ganti, V
    Ramakrishnan, R
    Gehrke, J
    Powell, A
    French, J
    15TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING, PROCEEDINGS, 1999, : 502 - 511
  • [45] Clustering algorithms optimizer:A framework for large datasets
    Varshavsky, Roy
    Horn, David
    Linial, Michal
    BIOINFORMATICS RESEARCH AND APPLICATIONS, PROCEEDINGS, 2007, 4463 : 85 - +
  • [46] Classification and Analysis of Clustering Algorithms for Large Datasets
    Badase, P. S.
    Deshbhratar, G. P.
    Bhagat, A. P.
    2015 INTERNATIONAL CONFERENCE ON INNOVATIONS IN INFORMATION, EMBEDDED AND COMMUNICATION SYSTEMS (ICIIECS), 2015,
  • [47] Improved fast partitional clustering algorithm for text clustering
    Bejos, Sebastian
    Feliciano-Avelino, Ivan
    Martinez-Trinidad, J. Fco.
    Carrasco-Ochoa, J. A.
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2020, 39 (02) : 2137 - 2145
  • [48] Text clustering based on kernel KNN clustering algorithm
    Xiong, Hao
    Sun, Sheng
    Feng, Yunfang
    International Journal of Applied Mathematics and Statistics, 2013, 46 (16): : 69 - 75
  • [49] AnyDBC: An Efficient Anytime Density-based Clustering Algorithm for Very Large Complex Datasets
    Mai, Son T.
    Assent, Ira
    Storgaard, Martin
    KDD'16: PROCEEDINGS OF THE 22ND ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2016, : 1025 - 1034
  • [50] Parallel WaveCluster: A linear scaling parallel clustering algorithm implementation with application to very large datasets
    Yildirim, Ahmet Artu
    Ozdogan, Cem
    JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2011, 71 (07) : 955 - 962