MPRK Algorithm for Clustering the Large Text Datasets

被引:0
|
作者
Thangarasu, M. [1 ]
Inbarani, H. Hannah [1 ]
机构
[1] Periyar Univ, Dept Comp Sci, Salem, India
来源
2016 IEEE INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTER APPLICATIONS (ICACA) | 2016年
关键词
Clustering; Text document; Parallel Technique; Rough K-Means; Time complexity; PARALLEL;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Text Document clustering is changing the massive collections of text documents into a lesser amount of suitable clusters. While numerous clustering approaches have been projected in the last few decades, the partitioned clustering algorithms are stated performing well on document clustering based on the reviewed papers. In this research, Modified Parallel Rough K-means (MPRK) algorithm is proposed for clustering the text document and it is evaluated on datasets and the results are compared to benchmark algorithms K-means and DPPSOK-means. The experimental analysis shows the proposed algorithm produces efficient result compared to the existing algorithms.
引用
收藏
页码:224 / 229
页数:6
相关论文
共 50 条
  • [31] Clustering of very large datasets.
    Downs, GM
    Barnard, JM
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2001, 222 : U396 - U396
  • [32] Clustering Large Datasets with Kernel Methods
    Fausser, Stefan
    Schwenker, Friedhelm
    2012 21ST INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR 2012), 2012, : 501 - 504
  • [33] A Very Fast Method for Clustering Big Text Datasets
    Lin, Frank
    Cohen, WilliamW.
    ECAI 2010 - 19TH EUROPEAN CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2010, 215 : 303 - 308
  • [34] An Efficient Density Biased Sampling Algorithm for Clustering Large High-Dimensional Datasets
    Qian, Xue-Zhong
    Deng, Jie
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2015, 29 (08)
  • [35] Clustering large amounts of healthcare datasets using fuzzy c-means algorithm
    Reddy, B. Ramakantha
    Kumar, Y. Vijay
    Prabhakar, M.
    2019 5TH INTERNATIONAL CONFERENCE ON ADVANCED COMPUTING & COMMUNICATION SYSTEMS (ICACCS), 2019, : 93 - 97
  • [36] Robust multi-scale clustering of large DNA microarray datasets with the consensus algorithm
    Grotkjær, T
    Winther, O
    Regenberg, B
    Nielsen, J
    Hansen, LK
    BIOINFORMATICS, 2006, 22 (01) : 58 - 67
  • [37] High-Dimensional Text Datasets Clustering Algorithm Based on Cuckoo Search and Latent Semantic Indexing
    Boushaki, Saida Ishak
    Kamel, Nadjet
    Bendjeghaba, Omar
    JOURNAL OF INFORMATION & KNOWLEDGE MANAGEMENT, 2018, 17 (03)
  • [38] Clustering Large Datasets Using Data Stream Clustering Techniques
    Bolanos, Matthew
    Forrest, John
    Hahsler, Michael
    DATA ANALYSIS, MACHINE LEARNING AND KNOWLEDGE DISCOVERY, 2014, : 135 - 143
  • [39] A multidisciplinary ensemble algorithm for clustering heterogeneous datasets
    Hassan, Bryar A.
    Rashid, Tarik A.
    NEURAL COMPUTING & APPLICATIONS, 2021, 33 (17): : 10987 - 11010
  • [40] A Complete Linkage Algorithm for Clustering Dynamic Datasets
    Banerjee, Payel
    Chakrabarti, Amlan
    Ballabh, Tapas Kumar
    PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES INDIA SECTION A-PHYSICAL SCIENCES, 2024, 94 (05) : 471 - 486