MPRK Algorithm for Clustering the Large Text Datasets

Times cited: 0

Authors:

Thangarasu, M. [1]

Inbarani, H. Hannah [1]

Affiliations:

[1] Periyar Univ, Dept Comp Sci, Salem, India

Source:

2016 IEEE INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTER APPLICATIONS (ICACA) | 2016

Keywords:

Clustering; Text document; Parallel Technique; Rough K-Means; Time complexity; PARALLEL;

DOI:

Not available

Chinese Library Classification (CLC) number:

TP301 [Theory, Methods];

Discipline classification code:

081202 ;

Abstract:

Text document clustering organizes massive collections of text documents into a smaller number of meaningful clusters. Although numerous clustering approaches have been proposed over the last few decades, the reviewed literature reports that partitional clustering algorithms perform well on document clustering. In this research, a Modified Parallel Rough K-means (MPRK) algorithm is proposed for clustering text documents. It is evaluated on datasets, and the results are compared with the benchmark algorithms K-means and DPPSOK-means. The experimental analysis shows that the proposed algorithm produces efficient results compared with the existing algorithms.


Pages: 224-229

Number of pages: 6
