Scaling up kernel grower clustering method for large data sets via core-sets

Cited by: 0

Authors:

Chang, Liang [1]

Deng, Xiao-Ming [2,3]

Zheng, Sui-Wu [1]

Wang, Yong-Qing [1]

Affiliations:

[1] Key Laboratory of Complex System and Intelligence Science, Institute of Automation, Chinese Academy of Sciences, Beijing 100080, China

[2] Virtual Reality Laboratory, Institute of Computing Technology, Chinese Academy of Sciences, Beijing 100080, China

[3] National Laboratory of Pattern Recognition, Institute of Automation, Chinese Academy of Sciences, Beijing 100080, China

Source:

Zidonghua Xuebao/Acta Automatica Sinica | 2008, Vol. 34, No. 3

Funding:

National Natural Science Foundation of China

Keywords:

Data mining - Data structures - Image segmentation - Pattern recognition - Self-organizing maps

DOI:

10.3724/SP.J.1004.2008.00376

Abstract:

Kernel grower is a kernel clustering method recently proposed by Camastra and Verri. It performs well on a variety of data sets and compares favorably with popular clustering algorithms. Its main drawback is that it scales poorly to large data sets, which greatly restricts its application. In this paper, we propose a scaled-up kernel grower method based on core-sets that is significantly faster than the original method on large data sets and can handle very large data sets. Numerical experiments on benchmark and synthetic data sets demonstrate the efficiency of the proposed method. The method is also applied to real image segmentation to illustrate its performance.

Pages: 376-382
