Scaling up kernel grower clustering method for large data sets via core-sets

被引:0
|
作者
Chang, Liang [1 ]
Deng, Xiao-Ming [2 ,3 ]
Zheng, Sui-Wu [1 ]
Wang, Yong-Qing [1 ]
机构
[1] Key Laboratory of Complex System and Intelligence Science, Institute of Automation, Chinese Academy of Sciences, Beijing 100080, China
[2] Virtual Reality Laboratory, Institute of Computing Technology, Chinese Academy of Sciences, Beijing 100080, China
[3] National Laboratory of Pattern Recognition, Institute of Automation, Chinese Academy of Sciences, Beijing 100080, China
来源
基金
中国国家自然科学基金;
关键词
Data mining - Data structures - Image segmentation - Pattern recognition - Self organizing maps;
D O I
10.3724/SP.J.1004.2008.00376
中图分类号
学科分类号
摘要
Kernel grower is a novel kernel clustering method proposed recently by Camastra and Verri. It shows good performance for various data sets and compares favorably with respect to popular clustering algorithms. However, the main drawback of the method is the weak scaling ability in dealing with large data sets, which restricts its application greatly. In this paper, we propose a scaled-up kernel grower method using core-sets, which is significantly faster than the original method for large data clustering. Meanwhile, it can deal with very large data sets. Numerical experiments on benchmark data sets as well as synthetic data sets show the efficiency of the proposed method. The method is also applied to real image segmentation to illustrate its performance.
引用
收藏
页码:376 / 382
相关论文
共 50 条
  • [21] The least core, kernel and bargaining sets of large games
    Ezra Einy
    Dov Monderer
    Diego Moreno
    Economic Theory, 1998, 11 : 585 - 601
  • [22] Multidimensional scaling for large genomic data sets
    Tzeng, Jengnan
    Lu, Henry Horng-Shing
    Li, Wen-Hsiung
    BMC BIOINFORMATICS, 2008, 9 (1)
  • [23] Multidimensional scaling for large genomic data sets
    Jengnan Tzeng
    Henry Horng-Shing Lu
    Wen-Hsiung Li
    BMC Bioinformatics, 9
  • [24] New diagonal bundle method for clustering problems in large data sets
    Karmitsa, Napsu
    Bagirov, Adil M.
    Taheri, Sona
    EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2017, 263 (02) : 367 - 379
  • [25] Clustering Analysis for Large Scale Data Sets
    Singh, Sachin
    Mishra, Ashish
    2015 INTERNATIONAL CONFERENCE ON COMPUTING, COMMUNICATION & AUTOMATION (ICCCA), 2015, : 1 - 4
  • [26] CLUSTERING OF LARGE DATA SETS - ZUPAN,J
    EVERITT, BS
    STATISTICIAN, 1983, 32 (03): : 355 - 355
  • [27] Bayesian nonparametric clustering for large data sets
    Zuanetti, Daiane Aparecida
    Mueller, Peter
    Zhu, Yitan
    Yang, Shengjie
    Ji, Yuan
    STATISTICS AND COMPUTING, 2019, 29 (02) : 203 - 215
  • [28] Bayesian nonparametric clustering for large data sets
    Daiane Aparecida Zuanetti
    Peter Müller
    Yitan Zhu
    Shengjie Yang
    Yuan Ji
    Statistics and Computing, 2019, 29 : 203 - 215
  • [29] Clustering Algorithms for Large Temporal Data Sets
    Scepi, Germana
    DATA ANALYSIS AND CLASSIFICATION, 2010, : 369 - 377
  • [30] Clustering Very Large Dissimilarity Data Sets
    Hammer, Barbara
    Hasenfuss, Alexander
    ARTIFICIAL NEURAL NETWORKS IN PATTERN RECOGNITION, PROCEEDINGS, 2010, 5998 : 259 - +