Quasi-cluster centers clustering algorithm based on potential entropy and t-distributed stochastic neighbor embedding

被引:5
|
作者
Fang, Xian [1 ]
Tie, Zhixin [1 ]
Guan, Yinan [1 ]
Rao, Shanshan [1 ]
机构
[1] Zhejiang Sci Tech Univ, Sch Informat Sci & Technol, Hangzhou, Zhejiang, Peoples R China
基金
中国国家自然科学基金;
关键词
Data clustering; Quasi-cluster centers clustering; Potential entropy; Optimal parameter; t-distributed stochastic neighbor embedding; DENSITY PEAKS; FAST SEARCH; FIND; REDUCTION; ROCK;
D O I
10.1007/s00500-018-3221-y
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A novel density-based clustering algorithm named QCC is presented recently. Although the algorithm has proved its strong robustness, it is still necessary to manually determine the two input parameters, including the number of neighbors (k) and the similarity threshold value (), which severely limits the promotion of the algorithm. In addition, the QCC does not perform excellently when confronting the datasets with relatively high dimensions. To overcome these defects, firstly, we define a new method for computing local density and introduce the strategy of potential entropy into the original algorithm. Based on this idea, we propose a new QCC clustering algorithm (QCC-PE). QCC-PE can automatically extract optimal value of the parameter k by optimizing potential entropy of data field. By this means, the optimized parameter can be calculated from the datasets objectively rather than the empirical estimation accumulated from a large number of experiments. Then, t-distributed stochastic neighbor embedding (tSNE) is applied to the model of QCC-PE and further brings forward a method based on tSNE (QCC-PE-tSNE), which preprocesses high-dimensional datasets by dimensionality reduction technique. We compare the performance of the proposed algorithms with QCC, DBSCAN, and DP in the synthetic datasets, Olivetti Face Database, and real-world datasets respectively. Experimental results show that our algorithms are feasible and effective and can often outperform the comparisons.
引用
收藏
页码:5645 / 5657
页数:13
相关论文
共 50 条
  • [1] Quasi-cluster centers clustering algorithm based on potential entropy and t-distributed stochastic neighbor embedding
    Xian Fang
    Zhixin Tie
    Yinan Guan
    Shanshan Rao
    Soft Computing, 2019, 23 : 5645 - 5657
  • [2] t-Distributed Stochastic Neighbor Embedding Spectral Clustering
    Rogovschi, Nicoleta
    Kitazono, Jun
    Grozavu, Nistor
    Omori, Toshiaki
    Ozawa, Seiichi
    2017 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2017, : 1628 - 1632
  • [3] Cluster Ensemble Approach Based on T-distributed Stochastic Neighbor Embedding
    Xu Sen
    Hua Xiaopeng
    Xu Jing
    Xu Xiufang
    Gao Jun
    An Jing
    JOURNAL OF ELECTRONICS & INFORMATION TECHNOLOGY, 2018, 40 (06) : 1316 - 1322
  • [4] QCC: a novel clustering algorithm based on Quasi-Cluster Centers
    Jinlong Huang
    Qingsheng Zhu
    Lijun Yang
    Dongdong Cheng
    Quanwang Wu
    Machine Learning, 2017, 106 : 337 - 357
  • [5] QCC: a novel clustering algorithm based on Quasi-Cluster Centers
    Huang, Jinlong
    Zhu, Qingsheng
    Yang, Lijun
    Cheng, Dongdong
    Wu, Quanwang
    MACHINE LEARNING, 2017, 106 (03) : 337 - 357
  • [6] GPU accelerated t-distributed stochastic neighbor embedding
    Chan, David M.
    Rao, Roshan
    Huang, Forrest
    Canny, John F.
    JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2019, 131 : 1 - 13
  • [7] Quantum kernel t-distributed stochastic neighbor embedding
    Kawase, Yoshiaki
    Mitarai, Kosuke
    Fujii, Keisuke
    PHYSICAL REVIEW RESEARCH, 2024, 6 (04):
  • [8] Blast furnace condition data clustering based on combination of T-distributed stochastic neighbor embedding and spectral clustering
    Fang, Xu
    Zhang, Sen
    Su, Xiaoli
    Zhao, Baoyong
    Xiao, Wendong
    Yin, Yixin
    Wang, Fenhua
    2019 IEEE 15TH INTERNATIONAL CONFERENCE ON CONTROL AND AUTOMATION (ICCA), 2019, : 1608 - 1613
  • [9] t-Distributed Stochastic Neighbor Embedding with Inhomogeneous Degrees of Freedom
    Kitazono, Jun
    Grozavu, Nistor
    Rogovschi, Nicoleta
    Omori, Toshiaki
    Ozawa, Seiichi
    NEURAL INFORMATION PROCESSING, ICONIP 2016, PT III, 2016, 9949 : 119 - 128
  • [10] T-Distributed Stochastic Neighbor Embedding with Gauss Initialization of Quantum Whale Optimization Algorithm
    Yang, Zan
    Sun, Yuan
    Li, Dan
    Zhang, Zhihao
    Xie, Yuchen
    PROCEEDINGS OF THE 39TH CHINESE CONTROL CONFERENCE, 2020, : 3200 - 3205