Quasi-cluster centers clustering algorithm based on potential entropy and t-distributed stochastic neighbor embedding

被引:5
|
作者
Fang, Xian [1 ]
Tie, Zhixin [1 ]
Guan, Yinan [1 ]
Rao, Shanshan [1 ]
机构
[1] Zhejiang Sci Tech Univ, Sch Informat Sci & Technol, Hangzhou, Zhejiang, Peoples R China
基金
中国国家自然科学基金;
关键词
Data clustering; Quasi-cluster centers clustering; Potential entropy; Optimal parameter; t-distributed stochastic neighbor embedding; DENSITY PEAKS; FAST SEARCH; FIND; REDUCTION; ROCK;
D O I
10.1007/s00500-018-3221-y
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A novel density-based clustering algorithm named QCC is presented recently. Although the algorithm has proved its strong robustness, it is still necessary to manually determine the two input parameters, including the number of neighbors (k) and the similarity threshold value (), which severely limits the promotion of the algorithm. In addition, the QCC does not perform excellently when confronting the datasets with relatively high dimensions. To overcome these defects, firstly, we define a new method for computing local density and introduce the strategy of potential entropy into the original algorithm. Based on this idea, we propose a new QCC clustering algorithm (QCC-PE). QCC-PE can automatically extract optimal value of the parameter k by optimizing potential entropy of data field. By this means, the optimized parameter can be calculated from the datasets objectively rather than the empirical estimation accumulated from a large number of experiments. Then, t-distributed stochastic neighbor embedding (tSNE) is applied to the model of QCC-PE and further brings forward a method based on tSNE (QCC-PE-tSNE), which preprocesses high-dimensional datasets by dimensionality reduction technique. We compare the performance of the proposed algorithms with QCC, DBSCAN, and DP in the synthetic datasets, Olivetti Face Database, and real-world datasets respectively. Experimental results show that our algorithms are feasible and effective and can often outperform the comparisons.
引用
收藏
页码:5645 / 5657
页数:13
相关论文
共 50 条
  • [11] On the Role and Impact of the Metaparameters in t-distributed Stochastic Neighbor Embedding
    Lee, John A.
    Verleysen, Michel
    COMPSTAT'2010: 19TH INTERNATIONAL CONFERENCE ON COMPUTATIONAL STATISTICS, 2010, : 337 - 346
  • [12] A Preprocessing Manifold Learning Strategy Based on t-Distributed Stochastic Neighbor Embedding
    Shi, Sha
    Xu, Yefei
    Xu, Xiaoyang
    Mo, Xiaofan
    Ding, Jun
    ENTROPY, 2023, 25 (07)
  • [13] Clustering Heterogeneous Conformational Ensembles of Intrinsically Disordered Proteins with t-Distributed Stochastic Neighbor Embedding
    Appadurai, Rajeswari
    Koneru, Jaya Krishna
    Bonomi, Massimiliano
    Robustelli, Paul
    Srivastava, Anand
    JOURNAL OF CHEMICAL THEORY AND COMPUTATION, 2023, 19 (14) : 4711 - 4727
  • [14] Multiscale Distribution Entropy and t-Distributed Stochastic Neighbor Embedding-Based Fault Diagnosis of Rolling Bearings
    Tu, Deyu
    Zheng, Jinde
    Jiang, Zhanwei
    Pan, Haiyang
    ENTROPY, 2018, 20 (05)
  • [15] Hyperspectral image visualization using t-distributed stochastic neighbor embedding
    Zhang, Biyin
    Yu, Xin
    MIPPR 2015: REMOTE SENSING IMAGE PROCESSING, GEOGRAPHIC INFORMATION SYSTEMS, AND OTHER APPLICATIONS, 2015, 9815
  • [16] Supervised t-Distributed Stochastic Neighbor Embedding for Data Visualization and Classification
    Cheng, Yichen
    Wang, Xinlei
    Xia, Yusen
    INFORMS JOURNAL ON COMPUTING, 2021, 33 (02) : 566 - 585
  • [17] T-Distributed Stochastic Neighbor Embedding for Co-Representation Learning
    Chen, Wei
    Wang, Hongjun
    Zhang, Yinghui
    Deng, Ping
    Luo, Zhipeng
    Li, Tianrui
    ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2024, 15 (02)
  • [18] A Novel Radar HRRP Recognition Method with Accelerated T-Distributed Stochastic Neighbor Embedding and Density-Based Clustering
    Wu, Hao
    Dai, Dahai
    Wang, Xuesong
    SENSORS, 2019, 19 (23)
  • [19] Applying t-Distributed Stochastic Neighbor Embedding for Improving Fingerprinting-Based Localization System
    Tarekegn, Getaneh Berie
    Tai, Li-Chia
    Lin, Hsin-Piao
    Tesfaw, Belayneh Abebe
    Juang, Rong-Terng
    Hsu, Huan-Chia
    Huang, Kai-Lun
    Singh, Kanishk
    IEEE SENSORS LETTERS, 2023, 7 (09)
  • [20] T-Distributed Stochastic Neighbor Embedding Based on Cockroach Swarm Optimization with Student Distribution Parameters
    Qiu, Mengdie
    Yang, Zan
    Nai, Wei
    Li, Dan
    Xing, Yidan
    Li, Kai
    PROCEEDINGS OF 2021 IEEE 12TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING AND SERVICE SCIENCE (ICSESS), 2021, : 291 - 294