Quasi-cluster centers clustering algorithm based on potential entropy and t-distributed stochastic neighbor embedding

被引：5

作者：

Fang, Xian ^{[1
]}

Tie, Zhixin ^{[1
]}

Guan, Yinan ^{[1
]}

Rao, Shanshan ^{[1
]}

机构：

[1] Zhejiang Sci Tech Univ, Sch Informat Sci & Technol, Hangzhou, Zhejiang, Peoples R China

来源：

SOFT COMPUTING | 2019年 / 23卷 / 14期

基金：

中国国家自然科学基金;

关键词：

Data clustering; Quasi-cluster centers clustering; Potential entropy; Optimal parameter; t-distributed stochastic neighbor embedding; DENSITY PEAKS; FAST SEARCH; FIND; REDUCTION; ROCK;

D O I：

10.1007/s00500-018-3221-y

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

A novel density-based clustering algorithm named QCC is presented recently. Although the algorithm has proved its strong robustness, it is still necessary to manually determine the two input parameters, including the number of neighbors (k) and the similarity threshold value (), which severely limits the promotion of the algorithm. In addition, the QCC does not perform excellently when confronting the datasets with relatively high dimensions. To overcome these defects, firstly, we define a new method for computing local density and introduce the strategy of potential entropy into the original algorithm. Based on this idea, we propose a new QCC clustering algorithm (QCC-PE). QCC-PE can automatically extract optimal value of the parameter k by optimizing potential entropy of data field. By this means, the optimized parameter can be calculated from the datasets objectively rather than the empirical estimation accumulated from a large number of experiments. Then, t-distributed stochastic neighbor embedding (tSNE) is applied to the model of QCC-PE and further brings forward a method based on tSNE (QCC-PE-tSNE), which preprocesses high-dimensional datasets by dimensionality reduction technique. We compare the performance of the proposed algorithms with QCC, DBSCAN, and DP in the synthetic datasets, Olivetti Face Database, and real-world datasets respectively. Experimental results show that our algorithms are feasible and effective and can often outperform the comparisons.

引用

页码：5645 / 5657

页数：13

共 50 条

[11] On the Role and Impact of the Metaparameters in t-distributed Stochastic Neighbor Embedding
Lee, John A.
Verleysen, Michel
COMPSTAT'2010: 19TH INTERNATIONAL CONFERENCE ON COMPUTATIONAL STATISTICS, 2010, : 337 - 346
[12] A Preprocessing Manifold Learning Strategy Based on t-Distributed Stochastic Neighbor Embedding
Shi, Sha
Xu, Yefei
Xu, Xiaoyang
Mo, Xiaofan
Ding, Jun
ENTROPY, 2023, 25 (07)
[13] Clustering Heterogeneous Conformational Ensembles of Intrinsically Disordered Proteins with t-Distributed Stochastic Neighbor Embedding
Appadurai, Rajeswari
Koneru, Jaya Krishna
Bonomi, Massimiliano
Robustelli, Paul
Srivastava, Anand
JOURNAL OF CHEMICAL THEORY AND COMPUTATION, 2023, 19 (14) : 4711 - 4727
[14] Multiscale Distribution Entropy and t-Distributed Stochastic Neighbor Embedding-Based Fault Diagnosis of Rolling Bearings
Tu, Deyu
Zheng, Jinde
Jiang, Zhanwei
Pan, Haiyang
ENTROPY, 2018, 20 (05)
[15] Hyperspectral image visualization using t-distributed stochastic neighbor embedding
Zhang, Biyin
Yu, Xin
MIPPR 2015: REMOTE SENSING IMAGE PROCESSING, GEOGRAPHIC INFORMATION SYSTEMS, AND OTHER APPLICATIONS, 2015, 9815
[16] Supervised t-Distributed Stochastic Neighbor Embedding for Data Visualization and Classification
Cheng, Yichen
Wang, Xinlei
Xia, Yusen
INFORMS JOURNAL ON COMPUTING, 2021, 33 (02) : 566 - 585
[17] T-Distributed Stochastic Neighbor Embedding for Co-Representation Learning
Chen, Wei
Wang, Hongjun
Zhang, Yinghui
Deng, Ping
Luo, Zhipeng
Li, Tianrui
ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2024, 15 (02)
[18] A Novel Radar HRRP Recognition Method with Accelerated T-Distributed Stochastic Neighbor Embedding and Density-Based Clustering
Wu, Hao
Dai, Dahai
Wang, Xuesong
SENSORS, 2019, 19 (23)
[19] Applying t-Distributed Stochastic Neighbor Embedding for Improving Fingerprinting-Based Localization System
Tarekegn, Getaneh Berie
Tai, Li-Chia
Lin, Hsin-Piao
Tesfaw, Belayneh Abebe
Juang, Rong-Terng
Hsu, Huan-Chia
Huang, Kai-Lun
Singh, Kanishk
IEEE SENSORS LETTERS, 2023, 7 (09)
[20] T-Distributed Stochastic Neighbor Embedding Based on Cockroach Swarm Optimization with Student Distribution Parameters
Qiu, Mengdie
Yang, Zan
Nai, Wei
Li, Dan
Xing, Yidan
Li, Kai
PROCEEDINGS OF 2021 IEEE 12TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING AND SERVICE SCIENCE (ICSESS), 2021, : 291 - 294

← 1 2 3 4 5 →