Sparse Fuzzy C-Means Clustering with Lasso Penalty

被引:1
|
作者
Parveen, Shazia [1 ]
Yang, Miin-Shen [1 ]
机构
[1] Chung Yuan Christian Univ, Dept Appl Math, Taoyuan 32023, Taiwan
来源
SYMMETRY-BASEL | 2024年 / 16卷 / 09期
关键词
clustering; fuzzy c-means (FCM); sparse FCM (S-FCM); lasso; S-FCM-Lasso; evaluation measures; SELECTION; ALGORITHMS;
D O I
10.3390/sym16091208
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Clustering is a technique of grouping data into a homogeneous structure according to the similarity or dissimilarity measures between objects. In clustering, the fuzzy c-means (FCM) algorithm is the best-known and most commonly used method and is a fuzzy extension of k-means in which FCM has been widely used in various fields. Although FCM is a good clustering algorithm, it only treats data points with feature components under equal importance and has drawbacks for handling high-dimensional data. The rapid development of social media and data acquisition techniques has led to advanced methods of collecting and processing larger, complex, and high-dimensional data. However, with high-dimensional data, the number of dimensions is typically immaterial or irrelevant. For features to be sparse, the Lasso penalty is capable of being applied to feature weights. A solution for FCM with sparsity is sparse FCM (S-FCM) clustering. In this paper, we propose a new S-FCM, called S-FCM-Lasso, which is a new type of S-FCM based on the Lasso penalty. The irrelevant features can be diminished towards exactly zero and assigned zero weights for unnecessary characteristics by the proposed S-FCM-Lasso. Based on various clustering performance measures, we compare S-FCM-Lasso with the S-FCM and other existing sparse clustering algorithms on several numerical and real-life datasets. Comparisons and experimental results demonstrate that, in terms of these performance measures, the proposed S-FCM-Lasso performs better than S-FCM and existing sparse clustering algorithms. This validates the efficiency and usefulness of the proposed S-FCM-Lasso algorithm for high-dimensional datasets with sparsity.
引用
收藏
页数:23
相关论文
共 50 条
  • [31] Hierarchically Structured Fuzzy c-Means Clustering
    Hye Won Suk
    Ji Yeh Choi
    Heungsun Hwang
    Behaviormetrika, 2013, 40 (1) : 1 - 17
  • [32] Novel possibilistic fuzzy c-means clustering
    School of Electrical and Information Engineering, Jiangsu University, Zhenjiang 212013, China
    不详
    Tien Tzu Hsueh Pao, 2008, 10 (1996-2000):
  • [33] An Accelerated Fuzzy C-Means clustering algorithm
    Hershfinkel, D
    Dinstein, I
    APPLICATIONS OF FUZZY LOGIC TECHNOLOGY III, 1996, 2761 : 41 - 52
  • [34] Suppressed fuzzy C-means clustering algorithm
    Fan, JL
    Zhen, WZ
    Xie, WX
    PATTERN RECOGNITION LETTERS, 2003, 24 (9-10) : 1607 - 1612
  • [35] Fuzzy c-Means Clustering for Uncertain Data Using Quadratic Penalty-Vector Regularization
    Endo, Yasunori
    Hasegawa, Yasushi
    Yukihiro, Hamasuna
    Kanzawa, Yuchi
    JOURNAL OF ADVANCED COMPUTATIONAL INTELLIGENCE AND INTELLIGENT INFORMATICS, 2011, 15 (01) : 76 - 82
  • [36] Relative entropy fuzzy c-means clustering
    Zarinbal, M.
    Zarandi, M. H. Fazel
    Turksen, I. B.
    INFORMATION SCIENCES, 2014, 260 : 74 - 97
  • [37] Diverse fuzzy c-means for image clustering
    Zhang, Lingling
    Luo, Minnan
    Liu, Jun
    Li, Zhihui
    Zheng, Qinghua
    PATTERN RECOGNITION LETTERS, 2020, 130 (130) : 275 - 283
  • [38] Robust Weighted Fuzzy C-Means Clustering
    Hadjahmadi, A. H.
    Homayounpour, M. A.
    Ahadi, S. M.
    2008 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS, VOLS 1-5, 2008, : 305 - 311
  • [39] Soil clustering by fuzzy c-means algorithm
    Goktepe, AB
    Altun, S
    Sezer, A
    ADVANCES IN ENGINEERING SOFTWARE, 2005, 36 (10) : 691 - 698
  • [40] Gaussian Collaborative Fuzzy C-Means Clustering
    Gao, Yunlong
    Wang, Zhihao
    Li, Huidui
    Pan, Jinyan
    INTERNATIONAL JOURNAL OF FUZZY SYSTEMS, 2021, 23 (07) : 2218 - 2234