Sparse Fuzzy C-Means Clustering with Lasso Penalty

被引:1
|
作者
Parveen, Shazia [1 ]
Yang, Miin-Shen [1 ]
机构
[1] Chung Yuan Christian Univ, Dept Appl Math, Taoyuan 32023, Taiwan
来源
SYMMETRY-BASEL | 2024年 / 16卷 / 09期
关键词
clustering; fuzzy c-means (FCM); sparse FCM (S-FCM); lasso; S-FCM-Lasso; evaluation measures; SELECTION; ALGORITHMS;
D O I
10.3390/sym16091208
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Clustering is a technique of grouping data into a homogeneous structure according to the similarity or dissimilarity measures between objects. In clustering, the fuzzy c-means (FCM) algorithm is the best-known and most commonly used method and is a fuzzy extension of k-means in which FCM has been widely used in various fields. Although FCM is a good clustering algorithm, it only treats data points with feature components under equal importance and has drawbacks for handling high-dimensional data. The rapid development of social media and data acquisition techniques has led to advanced methods of collecting and processing larger, complex, and high-dimensional data. However, with high-dimensional data, the number of dimensions is typically immaterial or irrelevant. For features to be sparse, the Lasso penalty is capable of being applied to feature weights. A solution for FCM with sparsity is sparse FCM (S-FCM) clustering. In this paper, we propose a new S-FCM, called S-FCM-Lasso, which is a new type of S-FCM based on the Lasso penalty. The irrelevant features can be diminished towards exactly zero and assigned zero weights for unnecessary characteristics by the proposed S-FCM-Lasso. Based on various clustering performance measures, we compare S-FCM-Lasso with the S-FCM and other existing sparse clustering algorithms on several numerical and real-life datasets. Comparisons and experimental results demonstrate that, in terms of these performance measures, the proposed S-FCM-Lasso performs better than S-FCM and existing sparse clustering algorithms. This validates the efficiency and usefulness of the proposed S-FCM-Lasso algorithm for high-dimensional datasets with sparsity.
引用
收藏
页数:23
相关论文
共 50 条
  • [1] Sparse possibilistic c-means clustering with Lasso
    Yang, Miin-Shen
    Benjamin, Josephine B. M.
    PATTERN RECOGNITION, 2023, 138
  • [2] Sparse learning based fuzzy c-means clustering
    Gu, Jing
    Jiao, Licheng
    Yang, Shuyuan
    Zhao, Jiaqi
    KNOWLEDGE-BASED SYSTEMS, 2017, 119 : 113 - 125
  • [3] Projected fuzzy c-means clustering algorithm with instance penalty
    Wang, Jikui
    Wu, Yiwen
    Huang, Xueyan
    Zhang, Cuihong
    Nie, Feiping
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 255
  • [4] A sparse fuzzy c-means algorithm based on sparse clustering framework
    Qiu, Xianen
    Qiu, Yanyi
    Feng, Guocan
    Li, Peixing
    NEUROCOMPUTING, 2015, 157 : 290 - 295
  • [5] An insensitivity Fuzzy C-means clustering algorithm based on penalty factor
    Chen, Jiashun
    Pi, Dechang
    Liu, Zhipeng
    Journal of Software, 2013, 8 (09) : 2379 - 2384
  • [6] Fuzzy c-means clustering algorithm with a novel penalty term for image segmentation
    Yang, Y
    Zheng, C
    Lin, P
    OPTO-ELECTRONICS REVIEW, 2005, 13 (04) : 309 - 315
  • [7] Image segmentation by fuzzy c-means clustering algorithm with a novel penalty term
    Yang, Yong
    Huang, Shuying
    COMPUTING AND INFORMATICS, 2007, 26 (01) : 17 - 31
  • [8] Fuzzy c-means for fuzzy hierarchical clustering
    Vicenc, T
    FUZZ-IEEE 2005: PROCEEDINGS OF THE IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS: BIGGEST LITTLE CONFERENCE IN THE WORLD, 2005, : 646 - 651
  • [9] Mixed fuzzy C-means clustering
    Demirhan, Haydar
    INFORMATION SCIENCES, 2025, 690
  • [10] On Tolerant Fuzzy c-Means Clustering
    Hamasuna, Yukihiro
    Endo, Yasunori
    Miyamoto, Sadaaki
    JOURNAL OF ADVANCED COMPUTATIONAL INTELLIGENCE AND INTELLIGENT INFORMATICS, 2009, 13 (04) : 421 - 428