Semi supervised approach towards subspace clustering

Cited by: 5
Authors
Harikumar, Sandhya [1 ]
Akhil, A. S. [1 ]
Affiliations
[1] Amrita Vishwa Vidyapeetham, Dept Comp Sci & Engn, Amritapuri, India
Keywords
Subspace clustering; semi-supervised; information gain; entropy;
DOI
10.3233/JIFS-169456
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
High-dimensional data analysis has become unavoidable owing to emerging technologies in domains such as finance, healthcare, genomics and signal processing. Although the data sets generated in these domains are high-dimensional, the intrinsic dimensions that provide meaningful information are often far fewer. Conventionally, unsupervised methods known as subspace clustering are used to find clusters in different subspaces of high-dimensional data by identifying relevant features, irrespective of the labels associated with each instance. Available label information, if incorporated into the clustering algorithm, can bias the algorithm towards solutions more consistent with prior knowledge, leading to improved cluster quality. Therefore, an Information Gain based Semi-supervised subspace Clustering (IGSC) method is proposed that identifies a subset of important attributes based on the known label of each data instance. The label information associated with the data sets is integrated with the search strategy for subspaces and leveraged in a model-based clustering approach. Our experiments on 13 real-world labeled data sets demonstrate the feasibility of IGSC, and we validate the clusters obtained using an improvised Davies-Bouldin Index (DBI) for semi-supervised clusters.
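The abstract's idea of ranking attributes by how much they reveal about the known labels can be illustrated with the textbook definition of information gain. The sketch below is not the authors' IGSC implementation; it assumes discrete attribute values, and the function names (`entropy`, `information_gain`, `rank_attributes`) and the toy data are hypothetical, chosen only for illustration.

```python
import math
from collections import Counter


def entropy(labels):
    """Shannon entropy (in bits) of a sequence of class labels."""
    total = len(labels)
    return -sum((n / total) * math.log2(n / total)
                for n in Counter(labels).values())


def information_gain(values, labels):
    """Label entropy minus the expected label entropy after
    grouping instances by the (discrete) attribute value."""
    groups = {}
    for value, label in zip(values, labels):
        groups.setdefault(value, []).append(label)
    remainder = sum(len(g) / len(labels) * entropy(g)
                    for g in groups.values())
    return entropy(labels) - remainder


def rank_attributes(rows, labels):
    """Return (attribute index, information gain) pairs, highest gain first."""
    n_attrs = len(rows[0])
    gains = [(j, information_gain([row[j] for row in rows], labels))
             for j in range(n_attrs)]
    return sorted(gains, key=lambda pair: pair[1], reverse=True)


# Toy data (an assumption for illustration): attribute 0 determines the
# label exactly, attribute 1 is independent of it.
rows = [("a", "x"), ("a", "y"), ("b", "x"), ("b", "y")]
labels = [0, 0, 1, 1]
ranking = rank_attributes(rows, labels)
```

In this toy case attribute 0 is ranked first because it separates the two labels completely, while attribute 1 carries no label information. A semi-supervised subspace search could use such a ranking to decide which attributes to consider first; how IGSC does this is described in the paper itself, not here.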
Pages: 1619-1629
Page count: 11
Related papers
50 in total
  • [21] Semi-Supervised Subspace Clustering via Tensor Low-Rank Representation
    Jia, Yuheng
    Lu, Guanxing
    Liu, Hui
    Hou, Junhui
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 33 (07) : 3455 - 3461
  • [22] SEMI-SUPERVISED SUBSPACE SEGMENTATION
    Wang, Dong
    Yin, Qiyue
    He, Ran
    Wang, Liang
    Tan, Tieniu
    2014 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2014, : 2854 - 2858
  • [23] A semi-supervised clustering approach using labeled data
    Taghizabet, A.
    Tanha, J.
    Amini, A.
    Mohammadzadeh, J.
    SCIENTIA IRANICA, 2023, 30 (01) : 104 - 115
  • [24] A Semi-Supervised Clustering Approach for Semantic Slot Labelling
    Cuayahuitl, Heriberto
    Dethlefs, Nina
    Hastie, Helen
    2014 13TH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA), 2014, : 500 - 505
  • [25] A novel semi-supervised approach for network traffic clustering
    Wang Y.
    Xiang Y.
    Zhang J.
    Yu S.
    Proceedings - 2011 5th International Conference on Network and System Security, NSS 2011, 2011, : 169 - 175
  • [26] Fast Semi-Supervised Fuzzy Clustering: Approach and Application
    Cai, Jia-xin
    Yang, Feng
    Feng, Guo-can
    PROCEEDINGS OF THE 2009 CHINESE CONFERENCE ON PATTERN RECOGNITION AND THE FIRST CJK JOINT WORKSHOP ON PATTERN RECOGNITION, VOLS 1 AND 2, 2009, : 108 - +
  • [27] Constrained Tensor Representation Learning for Multi-View Semi-Supervised Subspace Clustering
    Tang, Yongqiang
    Xie, Yuan
    Zhang, Chenyang
    Zhang, Wensheng
    IEEE TRANSACTIONS ON MULTIMEDIA, 2022, 24 : 3920 - 3933
  • [28] Hypergraph-Supervised Deep Subspace Clustering
    Hu, Yu
    Cai, Hongmin
    MATHEMATICS, 2021, 9 (24)
  • [29] Self-Supervised Embedding for Subspace Clustering
    Zhu, Wenjie
    Peng, Bo
    Chen, Chunchun
    PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT, CIKM 2021, 2021, : 3687 - 3691
  • [30] Exploiting Unsupervised and Supervised Constraints for Subspace Clustering
    Hu, Han
    Feng, Jianjiang
    Zhou, Jie
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2015, 37 (08) : 1542 - 1557