Semi supervised approach towards subspace clustering

被引:5
|
作者
Harikumar, Sandhya [1 ]
Akhil, A. S. [1 ]
机构
[1] Amrita Vishwa Vidyapeetham, Dept Comp Sci & Engn, Amritapuri, India
关键词
Subspace clustering; semi-supervised; information gain; entropy;
D O I
10.3233/JIFS-169456
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
High-dimensional data analysis is quite inevitable due to emerging technologies in various domains such as finance, healthcare, genomics and signal processing. Though data sets generated in these domains are high-dimensional, intrinsic dimensions that provide meaningful information are often much smaller. Conventionally, unsupervised clustering methods known as subspace clustering are utilized for finding clusters in different subspaces of high dimensional data, by identifying relevant features, irrespective of labels associated with each instance. Available label information, if incorporated in clustering algorithm, can bias the algorithm towards solutions more consistent with our knowledge, leading to improved cluster quality. Therefore, an Information Gain based Semi-supervised-subspace Clustering (IGSC) is proposed that identifies a subset of important attributes based on the known label for each data instance. The information about the labels associated with data sets is integrated with the search strategy for subspaces to leverage them into a model based clustering approach. Our experimentation on 13 real world labeled data sets proves the feasibility of IGSC and we validate the clusters obtained, using an improvised Davies Bouldin Index (DBI) for semi-supervised clusters.
引用
收藏
页码:1619 / 1629
页数:11
相关论文
共 50 条
  • [41] Semi-supervised fuzzy clustering: A kernel-based approach
    Zhang, Huaxiang
    Lu, Jing
    KNOWLEDGE-BASED SYSTEMS, 2009, 22 (06) : 477 - 481
  • [42] A New Approach for Semi-supervised Fuzzy Clustering with Multiple Fuzzifiers
    Tran Manh Tuan
    Mai Dinh Sinh
    Tran Dinh Khang
    Phung The Huan
    Tran Thi Ngan
    Nguyen Long Giang
    Vu Duc Thai
    INTERNATIONAL JOURNAL OF FUZZY SYSTEMS, 2022, 24 (8) : 3688 - 3701
  • [43] Applications of semi-supervised subspace possibilistic fuzzy c-means clustering algorithm in IoT
    Zhang, Y. F.
    Zhang, Wei
    INFORMATION TECHNOLOGY AND COMPUTER APPLICATION ENGINEERING, 2014, : 7 - 10
  • [44] Self-Supervised Convolutional Subspace Clustering Network
    Zhang, Junjian
    Li, Chun-Guang
    You, Chong
    Qi, Xianbiao
    Zhang, Honggang
    Guo, Jun
    Lin, Zhouchen
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 5468 - 5477
  • [45] A Convex Approach to Subspace Clustering
    Ohlsson, Henrik
    Ljung, Lennart
    2011 50TH IEEE CONFERENCE ON DECISION AND CONTROL AND EUROPEAN CONTROL CONFERENCE (CDC-ECC), 2011, : 1467 - 1472
  • [46] Semi-supervised clustering methods
    Bair, Eric
    WILEY INTERDISCIPLINARY REVIEWS-COMPUTATIONAL STATISTICS, 2013, 5 (05): : 349 - 361
  • [47] SEMI-SUPERVISED SPECTRAL CLUSTERING
    Mai, Xiaoyi
    Couillet, Romain
    2018 CONFERENCE RECORD OF 52ND ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS, AND COMPUTERS, 2018, : 2012 - 2016
  • [48] A review on semi-supervised clustering
    Cai, Jianghui
    Hao, Jing
    Yang, Haifeng
    Zhao, Xujun
    Yang, Yuqing
    INFORMATION SCIENCES, 2023, 632 : 164 - 200
  • [49] Semi-supervised transfer subspace for domain adaptation
    Pereira, Luis A. M.
    Torres, Ricardo da Silva
    PATTERN RECOGNITION, 2018, 75 : 235 - 249
  • [50] Towards Safe Semi-supervised Classification: Adjusted Cluster Assumption via Clustering
    Wang, Yunyun
    Meng, Yan
    Fu, Zhenyong
    Xue, Hui
    NEURAL PROCESSING LETTERS, 2017, 46 (03) : 1031 - 1042