Deep Multirepresentation Learning for Data Clustering

被引:2
|
作者
Sadeghi, Mohammadreza [1 ,2 ]
Armanfard, Narges [1 ,2 ]
机构
[1] McGill Univ, Dept Elect & Comp Engn, Montreal, PQ H3A 0E9, Canada
[2] Mila Quebec AI Inst, Montreal, PQ H2S 3H1, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
Clustering algorithms; Training; Task analysis; Decoding; Optimization; Learning systems; Image reconstruction; Autoencoder (AE); cluster-specific AEs; data clustering; multiple representation learning; FUZZY C-MEANS; RECOGNITION;
D O I
10.1109/TNNLS.2023.3289158
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Deep clustering incorporates embedding into clustering in order to find a lower-dimensional space suitable for clustering tasks. Conventional deep clustering methods aim to obtain a single global embedding subspace (aka latent space) for all the data clusters. In contrast, in this article, we propose a deep multirepresentation learning (DML) framework for data clustering whereby each difficult-to-cluster data group is associated with its own distinct optimized latent space and all the easy-to-cluster data groups are associated with a general common latent space. Autoencoders (AEs) are employed for generating cluster-specific and general latent spaces. To specialize each AE in its associated data cluster(s), we propose a novel and effective loss function which consists of weighted reconstruction and clustering losses of the data points, where higher weights are assigned to the samples more probable to belong to the corresponding cluster(s). Experimental results on benchmark datasets demonstrate that the proposed DML framework and loss function outperform state-of-the-art clustering approaches. In addition, the results show that the DML method significantly outperforms the SOTA on imbalanced datasets as a result of assigning an individual latent space to the difficult clusters.
引用
收藏
页码:15675 / 15686
页数:12
相关论文
共 50 条
  • [41] Intelligent medical heterogeneous big data set balanced clustering using deep learning
    Li, Xiaofeng
    Jiao, Hongshuang
    Li, Dong
    PATTERN RECOGNITION LETTERS, 2020, 138 : 548 - 555
  • [42] Deep representation learning for clustering longitudinal survival data from electronic health records
    Qiu, Jiajun
    Hu, Yao
    Li, Li
    Erzurumluoglu, Abdullah Mesut
    Braenne, Ingrid
    Whitehurst, Charles
    Schmitz, Jochen
    Arora, Jatin
    Bartholdy, Boris Alexander
    Gandhi, Shrey
    Khoueiry, Pierre
    Mueller, Stefanie
    Noyvert, Boris
    Ding, Zhihao
    Jensen, Jan Nygaard
    de Jong, Johann
    NATURE COMMUNICATIONS, 2025, 16 (01)
  • [43] A deep mining method for consumer behaviour data of e-commerce users based on clustering and deep learning
    Li J.
    International Journal of Web Based Communities, 2023, 19 (01) : 2 - 14
  • [44] Unsupervised Clustering for Deep Learning: A tutorial survey
    Karoly, Artur Istvan
    Fuller, Robert
    Galambos, Peter
    ACTA POLYTECHNICA HUNGARICA, 2018, 15 (08) : 29 - 53
  • [45] A Deep Learning Approach to Clustering Visual Arts
    Castellano, Giovanna
    Vessio, Gennaro
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2022, 130 (11) : 2590 - 2605
  • [46] Regression Based Clustering by Deep Adversarial Learning
    Tang, Fei
    Zhang, Dabin
    Cai, Tie
    Li, Qin
    IEEE ACCESS, 2020, 8 : 146744 - 146753
  • [47] Hierarchical clustering with deep Q-learning
    Forster, Richard
    Fulop, Agnes
    ACTA UNIVERSITATIS SAPIENTIAE INFORMATICA, 2018, 10 (01) : 86 - 109
  • [48] Online Deep Clustering for Unsupervised Representation Learning
    Zhan, Xiaohang
    Xie, Jiahao
    Liu, Ziwei
    Ong, Yew-Soon
    Loy, Chen Change
    2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 6687 - 6696
  • [49] Spectral Clustering With Adaptive Neighbors for Deep Learning
    Zhao, Yang
    Li, Xuelong
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 34 (04) : 2068 - 2078
  • [50] A Deep Learning Approach to Clustering Visual Arts
    Giovanna Castellano
    Gennaro Vessio
    International Journal of Computer Vision, 2022, 130 : 2590 - 2605