Robust unsupervised image categorization based on variational autoencoder with disentangled latent representations

Cited by: 5
Authors
Yang, Lin [1]
Fan, Wentao [1]
Bouguila, Nizar [2]
Affiliations
[1] Huaqiao Univ, Dept Comp Sci & Technol, Xiamen, Peoples R China
[2] Concordia Univ, Concordia Inst Informat Syst Engn CIISE, Montreal, PQ, Canada
Funding
National Natural Science Foundation of China;
Keywords
Clustering; Variational autoencoder (VAE); Disentangled latent representations; Robust training; Mixture model; Student's-t distribution;
DOI
10.1016/j.knosys.2022.108671
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Recently, deep generative models have been successfully applied to unsupervised clustering analyses, due to the model capabilities for learning good representations of the input data from a lower dimensional latent space. In this work, we propose a robust deep generative clustering method based on a variational autoencoder (VAE) for unsupervised image categorization. The merits of our method can be summarized as follows. First, each latent representation generated by the encoder is disentangled into the cluster representation and generation representation, where the cluster representation is responsible for preserving the clustering information, while the generation representation is responsible for conserving the generation information. Thus, by only utilizing the cluster representation, we can improve the performance and efficiency of clustering tasks without interference from generating tasks. Second, a Student's-t mixture model is adopted as the prior over the cluster representation to enhance the robustness of our method against clustering outliers. Third, we propose a biaugmentation module to promote the training stability for our model. In contrast with most of the existing deep generative clustering methods that require a pretraining step to stabilize the training process, our model is able to provide a stable training process through feature disentanglement and data augmentation. We validate the proposed robust deep generative clustering method through extensive experiments by comparing it with state-of-the-art methods on unsupervised image categorization. (C) 2022 Elsevier B.V. All rights reserved.
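The two ideas the abstract names, splitting each latent vector into a cluster part and a generation part, and placing a Student's-t mixture prior over the cluster part, can be illustrated with a minimal sketch. This is not the authors' implementation: the function names (`split_latent`, `student_t_logpdf`, `responsibilities`), the diagonal scale parameterization, and all numeric values are assumptions made for illustration only.

```python
import math

def split_latent(z, cluster_dim):
    """Split one latent vector into (cluster part, generation part).
    Assumption: the first `cluster_dim` entries form the cluster part."""
    return z[:cluster_dim], z[cluster_dim:]

def student_t_logpdf(x, mean, scale, dof):
    """Log density of a multivariate Student's-t with a diagonal scale.
    `scale` holds per-dimension scale values (assumed parameterization)."""
    d = len(x)
    # Squared Mahalanobis distance under the diagonal scale matrix.
    maha = sum(((xi - mi) / si) ** 2 for xi, mi, si in zip(x, mean, scale))
    log_norm = (math.lgamma((dof + d) / 2.0) - math.lgamma(dof / 2.0)
                - 0.5 * d * math.log(dof * math.pi)
                - sum(math.log(si) for si in scale))
    return log_norm - 0.5 * (dof + d) * math.log1p(maha / dof)

def responsibilities(z_cluster, weights, means, scales, dofs):
    """Posterior probability of each mixture component given the cluster part."""
    log_terms = [math.log(w) + student_t_logpdf(z_cluster, m, s, v)
                 for w, m, s, v in zip(weights, means, scales, dofs)]
    top = max(log_terms)  # subtract the maximum for numerical stability
    unnorm = [math.exp(t - top) for t in log_terms]
    total = sum(unnorm)
    return [u / total for u in unnorm]

# Hypothetical 4-dimensional latent vector: 2 cluster dims, 2 generation dims.
z = [0.1, -0.2, 1.5, 0.7]
z_cluster, z_generation = split_latent(z, cluster_dim=2)

# Hypothetical two-component Student's-t mixture over the cluster part.
gamma = responsibilities(
    z_cluster,
    weights=[0.5, 0.5],
    means=[[0.0, 0.0], [3.0, 3.0]],
    scales=[[1.0, 1.0], [1.0, 1.0]],
    dofs=[3.0, 3.0],
)
cluster_assignment = gamma.index(max(gamma))
```

Only `z_cluster` enters the cluster assignment, which mirrors the abstract's point that clustering can proceed without interference from the generation part. The heavier tails of the Student's-t density, compared with a Gaussian, are what make such a prior less sensitive to outliers.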
Pages: 9
Related papers
50 in total
  • [1] CONTRASTIVE PREDICTIVE CODING SUPPORTED FACTORIZED VARIATIONAL AUTOENCODER FOR UNSUPERVISED LEARNING OF DISENTANGLED SPEECH REPRESENTATIONS
    Ebbers, Janek
    Kuhlmann, Michael
    Cord-Landwehr, Tobias
    Haeb-Umbach, Reinhold
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 3860 - 3864
  • [2] Unsupervised Image Categorization Based on Variational Autoencoder and Student's-T Mixture Model
    Zhang, Yu
    Fan, Wentao
    Bouguila, Nizar
    2019 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (IEEE SSCI 2019), 2019, : 2403 - 2409
  • [3] Unsupervised image categorization based on deep generative models with disentangled representations and von Mises-Fisher distributions
    Fan, Wentao
    Xu, Kunxiong
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2025, 16 (01) : 611 - 623
  • [4] Robust and Unsupervised KPI Anomaly Detection Based on Conditional Variational Autoencoder
    Li, Zeyan
    Chen, Wenxiao
    Pei, Dan
    2018 IEEE 37TH INTERNATIONAL PERFORMANCE COMPUTING AND COMMUNICATIONS CONFERENCE (IPCCC), 2018,
  • [5] Learning latent representations of bank customers with the Variational Autoencoder
    Mancisidor, Rogelio A.
    Kampffmeyer, Michael
    Aas, Kjersti
    Jenssen, Robert
    EXPERT SYSTEMS WITH APPLICATIONS, 2021, 164
  • [6] Unsupervised robust clustering for image database categorization
    Le Saux, B
    Boujemaa, N
    16TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL I, PROCEEDINGS, 2002, : 259 - 262
  • [7] A Disentangled Representations based Unsupervised Deformable Framework for Cross-modality Image Registration
    Wu, Jiong
    Zhou, Shuang
    2021 43RD ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE & BIOLOGY SOCIETY (EMBC), 2021, : 3531 - 3534
  • [8] Smoothing the Disentangled Latent Style Space for Unsupervised Image-to-Image Translation
    Liu, Yahui
    Sangineto, Enver
    Chen, Yajing
    Bao, Linchao
    Zhang, Haoxian
    Sebe, Nicu
    Lepri, Bruno
    Wang, Wei
    De Nadai, Marco
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 10780 - 10789
  • [9] Multi-Level Variational Autoencoder: Learning Disentangled Representations from Grouped Observations
    Bouchacourt, Diane
    Tomioka, Ryota
    Nowozin, Sebastian
    THIRTY-SECOND AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTIETH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / EIGHTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2018, : 2095 - 2102
  • [10] Unsupervised White Blood Cell characterization in the latent space of a Variational Autoencoder
    Tarquino, Jonathan
    Romero, Eduardo
    18TH INTERNATIONAL SYMPOSIUM ON MEDICAL INFORMATION PROCESSING AND ANALYSIS, 2023, 12567