Robust unsupervised image categorization based on variational autoencoder with disentangled latent representations

被引:5
|
作者
Yang, Lin [1 ]
Fan, Wentao [1 ]
Bouguila, Nizar [2 ]
机构
[1] Huaqiao Univ, Dept Comp Sci & Technol, Xiamen, Peoples R China
[2] Concordia Univ, Concordia Inst Informat Syst Engn CIISE, Montreal, PQ, Canada
基金
中国国家自然科学基金;
关键词
Clustering; Variational autoencoder (VAE); Disentangled latent representations; Robust training; Mixture model; Student's-t distribution;
D O I
10.1016/j.knosys.2022.108671
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recently, deep generative models have been successfully applied to unsupervised clustering analyses, due to the model capabilities for learning good representations of the input data from a lower dimensional latent space. In this work, we propose a robust deep generative clustering method based on a variational autoencoder (VAE) for unsupervised image categorization. The merits of our method can be summarized as follows. First, each latent representation generated by the encoder is disentangled into the cluster representation and generation representation, where the cluster representation is responsible for preserving the clustering information, while the generation representation is responsible for conserving the generation information. Thus, by only utilizing the cluster representation, we can improve the performance and efficiency of clustering tasks without interference from generating tasks. Second, a Student's-t mixture model is adopted as the prior over the cluster representation to enhance the robustness of our method against clustering outliers. Third, we propose a biaugmentation module to promote the training stability for our model. In contrast with most of the existing deep generative clustering methods that require a pretraining step to stabilize the training process, our model is able to provide a stable training process through feature disentanglement and data augmentation. We validate the proposed robust deep generative clustering method through extensive experiments by comparing it with state-of-the-art methods on unsupervised image categorization. (C) 2022 Elsevier B.V. All rights reserved.
引用
收藏
页数:9
相关论文
共 50 条
  • [21] Unsupervised varistor surface defect detection based on variational autoencoder
    Tang S.
    Chen M.
    Wang H.
    Zhang X.
    Zhang Y.
    Jisuanji Jicheng Zhizao Xitong/Computer Integrated Manufacturing Systems, CIMS, 2022, 28 (05): : 1337 - 1351
  • [22] VARIATIONAL AUTOENCODER BASED UNSUPERVISED DOMAIN ADAPTATION FOR SEMANTIC SEGMENTATION
    Li, Zongyao
    Togo, Ren
    Ogawa, Takahiro
    Haseyama, Miki
    2020 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2020, : 2426 - 2430
  • [23] USID-Net: Unsupervised Single Image Dehazing Network via Disentangled Representations
    Li, Jiafeng
    Li, Yaopeng
    Zhuo, Li
    Kuang, Lingyan
    Yu, Tianjian
    IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 3587 - 3601
  • [24] Medical Image Compression Based on Variational Autoencoder
    Liu, Xuan
    Zhang, Lu
    Guo, Zihao
    Han, Tailin
    Ju, Mingchi
    Xu, Bo
    Liu, Hong
    MATHEMATICAL PROBLEMS IN ENGINEERING, 2022, 2022
  • [25] Unsupervised Real Image Super-Resolution via Generative Variational AutoEncoder
    Liu, Zhi-Song
    Siu, Wan-Chi
    Wang, Li-Wen
    Li, Chu-Tak
    Cani, Marie-Paule
    Chan, Yui-Lam
    2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2020), 2020, : 1788 - 1797
  • [26] Riesz-Quincunx-UNet Variational Autoencoder for Unsupervised Satellite Image Denoising
    Thai, Duy H.
    Fei, Xiqi
    Le, Minh Tri
    Zufle, Andreas
    Wessels, Konrad
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61
  • [27] ENSEMBLE OF UNSUPERVISED LEARNED IMAGE REPRESENTATIONS BASED ON VARIATIONAL AUTOENCODERS FOR LUNG ADENOCARCINOMA SUBTYPE DIFFERENTIATION
    Cano, Fabian
    Alvarez-Jimenez, Charlems
    Romero, Eduardo
    Cruz-Roa, Angel
    2023 IEEE 20TH INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING, ISBI, 2023,
  • [28] Flotation froth image deblurring algorithm based on disentangled representations
    Huang, Xianwu
    Wang, Yuxiao
    Cao, Zhao
    Shang, Haili
    Zhang, Jinshan
    Yu, Dahua
    JOURNAL OF ELECTRONIC IMAGING, 2024, 33 (03)
  • [29] Towards Robust and Semantically Organised Latent Representations for Unsupervised Text Style Transfer
    Narasimhan, Sharan
    Dey, Suvodip
    Desarkar, Maunendra
    NAACL 2022: THE 2022 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES, 2022, : 475 - 493
  • [30] Tri-Modal Joint Inversion Based on Disentangled Variational Autoencoder for Human Thorax Imaging
    Lin, Zhichao
    Guo, Rui
    Zhang, Ke
    Zhang, Haolin
    Li, Maokun
    Yang, Fan
    Xu, Shenheng
    Abubakar, Aria
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2023, 72