Learning Disentangled Discrete Representations

Cited by: 1
Authors
Friede, David [1]
Reimers, Christian [2]
Stuckenschmidt, Heiner [1]
Niepert, Mathias [3, 4]
Affiliations
[1] Univ Mannheim, Mannheim, Germany
[2] Max Planck Inst Biogeochem, Jena, Germany
[3] Univ Stuttgart, Stuttgart, Germany
[4] NEC Labs Europe, Heidelberg, Germany
Source
MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES: RESEARCH TRACK, ECML PKDD 2023, PT IV | 2023 / Vol. 14172
Keywords
Categorical VAE; Disentanglement
DOI
10.1007/978-3-031-43421-1_35
Chinese Library Classification number
TP18 [Artificial Intelligence Theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Recent successes in image generation, model-based reinforcement learning, and text-to-image generation have demonstrated the empirical advantages of discrete latent representations, although the reasons behind their benefits remain unclear. We explore the relationship between discrete latent spaces and disentangled representations by replacing the standard Gaussian variational autoencoder (VAE) with a tailored categorical variational autoencoder. We show that the underlying grid structure of categorical distributions mitigates the problem of rotational invariance associated with multivariate Gaussian distributions, acting as an efficient inductive prior for disentangled representations. We provide both analytical and empirical findings that demonstrate the advantages of discrete VAEs for learning disentangled representations. Furthermore, we introduce the first unsupervised model selection strategy that favors disentangled representations.
Pages: 593 - 609
Page count: 17
Related papers
50 in total
  • [31] Learning Disentangled Representations and Group Structure of Dynamical Environments
    Quessard, Robin
    Barrett, Thomas D.
    Clements, William R.
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
  • [32] Towards a Unified Framework of Contrastive Learning for Disentangled Representations
    Matthes, Stefan
    Han, Zhiwei
    Shen, Hao
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023
  • [33] Disentangled Relational Representations for Explaining and Learning from Demonstration
    Hristov, Yordan
    Angelov, Daniel
    Burke, Michael
    Lascarides, Alex
    Ramamoorthy, Subramanian
    CONFERENCE ON ROBOT LEARNING, VOL 100, 2019, 100
  • [34] Adversarial Learning of Disentangled and Generalizable Representations of Visual Attributes
    Oldfield, James
    Panagakis, Yannis
    Nicolaou, Mihalis A.
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 33 (08) : 3498 - 3509
  • [35] Learning Interpretable Disentangled Representations Using Adversarial VAEs
    Sarhan, Mhd Hasan
    Eslami, Abouzar
    Navab, Nassir
    Albarqouni, Shadi
    DOMAIN ADAPTATION AND REPRESENTATION TRANSFER AND MEDICAL IMAGE LEARNING WITH LESS LABELS AND IMPERFECT DATA, DART 2019, MIL3ID 2019, 2019, 11795 : 37 - 44
  • [36] Learning Disentangled Representations of Texts with Application to Biomedical Abstracts
    Jain, Sarthak
    Banner, Edward
    van de Meent, Jan-Willem
    Marshall, Iain J.
    Wallace, Byron C.
    2018 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2018), 2018 : 4683 - 4693
  • [37] Learning Disentangled Representations of Satellite Image Time Series
    Sanchez, Eduardo H.
    Serrurier, Mathieu
    Ortner, Mathias
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2019, PT III, 2020, 11908 : 306 - 321
  • [38] On learning disentangled representations for individual treatment effect estimation
    Chu, Jiebin
    Sun, Zhoujian
    Dong, Wei
    Shi, Jinlong
    Huang, Zhengxing
    JOURNAL OF BIOMEDICAL INFORMATICS, 2021, 124
  • [39] Speech Resynthesis from Discrete Disentangled Self-Supervised Representations
    Polyak, Adam
    Adi, Yossi
    Copet, Jade
    Kharitonov, Eugene
    Lakhotia, Kushal
    Hsu, Wei-Ning
    Mohamed, Abdelrahman
    Dupoux, Emmanuel
    INTERSPEECH 2021, 2021 : 3615 - 3619
  • [40] On the Fairness of Disentangled Representations
    Locatello, Francesco
    Abbati, Gabriele
    Rainforth, Tom
    Bauer, Stefan
    Scholkopf, Bernhard
    Bachem, Olivier
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32