Learning Disentangled Discrete Representations

被引：1

作者：

Friede, David ^{[1
]}

Reimers, Christian ^{[2
]}

Stuckenschmidt, Heiner ^{[1
]}

Niepert, Mathias ^{[3
,4
]}

机构：

[1] Univ Mannheim, Mannheim, Germany

[2] Max Planck Inst Biogeochem, Jena, Germany

[3] Univ Stuttgart, Stuttgart, Germany

[4] NEC Labs Europe, Heidelberg, Germany

来源：

MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES: RESEARCH TRACK, ECML PKDD 2023, PT IV | 2023年 / 14172卷

关键词：

Categorical VAE; Disentanglement;

D O I：

10.1007/978-3-031-43421-1_35

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Recent successes in image generation, model-based reinforcement learning, and text-to-image generation have demonstrated the empirical advantages of discrete latent representations, although the reasons behind their benefits remain unclear. We explore the relationship between discrete latent spaces and disentangled representations by replacing the standard Gaussian variational autoencoder (VAE) with a tailored categorical variational autoencoder. We show that the underlying grid structure of categorical distributions mitigates the problem of rotational invariance associated with multivariate Gaussian distributions, acting as an efficient inductive prior for disentangled representations. We provide both analytical and empirical findings that demonstrate the advantages of discrete VAEs for learning disentangled representations. Furthermore, we introduce the first unsupervised model selection strategy that favors disentangled representations.

引用

页码：593 / 609

页数：17

共 50 条

[31] Learning Disentangled Representations and Group Structure of Dynamical Environments
Quessard, Robin
Barrett, Thomas D.
Clements, William R.
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
[32] Towards a Unified Framework of Contrastive Learning for Disentangled Representations
Matthes, Stefan
Han, Zhiwei
Shen, Hao
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
[33] Disentangled Relational Representations for Explaining and Learning from Demonstration
Hristov, Yordan
Angelov, Daniel
Burke, Michael
Lascarides, Alex
Ramamoorthy, Subramanian
CONFERENCE ON ROBOT LEARNING, VOL 100, 2019, 100
[34] Adversarial Learning of Disentangled and Generalizable Representations of Visual Attributes
Oldfield, James
Panagakis, Yannis
Nicolaou, Mihalis A.
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 33 (08) : 3498 - 3509
[35] Learning Interpretable Disentangled Representations Using Adversarial VAEs
Sarhan, Mhd Hasan
Eslami, Abouzar
Navab, Nassir
Albarqouni, Shadi
DOMAIN ADAPTATION AND REPRESENTATION TRANSFER AND MEDICAL IMAGE LEARNING WITH LESS LABELS AND IMPERFECT DATA, DART 2019, MIL3ID 2019, 2019, 11795 : 37 - 44
[36] Learning Disentangled Representations of Texts with Application to Biomedical Abstracts
Jain, Sarthak
Banner, Edward
van de Meent, Jan-Willem
Marshall, Iain J.
Wallace, Byron C.
2018 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2018), 2018, : 4683 - 4693
[37] Learning Disentangled Representations of Satellite Image Time Series
Sanchez, Eduardo H.
Serrurier, Mathieu
Ortner, Mathias
MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2019, PT III, 2020, 11908 : 306 - 321
[38] On learning disentangled representations for individual treatment effect estimation
Chu, Jiebin
Sun, Zhoujian
Dong, Wei
Shi, Jinlong
Huang, Zhengxing
JOURNAL OF BIOMEDICAL INFORMATICS, 2021, 124
[39] Speech Resynthesis from Discrete Disentangled Self-Supervised Representations
Polyak, Adam
Adi, Yossi
Copet, Jade
Kharitonov, Eugene
Lakhotia, Kushal
Hsu, Wei-Ning
Mohamed, Abdelrahman
Dupoux, Emmanuel
INTERSPEECH 2021, 2021, : 3615 - 3619
[40] On the Fairness of Disentangled Representations
Locatello, Francesco
Abbati, Gabriele
Rainforth, Tom
Bauer, Stefan
Scholkopf, Bernhard
Bachem, Olivier
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32

← 1 2 3 4 5 →