Disentangling shared and private latent factors in multimodal Variational Autoencoders

被引:0
|
作者
Martens, Kaspar [1 ]
Yau, Christopher [1 ,2 ]
机构
[1] Univ Oxford, Oxford, England
[2] Hlth Data Res UK, Alan Turing Inst, London, England
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Generative models for multimodal data permit the identification of latent factors that may be associated with important determinants of observed data heterogeneity. Common or shared factors could be important for explaining variation across modalities whereas other factors may be private and important only for the explanation of a single modality. Multimodal Variational Autoencoders, such as MVAE and MMVAE, are a natural choice for inferring those underlying latent factors and separating shared variation from private. In this work, we investigate their capability to reliably perform this disentanglement. In particular, we highlight a challenging problem setting where modality-specific variation dominates the shared signal. Taking a cross-modal prediction perspective, we demonstrate limitations of existing models, and propose a modification how to make them more robust to modality-specific variation. Our findings are supported by experiments on synthetic as well as various real-world multi-omics data sets.
引用
收藏
页数:16
相关论文
共 50 条
  • [31] Extracting a biologically relevant latent space from cancer transcriptomes with variational autoencoders
    Way, Gregory P.
    Greene, Casey S.
    PACIFIC SYMPOSIUM ON BIOCOMPUTING 2018 (PSB), 2018, : 80 - 91
  • [32] Developing Soft-Sensor Models Using Latent Dynamic Variational Autoencoders
    Lee, Yi Shan
    Ooi, Sai Kit
    Tanny, Dave
    Chen, Junghui
    IFAC PAPERSONLINE, 2021, 54 (03): : 61 - 66
  • [33] Dynamic Movement Primitives in Latent Space of Time-Dependent Variational Autoencoders
    Chen, Nutan
    Karl, Maximilian
    van der Smagt, Patrick
    2016 IEEE-RAS 16TH INTERNATIONAL CONFERENCE ON HUMANOID ROBOTS (HUMANOIDS), 2016, : 629 - 636
  • [34] One-class learning for fake news detection through multimodal variational autoencoders
    Golo, Marcos Paulo Silva
    de Souza, Mariana Caravanti
    Rossi, Rafael Geraldeli
    Rezende, Solange Oliveira
    Nogueira, Bruno Magalhaes
    Marcacini, Ricardo Marcondes
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 122
  • [35] Chest X-Rays Image Classification from β-Variational Autoencoders Latent Features
    Crespi, Leonardo
    Loiacono, Daniele
    Chiti, Arturo
    2021 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (IEEE SSCI 2021), 2021,
  • [36] ANOMALY DETECTION THROUGH LATENT SPACE RESTORATION USING VECTOR QUANTIZED VARIATIONAL AUTOENCODERS
    Marimont, Sergio Naval
    Tarroni, Giacomo
    2021 IEEE 18TH INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING (ISBI), 2021, : 1764 - 1767
  • [37] Optimizing Satellite Image Analysis: Leveraging Variational Autoencoders Latent Representations for Direct Integration
    Giuliano, Alessandro
    Gadsden, S. Andrew
    Yawney, John
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2025, 63
  • [38] Into the latent space of capacitive sensors: interpolation and synthetic data generation using variational autoencoders
    Honrubia, Miguel Monteagudo
    Herraiz-Martinez, Francisco Javier
    Domingo, Javier Matanza
    MACHINE LEARNING-SCIENCE AND TECHNOLOGY, 2025, 6 (01):
  • [39] Multimodal Disentanglement Variational AutoEncoders for Zero-Shot Cross-Modal Retrieval
    Tian, Jialin
    Wang, Kai
    Xu, Xing
    Cao, Zuo
    Shen, Fumin
    Shen, Heng Tao
    PROCEEDINGS OF THE 45TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR '22), 2022, : 960 - 969
  • [40] Fourier (Common-Tone) Phase Spaces are in Tune with Variational Autoencoders' Latent Space
    Carvalho, Nadia
    Bernardes, Gilberto
    MATHEMATICS AND COMPUTATION IN MUSIC, MCM 2024, 2024, 14639 : 305 - 316