VAE-GNA: a variational autoencoder with Gaussian neurons in the latent space and attention mechanisms

被引:2
|
作者
Rocha, Matheus B. [1 ,2 ]
Krohling, Renato A. [1 ,2 ]
机构
[1] Univ Fed Espirito Santo, Labcin Nat Inspired Comp Lab, BR-29075910 Vitoria, Brazil
[2] Univ Fed Espirito Santo, Grad Program Comp Sci, BR-29075910 Vitoria, Brazil
关键词
Variational autoencoder (VAE); Gaussian neurons; Attention layer; Skin lesions; Near-infrared (NIR) spectroscopy; Cancer detection; SPECTROSCOPY; CANCER;
D O I
10.1007/s10115-024-02169-5
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Variational autoencoders (VAEs) are generative models known for learning compact and continuous latent representations of data. While they have proven effective in various applications, using latent representations for classification tasks presents challenges. Typically, a straightforward approach involves concatenating the mean and variance vectors and inputting them into a shallow neural network. In this paper, we introduce a novel approach for variational autoencoders, named VAE-GNA, which integrates Gaussian neurons into the latent space along with attention mechanisms. These neurons directly process mean and variance values through a suitable modified sigmoid function, not only improving classification, but also optimizing the training of the VAE in extracting features, in synergy with the classification network. Additionally, we investigate both additive and multiplicative attention mechanisms to enhance the model's capabilities. We applied the proposed method to automatic cancer detection using near-infrared (NIR) spectral data, showing that the experimental results of VAE-GNA surpass established baselines for spectral datasets. The results obtained indicate the feasibility and effectiveness of our approach.
引用
收藏
页码:6415 / 6437
页数:23
相关论文
共 37 条
  • [31] Unsupervised classification of multi-contrast magnetic resonance histology of peripheral arterial disease lesions using a convolutional variational autoencoder with a Gaussian mixture model in latent space: A technical feasibility study
    Csore, Judit
    Roy, Trisha L.
    Wright, Graham
    Karmonik, Christof
    COMPUTERIZED MEDICAL IMAGING AND GRAPHICS, 2024, 115
  • [32] Efficient and expressive high-resolution image synthesis via variational autoencoder-enriched transformers with sparse attention mechanisms
    Tang, Bingyin
    Feng, Fan
    JOURNAL OF ELECTRONIC IMAGING, 2024, 33 (03)
  • [33] A supervised subtype differentiation learning for building invariant features of non-small cell lung cancer in a latent space of a Variational Autoencoder
    Cano, Fabian
    Cruz-Roa, Angel
    Alvarez-Jimenez, Charlens
    Becerra, David
    Siabatto, Andres
    Romero, Eduardo
    17TH INTERNATIONAL SYMPOSIUM ON MEDICAL INFORMATION PROCESSING AND ANALYSIS, 2021, 12088
  • [34] Multi-modal hybrid modeling strategy based on Gaussian Mixture Variational Autoencoder and spatial–temporal attention: Application to industrial process prediction
    Peng, Haifei
    Long, Jian
    Huang, Cheng
    Wei, Shibo
    Ye, Zhencheng
    Chemometrics and Intelligent Laboratory Systems, 2024, 244
  • [35] Multi-modal hybrid modeling strategy based on Gaussian Mixture Variational Autoencoder and spatial-temporal attention: Application to industrial process prediction
    Peng, Haifei
    Long, Jian
    Huang, Cheng
    Wei, Shibo
    Ye, Zhencheng
    CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2024, 244
  • [36] Learning Cell-Type-Specific Gene Regulation Mechanisms by Multi-Attention Based Deep Learning With Regulatory Latent Space
    Kang, Minji
    Lee, Sangseon
    Lee, Dohoon
    Kim, Sun
    FRONTIERS IN GENETICS, 2020, 11
  • [37] Mutation-Based Multivariate Time-Series Anomaly Generation on Latent Space with an Attention-Based Variational Recurrent Neural Network for Robust Anomaly Detection in an Industrial Control System
    Jeon, Seungho
    Koo, Kijong
    Moon, Daesung
    Seo, Jung Taek
    APPLIED SCIENCES-BASEL, 2024, 14 (17):