StyleAutoEncoder for Manipulating Image Attributes Using Pre-trained StyleGAN

Cited by: 0
Authors
Bedychaj, Andrzej [1]
Tabor, Jacek [1]
Smieja, Marek [1]
Affiliation
[1] Jagiellonian University, Krakow, Poland
DOI
10.1007/978-981-97-2253-2_10
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Deep conditional generative models are excellent tools for creating high-quality images and editing their attributes. However, training modern generative models from scratch is very expensive and requires large computational resources. In this paper, we introduce StyleAutoEncoder (StyleAE), a lightweight AutoEncoder module, which works as a plugin for pre-trained generative models and allows for manipulating the requested attributes of images. The proposed method offers a cost-effective solution for training deep generative models with limited computational resources, making it a promising technique for a wide range of applications. We evaluate StyleAE by combining it with StyleGAN, which is currently one of the top generative models. Our experiments demonstrate that StyleAE is at least as effective in manipulating image attributes as the state-of-the-art algorithms based on invertible normalizing flows. However, it is simpler, faster, and gives more freedom in designing neural architecture.
Pages: 118-130
Page count: 13