Bimodal Neural Style Transfer for Image Generation Based on Text Prompts

被引:0
|
作者
Gutierrez, Diego [1 ]
Mendoza, Marcelo [2 ]
机构
[1] Univ Tecn Federico Santa Maria, Dept Informat, Av Vicuna Mackenna 3939, Santiago, Chile
[2] Pontificia Univ Catolica Chile, Dept Comp Sci, Av Vicuna Mackenna 6840, Santiago, Chile
来源
关键词
Generative models; Creative AI; Image generation;
D O I
10.1007/978-3-031-34732-0_29
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Neural networks have become one of the essential areas in Artificial Intelligence due to their extraordinary capacity to address problems in different domains. This ability led to the proposal of novel architectures and models to tackle challenging tasks such as neural style transfer. We propose a novel methodology for bimodal style transfer using text as input. We initially retrieve one image and a short descriptive text, which are mapped into a multimodal common latent space. Then, a new image is retrieved using an image retrieval engine. Finally, we use a generative model, which allows us to create artistic images by combining content and style. The proposed system can retrieve semantically similar images concerning a descriptive text (prompt), achieving great precision rates in image retrieval applied to the SemArt dataset. The transfer style neural model also preserves the image's high quality, combining style and content.
引用
收藏
页码:379 / 390
页数:12
相关论文
共 50 条
  • [1] Learning to Transfer Prompts for Text Generation
    Li, Junyi
    Tang, Tianyi
    Nie, Jian-Yun
    Wen, Ji-Rong
    Zhaol, Wayne Xin
    NAACL 2022: THE 2022 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES, 2022, : 3506 - 3518
  • [2] Optimizing Prompts for Text-to-Image Generation
    Hao, Yaru
    Chi, Zewen
    Dong, Li
    Wei, Furu
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [3] Text-based Image Style Transfer and Synthesis
    He, Yifan
    Li, Jian
    Zhu, Anna
    2019 INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION WORKSHOPS (ICDARW) AND 8TH INTERNATIONAL WORKSHOP ON CAMERA-BASED DOCUMENT ANALYSIS AND RECOGNITION, VOL 4, 2019, : 43 - 48
  • [4] A New Approach for Text Style Transfer Based on Deletion and Generation
    Liu, Guoqiang
    Xiao, Ruochen
    Li, Wenkai
    Zhang, Youcheng
    2022 EURO-ASIA CONFERENCE ON FRONTIERS OF COMPUTER SCIENCE AND INFORMATION TECHNOLOGY, FCSIT, 2022, : 6 - 11
  • [5] Emergent Text-to-Image Generation Using Short Neologism Prompts and Negative Prompts
    Kanada, Yasusi
    2024 NICOGRAPH INTERNATIONAL, NICOINT 2024, 2024, : 86 - 86
  • [6] Image-based CAPTCHAs based on neural style transfer
    Cheng, Zhouhang
    Gao, Haichang
    Liu, Zhongyu
    Wu, Huaxi
    Zi, Yang
    Pei, Ge
    IET INFORMATION SECURITY, 2019, 13 (06) : 519 - 529
  • [7] Material Translation Based on Neural Style Transfer with Ideal Style Image Retrieval
    Benitez-Garcia, Gibran
    Takahashi, Hiroki
    Yanai, Keiji
    SENSORS, 2022, 22 (19)
  • [8] Image neural style transfer: A review*
    Cai, Qiang
    Ma, Mengxu
    Wang, Chen
    Li, Haisheng
    COMPUTERS & ELECTRICAL ENGINEERING, 2023, 108
  • [9] The Contour Image Style Transfer based Convolutional Neural Network
    Deng, Nan
    Li, Jing
    Wang, Xingce
    Wu, Zhongke
    Fu, Yan
    Shui, Wuyang
    Zhou, Mingquan
    Korkhov, Vladimir
    Gaspary, Luciano Paschoal
    INTERNATIONAL WORKSHOP ON ADVANCED IMAGE TECHNOLOGY (IWAIT) 2019, 2019, 11049
  • [10] Image Augmentation with Neural Style Transfer
    Georgievski, Borijan
    ICT INNOVATIONS 2019: BIG DATA PROCESSING AND MINING, 2019, 1110 : 212 - 224