Style-Guided Inference of Transformer for High-resolution Image Synthesis

被引:0
|
作者
Yim, Jonghwa [1 ]
Kim, Minjae [1 ]
机构
[1] NCSOFT, AI Ctr, Vis AI Lab, Seoul, South Korea
关键词
D O I
10.1109/WACV56688.2023.00179
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Transformer is eminently suitable for auto-regressive image synthesis which predicts discrete value from the past values recursively to make up full image. Especially, combined with vector quantised latent representation, the state-of-the-art auto-regressive transformer displays realistic high-resolution images. However, sampling the latent code from discrete probability distribution makes the output unpredictable. Therefore, it requires to generate lots of diverse samples to acquire desired outputs. To alleviate the process of generating lots of samples repetitively, in this article, we propose to take a desired output, a style image, as an additional condition without re-training the transformer. To this end, our method transfers the style to a probability constraint to re-balance the prior, thereby specifying the target distribution instead of the original prior. Thus, generated samples from the re-balanced prior have similar styles to reference style. In practice, we can choose either an image or a category of images as an additional condition. In our qualitative assessment, we show that styles of majority of outputs are similar to the input style.
引用
收藏
页码:1745 / 1755
页数:11
相关论文
共 50 条
  • [31] Mask Embedding for Realistic High-Resolution Medical Image Synthesis
    Ren, Yinhao
    Zhu, Zhe
    Li, Yingzhou
    Kong, Dehan
    Hou, Rui
    Grimm, Lars J.
    Marks, Jeffery R.
    Lo, Joseph Y.
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2019, PT VI, 2019, 11769 : 422 - 430
  • [32] Development of a high-resolution objective for an IR image synthesis system
    Soldatenko, A., V
    Verhoglyad, A. G.
    Zav'yalov, P. S.
    Stupak, M. F.
    Maximov, A. G.
    Mareeva, N. E.
    JOURNAL OF OPTICAL TECHNOLOGY, 2020, 87 (02) : 100 - 104
  • [33] Segmentation of high-resolution CT images for image-guided spinal surgery
    Williams, M
    Bouchet, L
    Bova, F
    Friedman, WA
    MEDICAL PHYSICS, 2002, 29 (06) : 1322 - 1322
  • [34] A HIGH-RESOLUTION IMAGE SENSOR
    EASTMAN, FH
    JOURNAL OF THE SOCIETY OF MOTION PICTURE TELEVISION ENGINEERS, 1970, 79 (01): : 10 - &
  • [35] A Novel Lightweight Attention-Discarding Transformer for High-Resolution SAR Image Classification
    Liu, Xingyu
    Wu, Yan
    Hu, Xin
    Li, Zhikang
    Li, Ming
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2023, 20
  • [36] HIGH-RESOLUTION GUIDED RADAR SYSTEM
    MACKAY, NA
    BEATTIE, DG
    ELECTRONICS LETTERS, 1976, 12 (22) : 583 - 584
  • [37] Generation of Orthoimage from High-Resolution DEM and High-Resolution Image
    Saati, M.
    Amini, J.
    Sadeghian, S.
    Hosseini, S. A.
    SCIENTIA IRANICA, 2008, 15 (05) : 568 - 574
  • [38] High-resolution knee plain radiography image synthesis using style generative adversarial network adaptive discriminator augmentation
    Gun Ahn
    Choi, Byung S.
    Ko, Sunho
    Jo, Changwung
    Han, Hyuk-Soo
    Lee, Myung Chul
    Du Hyun Ro
    JOURNAL OF ORTHOPAEDIC RESEARCH, 2023, 41 (01) : 84 - 93
  • [39] High-resolution guided wave tomography
    Huthwaite, Peter
    Simonetti, Francesco
    WAVE MOTION, 2013, 50 (05) : 979 - 993
  • [40] HRFormer: High-Resolution Transformer for Dense Prediction
    Yuan, Yuhui
    Fu, Rao
    Huang, Lang
    Lin, Weihong
    Zhang, Chao
    Chen, Xilin
    Wang, Jingdong
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34