HRInversion: High-Resolution GAN Inversion for Cross-Domain Image Synthesis

被引:9
|
作者
Zhou, Peng [1 ]
Xie, Lingxi [2 ]
Ni, Bingbing [1 ]
Liu, Lin [3 ]
Tian, Qi [2 ]
机构
[1] Shanghai Jiao Tong Univ, Dept Elect Engn, Shanghai 200240, Peoples R China
[2] Huawei Cloud BU, Guangdong518129, Shenzhen, Peoples R China
[3] Univ Sci & Technol China, Dept Elect Engn & Informat Sci, Hefei 230052, Anhui, Peoples R China
基金
美国国家科学基金会;
关键词
Image reconstruction; Image resolution; Generative adversarial networks; Task analysis; Semantics; Generators; Image synthesis; GAN inversion; perceptual loss; image synthesis;
D O I
10.1109/TCSVT.2022.3222456
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
We investigate GAN inversion problems of using pre-trained GANs to reconstruct real images. Recent methods for such problems typically employ a VGG perceptual loss to measure the difference between images. While the perceptual loss has achieved remarkable success in various computer vision tasks, it may cause unpleasant artifacts and is sensitive to changes in input scale. This paper delivers an important message that algorithm details are crucial for achieving satisfying performance. In particular, we propose two important but undervalued design principles: (i) not down-sampling the input of the perceptual loss to avoid high-frequency artifacts; and (ii) calculating the perceptual loss using convolutional features which are robust to scale. Integrating these designs derives the proposed framework, HRInversion, that achieves superior performance in reconstructing image details. We validate the effectiveness of HRInversion on a cross-domain image synthesis task and propose a post-processing approach named local style optimization (LSO) to synthesize clean and controllable stylized images. For the evaluation of the cross-domain images, we introduce a metric named ID retrieval which captures the similarity of face identities of stylized images to content images. We also test HRInversion on non-square images. Equipped with implicit neural representation, HRInversion applies to ultra-high resolution images with more than 10 million pixels. Furthermore, we show applications of style transfer and 3D-aware GAN inversion, paving the way for extending the application range of HRInversion.
引用
收藏
页码:2147 / 2161
页数:15
相关论文
共 50 条
  • [21] HIGH-RESOLUTION REFLECTIVITY INVERSION
    Nguyen, Thang
    Castagna, John
    JOURNAL OF SEISMIC EXPLORATION, 2010, 19 (04): : 303 - 320
  • [22] Cross-Domain Face Sketch Synthesis
    Zhang, Mingjin
    Zhang, Jing
    Chi, Yuan
    Li, Yunsong
    Wang, Nannan
    Gao, Xinbo
    IEEE ACCESS, 2019, 7 : 98866 - 98874
  • [23] Any-Resolution Training for High-Resolution Image Synthesis
    Chai, Lucy
    Gharbi, Michael
    Shechtman, Eli
    Isola, Phillip
    Zhang, Richard
    COMPUTER VISION - ECCV 2022, PT XVI, 2022, 13676 : 170 - 188
  • [24] Cross-domain learning for underwater image enhancement
    Li, Fei
    Zheng, Jiangbin
    Zhang, Yuan-fang
    Jia, Wenjing
    Wei, Qianru
    He, Xiangjian
    SIGNAL PROCESSING-IMAGE COMMUNICATION, 2023, 110
  • [25] Multitask Learning for Cross-Domain Image Captioning
    Yang, Min
    Zhao, Wei
    Xu, Wei
    Feng, Yabing
    Zhao, Zhou
    Chen, Xiaojun
    Lei, Kai
    IEEE TRANSACTIONS ON MULTIMEDIA, 2019, 21 (04) : 1047 - 1061
  • [26] Cross-domain image retrieval: methods and applications
    Xiaoping Zhou
    Xiangyu Han
    Haoran Li
    Jia Wang
    Xun Liang
    International Journal of Multimedia Information Retrieval, 2022, 11 : 199 - 218
  • [27] Cross-Domain Image Captioning with Discriminative Finetuning
    Dessi, Roberto
    Bevilacqua, Michele
    Gualdoni, Eleonora
    Carraz Rakotonirina, Nathanael
    Franzon, Francesca
    Baroni, Marco
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR, 2023, : 6935 - 6944
  • [28] CROSS-DOMAIN CNN FOR HYPERSPECTRAL IMAGE CLASSIFICATION
    Lee, Hyungtae
    Eum, Sungmin
    Kwon, Heesung
    IGARSS 2018 - 2018 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, 2018, : 3627 - 3630
  • [29] Survey on clothing image retrieval with cross-domain
    Chen Ning
    Yang Di
    Li Menglu
    COMPLEX & INTELLIGENT SYSTEMS, 2022, 8 (06) : 5531 - 5544
  • [30] Cross-domain image retrieval: methods and applications
    Zhou, Xiaoping
    Han, Xiangyu
    Li, Haoran
    Wang, Jia
    Liang, Xun
    INTERNATIONAL JOURNAL OF MULTIMEDIA INFORMATION RETRIEVAL, 2022, 11 (03) : 199 - 218