HRInversion: High-Resolution GAN Inversion for Cross-Domain Image Synthesis

被引:9
|
作者
Zhou, Peng [1 ]
Xie, Lingxi [2 ]
Ni, Bingbing [1 ]
Liu, Lin [3 ]
Tian, Qi [2 ]
机构
[1] Shanghai Jiao Tong Univ, Dept Elect Engn, Shanghai 200240, Peoples R China
[2] Huawei Cloud BU, Guangdong518129, Shenzhen, Peoples R China
[3] Univ Sci & Technol China, Dept Elect Engn & Informat Sci, Hefei 230052, Anhui, Peoples R China
基金
美国国家科学基金会;
关键词
Image reconstruction; Image resolution; Generative adversarial networks; Task analysis; Semantics; Generators; Image synthesis; GAN inversion; perceptual loss; image synthesis;
D O I
10.1109/TCSVT.2022.3222456
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
We investigate GAN inversion problems of using pre-trained GANs to reconstruct real images. Recent methods for such problems typically employ a VGG perceptual loss to measure the difference between images. While the perceptual loss has achieved remarkable success in various computer vision tasks, it may cause unpleasant artifacts and is sensitive to changes in input scale. This paper delivers an important message that algorithm details are crucial for achieving satisfying performance. In particular, we propose two important but undervalued design principles: (i) not down-sampling the input of the perceptual loss to avoid high-frequency artifacts; and (ii) calculating the perceptual loss using convolutional features which are robust to scale. Integrating these designs derives the proposed framework, HRInversion, that achieves superior performance in reconstructing image details. We validate the effectiveness of HRInversion on a cross-domain image synthesis task and propose a post-processing approach named local style optimization (LSO) to synthesize clean and controllable stylized images. For the evaluation of the cross-domain images, we introduce a metric named ID retrieval which captures the similarity of face identities of stylized images to content images. We also test HRInversion on non-square images. Equipped with implicit neural representation, HRInversion applies to ultra-high resolution images with more than 10 million pixels. Furthermore, we show applications of style transfer and 3D-aware GAN inversion, paving the way for extending the application range of HRInversion.
引用
收藏
页码:2147 / 2161
页数:15
相关论文
共 50 条
  • [31] Survey on clothing image retrieval with cross-domain
    Chen Ning
    Yang Di
    Li Menglu
    Complex & Intelligent Systems, 2022, 8 : 5531 - 5544
  • [32] Learning High-Resolution Domain-Specific Representations with a GAN Generator
    Galeev, Danil
    Sofiiuk, Konstantin
    Rukhovich, Danila
    Romanov, Mikhail
    Barinova, Olga
    Konushin, Anton
    STRUCTURAL, SYNTACTIC, AND STATISTICAL PATTERN RECOGNITION, S+SSPR 2020, 2021, 12644 : 108 - 118
  • [33] PuppetGAN: Cross-Domain Image Manipulation by Demonstration
    Usman, Ben
    Dufour, Nick
    Saenko, Kate
    Bregler, Chris
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 9449 - 9457
  • [34] Cross-Domain Image Retrieval with Attention Modeling
    Ji, Xin
    Wang, Wei
    Zhang, Meihui
    Yang, Yang
    PROCEEDINGS OF THE 2017 ACM MULTIMEDIA CONFERENCE (MM'17), 2017, : 1654 - 1662
  • [35] Dual Learning for Cross-domain Image Captioning
    Zhao, Wei
    Xu, Wei
    Yang, Min
    Ye, Jianbo
    Zhao, Zhou
    Feng, Yabing
    Qiao, Yu
    CIKM'17: PROCEEDINGS OF THE 2017 ACM CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, 2017, : 29 - 38
  • [36] Cross-Domain Image Steganography Based on GANs
    Wang, Yaojie
    Yang, Xiaoyuan
    Liu, Jia
    SECURITY WITH INTELLIGENT COMPUTING AND BIG-DATA SERVICES, 2020, 895 : 743 - 755
  • [37] A Bio-Inspired Visual Perception Transformer for Cross-Domain Semantic Segmentation of High-Resolution Remote Sensing Images
    Wang, Xinyao
    Wang, Haitao
    Jing, Yuqian
    Yang, Xianming
    Chu, Jianbo
    REMOTE SENSING, 2024, 16 (09)
  • [38] AN IMAGE OF THE COLUMBIA PLATEAU FROM INVERSION OF HIGH-RESOLUTION SEISMIC DATA
    LUTTER, WJ
    CATCHINGS, RD
    JARCHOW, CM
    GEOPHYSICS, 1994, 59 (08) : 1278 - 1289
  • [39] Unsupervised Domain Adaptation for Cross-domain Histopathology Image Classification
    Xiangning Li
    Chen Pan
    Lingmin He
    Xinyu Li
    Multimedia Tools and Applications, 2024, 83 : 23311 - 23331
  • [40] A DISCRIMINATIVE DOMAIN ADAPTATION MODEL FOR CROSS-DOMAIN IMAGE CLASSIFICATION
    Chou, Yen-Cheng
    Wei, Chia-Po
    Wang, Yu-Chiang Frank
    2013 20TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP 2013), 2013, : 3083 - 3087