Semantic Image Synthesis with Spatially-Adaptive Normalization

被引:1690
|
作者
Park, Taesung [1 ]
Liu, Ming-Yu [2 ]
Wang, Ting-Chun [2 ]
Zhu, Jun-Yan [2 ,3 ]
机构
[1] Univ Calif Berkeley, Berkeley, CA 94720 USA
[2] NVIDIA, Santa Clara, CA USA
[3] MIT CSAIL, Cambridge, MA USA
关键词
D O I
10.1109/CVPR.2019.00244
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We propose spatially-adaptive normalization, a simple but effective layer for synthesizing photorealistic images given an input semantic layout. Previous methods directly feed the semantic layout as input to the deep network, which is then processed through stacks of convolution, normalization, and nonlinearity layers. We show that this is suboptimal as the normalization layers tend to "wash away" semantic information. To address the issue, we propose using the input layout. for modulating the activations in normalization layers through a spatially-adaptive,learned transformation. Experiments on several challenging datasets demonstrate the advantage of the proposed method over existing approaches, regarding both visual fidelity and align-ment with input layouts. Finally, our model allows user control over both semantic and style as synthesizing images.
引用
收藏
页码:2332 / 2341
页数:10
相关论文
共 50 条
  • [41] Energy-Efficient Spatially-Adaptive Clustering and Routing in Wireless Sensor Networks
    Long, Hengyu
    Liu, Yongpan
    Fan, Xiaoguang
    Dick, Robert P.
    Yang, Huazhong
    DATE: 2009 DESIGN, AUTOMATION & TEST IN EUROPE CONFERENCE & EXHIBITION, VOLS 1-3, 2009, : 1267 - +
  • [42] Subband-adaptive and spatially-adaptive wavelet thresholding for denoising and feature preservation of texture images
    Li, J.
    Mohamed, S. S.
    Salama, M. M. A.
    Freeman, G. H.
    IMAGE ANALYSIS AND RECOGNITION, PROCEEDINGS, 2007, 4633 : 24 - 37
  • [43] PLACE: Adaptive Layout-Semantic Fusion for Semantic Image Synthesis
    Lv, Zhengyao
    Wei, Yuxiang
    Zuo, Wangmeng
    Wong, Kwan-Yee K.
    2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2024, : 9264 - 9268
  • [44] Semantic-shape Adaptive Feature Modulation for Semantic Image Synthesis
    Lv, Zhengyao
    Li, Xiaoming
    Niu, Zhenxing
    Cao, Bing
    Zuo, Wangmeng
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 11204 - 11213
  • [45] Unsupervised Deep Asymmetric Stereo Matching with Spatially-Adaptive Self-Similarity
    Song, Taeyong
    Kim, Sunok
    Sohn, Kwanghoon
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 13672 - 13680
  • [46] Spatially-adaptive analytical reconstruction of quantitative gated cardiac SPECT in KL domain
    Fan, Yi
    Lu, Hongbing
    Liu, Xin
    Wang, Shuyi
    Liang, Zhengrong
    2007 IEEE NUCLEAR SCIENCE SYMPOSIUM CONFERENCE RECORD, VOLS 1-11, 2007, : 3889 - +
  • [47] Generative Adversarial Networks with Bi-directional Normalization for Semantic Image Synthesis
    Long, Jia
    Lu, Hongtao
    PROCEEDINGS OF THE 2021 INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL (ICMR '21), 2021, : 219 - 226
  • [48] Spatially-adaptive regularized pel-recursive motion estimation based on the EM algorithm
    Estrela, VV
    Galatsanos, NP
    IMAGE AND VIDEO COMMUNICATIONS AND PROCESSING 2000, 2000, 3974 : 372 - 383
  • [49] DRAN: Detailed Region-Adaptive Normalization for Conditional Image Synthesis
    Lyu, Yueming
    Chen, Peibin
    Sun, Jingna
    Peng, Bo
    Wang, Xu
    Dong, Jing
    IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 1969 - 1982
  • [50] Spatially-adaptive bases in wavelet-based coding of semi-regular meshes
    Denis, Leon
    Florea, Ruxandra
    Munteanu, Adrian
    Schelkens, Peter
    OPTICS, PHOTONICS, AND DIGITAL TECHNOLOGIES FOR MULTIMEDIA APPLICATIONS, 2010, 7723