Semantic Image Synthesis with Spatially-Adaptive Normalization

被引：1690

作者：

Park, Taesung ^{[1
]}

Liu, Ming-Yu ^{[2
]}

Wang, Ting-Chun ^{[2
]}

Zhu, Jun-Yan ^{[2
,3
]}

机构：

[1] Univ Calif Berkeley, Berkeley, CA 94720 USA

[2] NVIDIA, Santa Clara, CA USA

[3] MIT CSAIL, Cambridge, MA USA

来源：

2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019) | 2019年

关键词：

D O I：

10.1109/CVPR.2019.00244

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

We propose spatially-adaptive normalization, a simple but effective layer for synthesizing photorealistic images given an input semantic layout. Previous methods directly feed the semantic layout as input to the deep network, which is then processed through stacks of convolution, normalization, and nonlinearity layers. We show that this is suboptimal as the normalization layers tend to "wash away" semantic information. To address the issue, we propose using the input layout. for modulating the activations in normalization layers through a spatially-adaptive,learned transformation. Experiments on several challenging datasets demonstrate the advantage of the proposed method over existing approaches, regarding both visual fidelity and align-ment with input layouts. Finally, our model allows user control over both semantic and style as synthesizing images.

引用

页码：2332 / 2341

页数：10

共 50 条

[41] Energy-Efficient Spatially-Adaptive Clustering and Routing in Wireless Sensor Networks
Long, Hengyu
Liu, Yongpan
Fan, Xiaoguang
Dick, Robert P.
Yang, Huazhong
DATE: 2009 DESIGN, AUTOMATION & TEST IN EUROPE CONFERENCE & EXHIBITION, VOLS 1-3, 2009, : 1267 - +
[42] Subband-adaptive and spatially-adaptive wavelet thresholding for denoising and feature preservation of texture images
Li, J.
Mohamed, S. S.
Salama, M. M. A.
Freeman, G. H.
IMAGE ANALYSIS AND RECOGNITION, PROCEEDINGS, 2007, 4633 : 24 - 37
[43] PLACE: Adaptive Layout-Semantic Fusion for Semantic Image Synthesis
Lv, Zhengyao
Wei, Yuxiang
Zuo, Wangmeng
Wong, Kwan-Yee K.
2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2024, : 9264 - 9268
[44] Semantic-shape Adaptive Feature Modulation for Semantic Image Synthesis
Lv, Zhengyao
Li, Xiaoming
Niu, Zhenxing
Cao, Bing
Zuo, Wangmeng
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 11204 - 11213
[45] Unsupervised Deep Asymmetric Stereo Matching with Spatially-Adaptive Self-Similarity
Song, Taeyong
Kim, Sunok
Sohn, Kwanghoon
2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 13672 - 13680
[46] Spatially-adaptive analytical reconstruction of quantitative gated cardiac SPECT in KL domain
Fan, Yi
Lu, Hongbing
Liu, Xin
Wang, Shuyi
Liang, Zhengrong
2007 IEEE NUCLEAR SCIENCE SYMPOSIUM CONFERENCE RECORD, VOLS 1-11, 2007, : 3889 - +
[47] Generative Adversarial Networks with Bi-directional Normalization for Semantic Image Synthesis
Long, Jia
Lu, Hongtao
PROCEEDINGS OF THE 2021 INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL (ICMR '21), 2021, : 219 - 226
[48] Spatially-adaptive regularized pel-recursive motion estimation based on the EM algorithm
Estrela, VV
Galatsanos, NP
IMAGE AND VIDEO COMMUNICATIONS AND PROCESSING 2000, 2000, 3974 : 372 - 383
[49] DRAN: Detailed Region-Adaptive Normalization for Conditional Image Synthesis
Lyu, Yueming
Chen, Peibin
Sun, Jingna
Peng, Bo
Wang, Xu
Dong, Jing
IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 1969 - 1982
[50] Spatially-adaptive bases in wavelet-based coding of semi-regular meshes
Denis, Leon
Florea, Ruxandra
Munteanu, Adrian
Schelkens, Peter
OPTICS, PHOTONICS, AND DIGITAL TECHNOLOGIES FOR MULTIMEDIA APPLICATIONS, 2010, 7723

← 1 2 3 4 5 →