Photo-realistic image synthesis from lines and appearance with modular modulation

Cited by: 2
Authors
Luo, Wuyang [1]
Yang, Su [1]
Zhang, Weishan [2]
Affiliations
[1] Fudan Univ, Sch Comp Sci, Shanghai Key Lab Intelligent Informat Proc, Shanghai, Peoples R China
[2] China Univ Petr Huadong, Qingdao Campus, Qingdao, Peoples R China
Keywords
Image Synthesis; Image-to-Image Translation; Feature Fusion; Generative Adversarial Networks
DOI
10.1016/j.neucom.2022.06.007
Chinese Library Classification number
TP18 [Artificial Intelligence Theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Image-to-image translation has made significant progress by relying on conditional generative adversarial networks. However, many tasks require multiple condition images. This paper considers a classic application scenario: synthesizing photo-realistic images from lines and appearance, which describe structure and appearance information, respectively. Examples include generating realistic face images from portrait drawings and color scribbles, and generating photos from sketches and texture patches. The key to this type of task is how to fuse the two kinds of conditional information. We propose an image translation system driven by line and appearance images, introducing a modular architecture for condition fusion. Unlike previous condition fusion schemes, the main body of its generator is composed of stacked modulation units (MUs). Structural features and appearance features are progressively incorporated via cascaded MUs, each of which attends to local regions. The visualization experiment shows that this scheme lets the network automatically learn to decompose the fusion process into multiple sub-steps in latent spaces. Our model produces higher-quality results, both quantitatively and qualitatively, than the state-of-the-art method on different tasks and datasets. The ablation study demonstrates the effectiveness of the MUs and intuitively explains the process of feature fusion through visualization. (c) 2022 Elsevier B.V. All rights reserved.
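The abstract does not specify the internal design of a modulation unit, so the following is only a minimal sketch of the general idea of cascaded, locally gated feature modulation, not the authors' architecture. Everything named here is hypothetical: `ModulationUnit`, `fuse`, the scalar weights that stand in for learned convolutions, and the use of flat lists of floats in place of feature maps.

```python
import math
import random


def normalize(x):
    """Zero-mean, unit-variance normalization of a flat feature vector."""
    mean = sum(x) / len(x)
    var = sum((v - mean) ** 2 for v in x) / len(x)
    return [(v - mean) / math.sqrt(var + 1e-5) for v in x]


def sigmoid(v):
    return 1.0 / (1.0 + math.exp(-v))


class ModulationUnit:
    """Hypothetical unit: a gated, per-position scale-and-shift.

    Assumption: the scale and shift come from the appearance features and
    the gate comes from the structure features. The paper may do otherwise.
    """

    def __init__(self, rng):
        # Random scalars stand in for learned layers (assumption).
        self.w_gamma = rng.uniform(-1.0, 1.0)
        self.w_beta = rng.uniform(-1.0, 1.0)
        self.w_gate = rng.uniform(-1.0, 1.0)
        self.b_gate = rng.uniform(-1.0, 1.0)

    def __call__(self, hidden, structure, appearance):
        normed = normalize(hidden)
        out = []
        for h, n, s, a in zip(hidden, normed, structure, appearance):
            # Gate in (0, 1): how strongly this unit edits this position.
            gate = sigmoid(self.w_gate * s + self.b_gate)
            # Appearance-driven scale and shift of the normalized feature.
            modulated = (1.0 + self.w_gamma * a) * n + self.w_beta * a
            # Blend the old feature with the modulated one.
            out.append((1.0 - gate) * h + gate * modulated)
        return out


def fuse(structure, appearance, num_units=4, seed=0):
    """Pass features through a cascade of units, one fusion sub-step each."""
    rng = random.Random(seed)
    units = [ModulationUnit(rng) for _ in range(num_units)]
    hidden = list(structure)  # start from the structural features
    for unit in units:
        hidden = unit(hidden, structure, appearance)
    return hidden


fused = fuse([0.0, 1.0, 0.0, 1.0], [0.2, 0.4, 0.6, 0.8])
```

Because each unit has its own gate, different units can concentrate on different positions, which loosely mirrors the abstract's description of fusion being decomposed into multiple sub-steps.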
Pages: 81-91
Number of pages: 11