Photo-realistic image synthesis from lines and appearance with modular modulation

被引:2
|
作者
Luo, Wuyang [1 ]
Yang, Su [1 ]
Zhang, Weishan [2 ]
机构
[1] Fudan Univ, Sch Comp Sci, Shanghai Key Lab Intelligent Informat Proc, Shanghai, Peoples R China
[2] China Univ Petr Huadong, Qingdao Campus, Qingdao, Peoples R China
关键词
Image Synthesis; Image -to -Image Translation; Feature Fusion; Generative Adversarial Networks;
D O I
10.1016/j.neucom.2022.06.007
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The image-to-image translation task has made significant progress by relying on conditional generative adversarial networks. However, for many tasks, multiple condition images are required. This paper con-siders a very classic application scenario, using lines and appearance to synthesize photo-realistic images, describing structure and appearance information, respectively, for example, generating realistic face images from portrait drawings and color scribbles, and generating photos from sketches and texture patches. The key to this type of task is how to fuse the two conditional information. We propose an image translation system driven by line and appearance images, introducing a modular architecture for condi-tion fusion. Unlike the previous condition fusion schemes, its main body of the generator is composed of stacked modulation units (MUs). Here, structural features and appearance features are progressively incorporated via cascaded MUs, each of which pays attention to the local regions. The visualization exper-iment shows that such a scheme lets the network automatically learn to decompose the fusion process as multiple sub-steps in latent spaces. Our model produces higher quality results quantitatively and qual-itatively compared to the state-of-the-art method on different tasks and datasets. The ablation study demonstrates the effectiveness of the MUs and intuitively explains the process of feature fusion through visualization.(c) 2022 Elsevier B.V. All rights reserved.
引用
收藏
页码:81 / 91
页数:11
相关论文
共 50 条
  • [41] PHOTO-REALISTIC 3D MAPPING FROM AERIAL OBLIQUE IMAGERY
    Rau, J. Y.
    Chu, C. Y.
    2010 CANADIAN GEOMATICS CONFERENCE AND SYMPOSIUM OF COMMISSION I, ISPRS CONVERGENCE IN GEOMATICS - SHAPING CANADA'S COMPETITIVE LANDSCAPE, 2010, 38
  • [42] Rec2Real: Semantics-Guided Photo-Realistic Image Synthesis Using Rough Urban Reconstruction Models
    Miao, Hui
    Lu, Feixiang
    Xu, Tiancheng
    Zhang, Liangjun
    Zhou, Bin
    ADVANCES IN COMPUTER GRAPHICS, CGI 2022, 2022, 13443 : 369 - 380
  • [43] Sketch2Photo: Synthesizing photo-realistic images from sketches via global contexts
    Liu, Heng
    Xu, Yao
    Chen, Feng
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 117
  • [44] Deep learning-based image analysis in muscle histopathology using photo-realistic synthetic data
    Mill, Leonid
    Aust, Oliver
    Ackermann, Jochen A.
    Burger, Philipp
    Pascual, Monica
    Palumbo-Zerr, Katrin
    Kroenke, Gerhard
    Uderhardt, Stefan
    Schett, Georg
    Clemen, Christoph S.
    Holtzhausen, Christian
    Jabari, Samir
    Schroeder, Rolf
    Maier, Andreas
    Grueneboom, Anika
    COMMUNICATIONS MEDICINE, 2025, 5 (01):
  • [45] Photo-realistic image bit-depth enhancement via residual transposed convolutional neural network
    Su, Yuting
    Sun, Wanning
    Liu, Jing
    Zhai, Guangtao
    Jing, Peiguang
    NEUROCOMPUTING, 2019, 347 : 200 - 211
  • [46] Photo-realistic 2D expression transfer based on FFT and modified Poisson image editing
    Tian, Chunna
    Li, Haiyang
    Gao, Xinbo
    NEUROCOMPUTING, 2018, 309 : 1 - 10
  • [47] StyleAvatar: Real-time Photo-realistic Portrait Avatar from a Single Video
    Wang, Lizhen
    Zhao, Xiaochen
    Sun, Jingxiang
    Zhang, Yuxiang
    Zhang, Hongwen
    Yu, Tao
    Liu, Yebin
    PROCEEDINGS OF SIGGRAPH 2023 CONFERENCE PAPERS, SIGGRAPH 2023, 2023,
  • [48] U-Net Conditional GANs for Photo-Realistic and Identity-Preserving Facial Expression Synthesis
    Wang, Xueping
    Wang, Yunhong
    Li, Weixin
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2019, 15 (03)
  • [49] AUTOMATIC RETRIEVAL OF NEAR PHOTO-REALISTIC TEXTURES FROM SINGLE GROUND-LEVEL BUILDING IMAGES
    Turker, M.
    Sumer, E.
    GEOBIA 2010: GEOGRAPHIC OBJECT-BASED IMAGE ANALYSIS, 2010, 38-4-C7
  • [50] Photo-consistent motion blur modeling for realistic image synthesis
    Lin, Huei-Yung
    Chang, Chia-Hong
    ADVANCES IN IMAGE AND VIDEO TECHNOLOGY, PROCEEDINGS, 2006, 4319 : 1273 - +