High-Quality Facial Photo-Sketch Synthesis Using Multi-Adversarial Networks

被引:107
|
作者
Wang, Lidan [1 ]
Sindagi, Vishwanath A. [1 ]
Patel, Vishal M. [1 ]
机构
[1] Rutgers State Univ, 94 Brett Rd, Piscataway, NJ 08854 USA
关键词
face photo sketch synthesis; image-to-image translation; face recognition; multi-adversarial networks; FACE-RECOGNITION;
D O I
10.1109/FG.2018.00022
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Synthesizing face sketches from real photos and its inverse have many applications. However, photo/sketch synthesis remains a challenging problem due to the fact that photo and sketch have different characteristics. In this work, we consider this task as an image-to-image translation problem and explore the recently popular generative models (GANs) to generate high-quality realistic photos from sketches and sketches from photos. Recent GAN-based methods have shown promising results on image-to image translation problems and photo-to-sketch synthesis in particular, however, they are known to have limited abilities in generating high-resolution realistic images. To this end, we propose a novel synthesis framework called Photo-Sketch Synthesis using Multi-Adversarial Networks, (PS2-MAN) that iteratively generates low resolution to high resolution images in an adversarial way. The hidden layers of the generator are supervised to first generate lower resolution images followed by implicit refinement in the network to generate higher resolution images. Furthermore, since photo sketch synthesis is a coupled/paired translation problem, we leverage the pair information using Cyc1eGAN framework. Both Image Quality Assessment (IQA) and Photo-Sketch Matching experiments are conducted to demonstrate the superior performance of our framework in comparison to existing state-of-the-art solutions. Code available at: https://github.com/lidan1/PhotoSketchMAN.
引用
收藏
页码:83 / 90
页数:8
相关论文
共 50 条
  • [31] High-quality face image generation using particle swarm optimization-based generative adversarial networks
    Zhang, Long
    Zhao, Lin
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2021, 122 : 98 - 104
  • [32] Discriminative Region Proposal Adversarial Networks for High-Quality Image-to-Image Translation
    Wang, Chao
    Zheng, Haiyong
    Yu, Zhibin
    Zheng, Ziqiang
    Gu, Zhaorui
    Zheng, Bing
    COMPUTER VISION - ECCV 2018, PT I, 2018, 11205 : 796 - 812
  • [33] Probability Density Distillation with Generative Adversarial Networks for High-Quality Parallel Waveform Generation
    Yamamoto, Ryuichi
    Song, Eunwoo
    Kim, Jae-Min
    INTERSPEECH 2019, 2019, : 699 - 703
  • [34] High-quality face image generated with conditional boundary equilibrium generative adversarial networks
    Huang, Bin
    Chen, Weihai
    Wu, Xingming
    Lin, Chun-Liang
    Suganthan, Ponnuthurai Nagaratnam
    PATTERN RECOGNITION LETTERS, 2018, 111 : 72 - 79
  • [35] Failure of anthropometry as a facial identification technique using high-quality photographs
    Kleinberg, Krista F.
    Vanezis, Peter
    Burton, A. Mike
    JOURNAL OF FORENSIC SCIENCES, 2007, 52 (04) : 779 - 783
  • [36] High-Quality Passive Facial Performance Capture using Anchor Frames
    Beeler, Thabo
    Hahn, Fabian
    Bradley, Derek
    Bickel, Bernd
    Beardsley, Paul
    Gotsman, Craig
    Sumner, Robert W.
    Gross, Markus
    ACM TRANSACTIONS ON GRAPHICS, 2011, 30 (04):
  • [37] Deep face generation from a rough sketch using multi-level generative adversarial networks
    Xie, Binghua
    Jung, Cheolkon
    2022 26TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2022, : 1200 - 1207
  • [38] Multi-channel face reconstruction system based on sketch features using Conditional Adversarial Networks
    Zhang, Zeping
    Jiang, Miao
    Zhang, Zhiwei
    2020 5TH INTERNATIONAL CONFERENCE ON MATHEMATICS AND ARTIFICIAL INTELLIGENCE (ICMAI 2020), 2020, : 187 - 191
  • [39] Dynamic GAN for high-quality sign language video generation from skeletal poses using generative adversarial networks
    B. Natarajan
    R. Elakkiya
    Soft Computing, 2022, 26 : 13153 - 13175
  • [40] Dynamic GAN for high-quality sign language video generation from skeletal poses using generative adversarial networks
    Natarajan, B.
    Elakkiya, R.
    SOFT COMPUTING, 2022, 26 (23) : 13153 - 13175