Dual Encoder-Decoder Based Generative Adversarial Networks for Disentangled Facial Representation Learning

Cited by: 8
Authors
Hu, Cong [1, 2, 3]
Feng, Zhenhua [4, 5]
Wu, Xiaojun [1, 2]
Kittler, Josef [5]
Affiliations
[1] Jiangnan Univ, Sch Artificial Intelligence & Comp Sci, Wuxi 214122, Jiangsu, Peoples R China
[2] Jiangnan Univ, Jiangsu Prov Engn Lab Pattern Recognit & Computat, Wuxi 214122, Jiangsu, Peoples R China
[3] Minjiang Univ, Fujian Prov Key Lab Informat Proc & Intelligent C, Fuzhou 350121, Peoples R China
[4] Univ Surrey, Dept Comp Sci, Guildford GU2 7XH, Surrey, England
[5] Univ Surrey, Ctr Vis Speech & Signal Proc, Guildford GU2 7XH, Surrey, England
Funding
National Natural Science Foundation of China; Engineering and Physical Sciences Research Council (UK)
Keywords
Face; Generative adversarial networks; Training; Generators; Face recognition; Task analysis; Disentangled representation learning; Encoder-decoder; Face synthesis; Pose-invariant face recognition
DOI
10.1109/ACCESS.2020.3009512
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
To learn disentangled representations of facial images, we present a Dual Encoder-Decoder based Generative Adversarial Network (DED-GAN). In the proposed method, both the generator and the discriminator use deep encoder-decoder architectures as their backbones. Specifically, the encoder-decoder structured generator learns a pose-disentangled face representation, and the encoder-decoder structured discriminator is tasked with real/fake classification, face reconstruction, identity determination and face pose estimation. We further improve the proposed network architecture by minimizing an additional pixel-wise loss, defined by the Wasserstein distance at the output of the discriminator, so that the adversarial framework can be better trained. Additionally, we treat face pose variation as continuous, rather than discrete as in the existing literature, to inject richer pose information into our model. The pose estimation task is formulated as a regression problem, which helps to disentangle identity information from pose variations. The proposed network is evaluated on the tasks of pose-invariant face recognition (PIFR) and face synthesis across poses. An extensive quantitative and qualitative evaluation on several controlled and in-the-wild benchmarking datasets demonstrates the superiority of the proposed DED-GAN method over state-of-the-art approaches.
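The abstract describes a discriminator that handles four tasks at once, with pose treated as a continuous regression target. The following is a minimal illustrative sketch of how such a multi-task objective could be combined for a single sample. It is not the paper's formulation: the function names, the L1 reconstruction term, the squared-error pose term, the simple critic-score difference standing in for the adversarial term, and the unit weights are all assumptions made for illustration.

```python
import math


def softmax_cross_entropy(logits, label):
    """Cross-entropy for one sample, given raw identity class scores."""
    m = max(logits)  # subtract the max for numerical stability
    log_sum = m + math.log(sum(math.exp(z - m) for z in logits))
    return log_sum - logits[label]


def pixel_l1(a, b):
    """Mean absolute difference between two flattened images."""
    return sum(abs(x - y) for x, y in zip(a, b)) / len(a)


def pose_regression_loss(pred_pose, true_pose):
    """Squared error on a continuous (here: normalised) pose value."""
    return (pred_pose - true_pose) ** 2


def discriminator_loss(critic_real, critic_fake, real_img, recon_img,
                       id_logits, id_label, pred_pose, true_pose,
                       weights=(1.0, 1.0, 1.0, 1.0)):
    """Weighted sum of four task losses (hypothetical weights)."""
    w_adv, w_rec, w_id, w_pose = weights
    terms = {
        # Critic-style adversarial term: lower when real scores exceed fake.
        "adv": critic_fake - critic_real,
        # Reconstruction of the real input by the encoder-decoder.
        "rec": pixel_l1(real_img, recon_img),
        # Identity classification.
        "id": softmax_cross_entropy(id_logits, id_label),
        # Pose as regression rather than classification over pose bins.
        "pose": pose_regression_loss(pred_pose, true_pose),
    }
    terms["total"] = (w_adv * terms["adv"] + w_rec * terms["rec"]
                      + w_id * terms["id"] + w_pose * terms["pose"])
    return terms


if __name__ == "__main__":
    out = discriminator_loss(
        critic_real=0.8, critic_fake=0.3,
        real_img=[0.0, 0.5, 1.0], recon_img=[0.1, 0.5, 0.8],
        id_logits=[0.0, 0.0], id_label=0,
        pred_pose=0.5, true_pose=0.25,
    )
    for name, value in out.items():
        print(f"{name}: {value:.4f}")
```

Treating pose as a regression target means a prediction that is slightly off is penalised less than one that is far off, which a classification loss over discrete pose bins does not capture.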
Pages: 130159-130171
Number of pages: 13