Dual Encoder-Decoder Based Generative Adversarial Networks for Disentangled Facial Representation Learning

被引:8
|
作者
Hu, Cong [1 ,2 ,3 ]
Feng, Zhenhua [4 ,5 ]
Wu, Xiaojun [1 ,2 ]
Kittler, Josef [5 ]
机构
[1] Jiangnan Univ, Sch Artificial Intelligence & Comp Sci, Wuxi 214122, Jiangsu, Peoples R China
[2] Jiangnan Univ, Jiangsu Prov Engn Lab Pattern Recognit & Computat, Wuxi 214122, Jiangsu, Peoples R China
[3] Minjiang Univ, Fujian Prov Key Lab Informat Proc & Intelligent C, Fuzhou 350121, Peoples R China
[4] Univ Surrey, Dept Comp Sci, Guildford GU2 7XH, Surrey, England
[5] Univ Surrey, Ctr Vis Speech & Signal Proc, Guildford GU2 7XH, Surrey, England
基金
中国国家自然科学基金; 英国工程与自然科学研究理事会;
关键词
Face; Gallium nitride; Generative adversarial networks; Training; Generators; Face recognition; Task analysis; Disentangled representation learning; encoder-decoder; generative adversarial networks; face synthesis; pose invariant face recognition; FACE RECOGNITION;
D O I
10.1109/ACCESS.2020.3009512
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
To learn disentangled representations of facial images, we present a Dual Encoder-Decoder based Generative Adversarial Network (DED-GAN). In the proposed method, both the generator and discriminator are designed with deep encoder-decoder architectures as their backbones. To be more specific, the encoder-decoder structured generator is used to learn a pose disentangled face representation, and the encoder-decoder structured discriminator is tasked to perform real/fake classification, face reconstruction, determining identity and estimating face pose. We further improve the proposed network architecture by minimizing the additional pixel-wise loss defined by the Wasserstein distance at the output of the discriminator so that the adversarial framework can be better trained. Additionally, we consider face pose variation to be continuous, rather than discrete in existing literature, to inject richer pose information into our model. The pose estimation task is formulated as a regression problem, which helps to disentangle identity information from pose variations. The proposed network is evaluated on the tasks of pose-invariant face recognition (PIFR) and face synthesis across poses. An extensive quantitative and qualitative evaluation carried out on several controlled and in-the-wild benchmarking datasets demonstrates the superiority of the proposed DED-GAN method over the state-of-the-art approaches.
引用
收藏
页码:130159 / 130171
页数:13
相关论文
共 50 条
  • [21] Finger-Vein Image Inpainting Based on an Encoder-Decoder Generative Network
    Li, Dan
    Guo, Xiaojing
    Zhang, Haigang
    Jia, Guimin
    Yang, Jinfeng
    PATTERN RECOGNITION AND COMPUTER VISION (PRCV 2018), PT I, 2018, 11256 : 87 - 97
  • [22] Semantic Image Inpainting using Self-Learning Encoder-Decoder and Adversarial Loss
    Salem, Nermin M.
    Mahdi, Hani M. K.
    Abbas, Hazem
    PROCEEDINGS OF 2018 13TH INTERNATIONAL CONFERENCE ON COMPUTER ENGINEERING AND SYSTEMS (ICCES), 2018, : 103 - 108
  • [23] DOM Refinement with neural Encoder-Decoder Networks
    Metzger, Nando
    PFG-JOURNAL OF PHOTOGRAMMETRY REMOTE SENSING AND GEOINFORMATION SCIENCE, 2020, 88 (3-4): : 362 - 363
  • [24] SCGAN: Disentangled Representation Learning by Adding Similarity Constraint on Generative Adversarial Nets
    Li, Xiaoqiang
    Chen, Liangbo
    Wang, Lu
    Wu, Pin
    Tong, Weiqin
    IEEE ACCESS, 2019, 7 : 147928 - 147938
  • [25] On Mining Conditions using Encoder-decoder Networks
    Gallego, Fernando O.
    Corchuelo, Rafael
    PROCEEDINGS OF THE 11TH INTERNATIONAL CONFERENCE ON AGENTS AND ARTIFICIAL INTELLIGENCE (ICAART), VOL 2, 2019, : 624 - 630
  • [26] Timber Tracing with Multimodal Encoder-Decoder Networks
    Zolotarev, Fedor
    Eerola, Tuomas
    Lensu, Lasse
    Kalviainen, Heikki
    Haario, Heikki
    Heikkinen, Jere
    Kauppi, Tomi
    COMPUTER ANALYSIS OF IMAGES AND PATTERNS, CAIP 2019, PT II, 2019, 11679 : 342 - 353
  • [27] Attentive encoder-decoder networks for crowd counting
    Liu, Xuhui
    Hu, Yutao
    Zhang, Baochang
    Zhen, Xiantong
    Luo, Xiaoyan
    Cao, Xianbin
    NEUROCOMPUTING, 2022, 490 : 246 - 257
  • [28] Attentive encoder-decoder networks for crowd counting
    Liu, Xuhui
    Hu, Yutao
    Zhang, Baochang
    Zhen, Xiantong
    Luo, Xiaoyan
    Cao, Xianbin
    Neurocomputing, 2022, 490 : 246 - 257
  • [29] A method of face repair based on encoder-decoder and dual discrimination network
    Cui Can
    Zhao Jun
    Xiong Xingzhong
    Edwards, Nathaniel O.
    PROCEEDINGS OF THE 39TH CHINESE CONTROL CONFERENCE, 2020, : 7565 - 7569
  • [30] Encoder-decoder based convolutional neural networks for image forgery detection
    El Biach, Fatima Zahra
    Iala, Imad
    Laanaya, Hicham
    Minaoui, Khalid
    MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (16) : 22611 - 22628