Dual Encoder-Decoder Based Generative Adversarial Networks for Disentangled Facial Representation Learning

Cited by: 8

Authors:

Hu, Cong [1,2,3]

Feng, Zhenhua [4,5]

Wu, Xiaojun [1,2]

Kittler, Josef [5]

Affiliations:

[1] Jiangnan Univ, Sch Artificial Intelligence & Comp Sci, Wuxi 214122, Jiangsu, Peoples R China

[2] Jiangnan Univ, Jiangsu Prov Engn Lab Pattern Recognit & Computat, Wuxi 214122, Jiangsu, Peoples R China

[3] Minjiang Univ, Fujian Prov Key Lab Informat Proc & Intelligent C, Fuzhou 350121, Peoples R China

[4] Univ Surrey, Dept Comp Sci, Guildford GU2 7XH, Surrey, England

[5] Univ Surrey, Ctr Vis Speech & Signal Proc, Guildford GU2 7XH, Surrey, England

Source:

IEEE Access | 2020 / Vol. 8

Funding:

National Natural Science Foundation of China; Engineering and Physical Sciences Research Council (UK)

Keywords:

Face; Gallium nitride; Generative adversarial networks; Training; Generators; Face recognition; Task analysis; Disentangled representation learning; encoder-decoder; face synthesis; pose invariant face recognition

DOI:

10.1109/ACCESS.2020.3009512

Chinese Library Classification (CLC) number:

TP [Automation Technology, Computer Technology]

Discipline classification code:

0812

Abstract:

To learn disentangled representations of facial images, we present a Dual Encoder-Decoder based Generative Adversarial Network (DED-GAN). In the proposed method, both the generator and the discriminator use deep encoder-decoder architectures as their backbones. Specifically, the encoder-decoder generator learns a pose-disentangled face representation, while the encoder-decoder discriminator performs real/fake classification, face reconstruction, identity determination, and face pose estimation. We further improve the network by minimizing an additional pixel-wise loss, defined by the Wasserstein distance at the output of the discriminator, so that the adversarial framework can be trained more effectively. Additionally, we treat face pose variation as continuous, rather than discrete as in the existing literature, to inject richer pose information into the model. The pose estimation task is formulated as a regression problem, which helps to disentangle identity information from pose variations. The proposed network is evaluated on pose-invariant face recognition (PIFR) and face synthesis across poses. Extensive quantitative and qualitative evaluations on several controlled and in-the-wild benchmark datasets demonstrate the superiority of DED-GAN over state-of-the-art approaches.

Pages: 130159-130171

Page count: 13
