Deformable Generator Networks: Unsupervised Disentanglement of Appearance and Geometry

被引：5

作者：

Xing, Xianglei ^{[1
]}

Gao, Ruiqi ^{[3
]}

Han, Tian ^{[2
]}

Zhu, Song-Chun ^{[3
]}

Wu, Ying Nian ^{[3
]}

机构：

[1] Harbin Engn Univ, Coll Automat, Harbin 150001, Heilongjiang, Peoples R China

[2] Stevens Inst Technol, Comp Sci Dept, Hoboken, NJ 07030 USA

[3] Univ Calif Los Angeles, Dept Stat, Los Angeles, CA 90095 USA

来源：

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE | 2022年 / 44卷 / 03期

关键词：

Generators; Deformable models; Data models; Shape; Interpolation; Analytical models; Image color analysis; Unsupervised learning; deep generative model; deformable model; REPRESENTATION;

D O I：

10.1109/TPAMI.2020.3013905

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

We present a deformable generator model to disentangle the appearance and geometric information for both image and video data in a purely unsupervised manner. The appearance generator network models the information related to appearance, including color, illumination, identity or category, while the geometric generator performs geometric warping, such as rotation and stretching, through generating deformation field which is used to warp the generated appearance to obtain the final image or video sequences. Two generators take independent latent vectors as input to disentangle the appearance and geometric information from image or video sequences. For video data, a nonlinear transition model is introduced to both the appearance and geometric generators to capture the dynamics over time. The proposed scheme is general and can be easily integrated into different generative models. An extensive set of qualitative and quantitative experiments shows that the appearance and geometric information can be well disentangled, and the learned geometric generator can be conveniently transferred to other image datasets that share similar structure regularity to facilitate knowledge transfer tasks.

引用

页码：1162 / 1179

页数：18

共 50 条

[31] Orthogonal Jacobian Regularization for Unsupervised Disentanglement in Image Generation
Wei, Yuxiang
Shi, Yupeng
Liu, Xiao
Ji, Zhilong
Gao, Yuan
Wu, Zhongqin
Zuo, Wangmeng
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 6701 - 6710
[32] Unsupervised geometry calibration of acoustic sensor networks using source correspondences
Schmalenstroeer, Joerg
Jacob, Florian
Haeb-Umbach, Reinhold
Hennecke, Marius H.
Fink, Gernot A.
12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 604 - +
[33] Token-level disentanglement for unsupervised text style transfer
Hu, Yahao
Tao, Wei
Xie, Yifei
Sun, Yi
Pan, Zhisong
NEUROCOMPUTING, 2023, 560
[34] Deformable Sprites for Unsupervised Video Decomposition
Ye, Vickie
Li, Zhengqi
Tucker, Richard
Kanazawa, Angjoo
Snavely, Noah
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 2647 - 2656
[35] Editable Image Generation with Consistent Unsupervised Disentanglement Based on GAN
Yang, Gaoming
Qu, Yuanjin
Fang, Xianjin
APPLIED SCIENCES-BASEL, 2022, 12 (11):
[36] Rethinking Disentanglement in Unsupervised Domain Adaptation for Medical Image Segmentation
Wang, Yan
Chen, Yixin
Zhang, Yingying
Zhu, Haogang
2023 45TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE & BIOLOGY SOCIETY, EMBC, 2023,
[37] Mutual Information based Method for Unsupervised Disentanglement of Video Representation
Sreekar, P. Aditya
Tiwari, Ujjwal
Namboodiri, Anoop
2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 6396 - 6403
[38] Mutual-weighted feature disentanglement for unsupervised domain adaptation
Wang, Shanshan
Xiao, Qian
Wang, Keyang
Yang, Xun
Zhang, Xingyi
MULTIMEDIA SYSTEMS, 2024, 30 (06)
[39] UNSUPERVISED SPEECH ENHANCEMENT WITH SPEECH RECOGNITION EMBEDDING AND DISENTANGLEMENT LOSSES
Viet Anh Trinh
Braun, Sebastian
2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 391 - 395
[40] Stylistic Chinese Poetry Generation via Unsupervised Style Disentanglement
Yang, Cheng
Sun, Maosong
Yi, Xiaoyuan
Li, Wenhao
2018 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2018), 2018, : 3960 - 3969

← 1 2 3 4 5 →