Deformable Generator Networks: Unsupervised Disentanglement of Appearance and Geometry

被引:5
|
作者
Xing, Xianglei [1 ]
Gao, Ruiqi [3 ]
Han, Tian [2 ]
Zhu, Song-Chun [3 ]
Wu, Ying Nian [3 ]
机构
[1] Harbin Engn Univ, Coll Automat, Harbin 150001, Heilongjiang, Peoples R China
[2] Stevens Inst Technol, Comp Sci Dept, Hoboken, NJ 07030 USA
[3] Univ Calif Los Angeles, Dept Stat, Los Angeles, CA 90095 USA
关键词
Generators; Deformable models; Data models; Shape; Interpolation; Analytical models; Image color analysis; Unsupervised learning; deep generative model; deformable model; REPRESENTATION;
D O I
10.1109/TPAMI.2020.3013905
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present a deformable generator model to disentangle the appearance and geometric information for both image and video data in a purely unsupervised manner. The appearance generator network models the information related to appearance, including color, illumination, identity or category, while the geometric generator performs geometric warping, such as rotation and stretching, through generating deformation field which is used to warp the generated appearance to obtain the final image or video sequences. Two generators take independent latent vectors as input to disentangle the appearance and geometric information from image or video sequences. For video data, a nonlinear transition model is introduced to both the appearance and geometric generators to capture the dynamics over time. The proposed scheme is general and can be easily integrated into different generative models. An extensive set of qualitative and quantitative experiments shows that the appearance and geometric information can be well disentangled, and the learned geometric generator can be conveniently transferred to other image datasets that share similar structure regularity to facilitate knowledge transfer tasks.
引用
收藏
页码:1162 / 1179
页数:18
相关论文
共 50 条
  • [31] Orthogonal Jacobian Regularization for Unsupervised Disentanglement in Image Generation
    Wei, Yuxiang
    Shi, Yupeng
    Liu, Xiao
    Ji, Zhilong
    Gao, Yuan
    Wu, Zhongqin
    Zuo, Wangmeng
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 6701 - 6710
  • [32] Unsupervised geometry calibration of acoustic sensor networks using source correspondences
    Schmalenstroeer, Joerg
    Jacob, Florian
    Haeb-Umbach, Reinhold
    Hennecke, Marius H.
    Fink, Gernot A.
    12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 604 - +
  • [33] Token-level disentanglement for unsupervised text style transfer
    Hu, Yahao
    Tao, Wei
    Xie, Yifei
    Sun, Yi
    Pan, Zhisong
    NEUROCOMPUTING, 2023, 560
  • [34] Deformable Sprites for Unsupervised Video Decomposition
    Ye, Vickie
    Li, Zhengqi
    Tucker, Richard
    Kanazawa, Angjoo
    Snavely, Noah
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 2647 - 2656
  • [35] Editable Image Generation with Consistent Unsupervised Disentanglement Based on GAN
    Yang, Gaoming
    Qu, Yuanjin
    Fang, Xianjin
    APPLIED SCIENCES-BASEL, 2022, 12 (11):
  • [36] Rethinking Disentanglement in Unsupervised Domain Adaptation for Medical Image Segmentation
    Wang, Yan
    Chen, Yixin
    Zhang, Yingying
    Zhu, Haogang
    2023 45TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE & BIOLOGY SOCIETY, EMBC, 2023,
  • [37] Mutual Information based Method for Unsupervised Disentanglement of Video Representation
    Sreekar, P. Aditya
    Tiwari, Ujjwal
    Namboodiri, Anoop
    2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 6396 - 6403
  • [38] Mutual-weighted feature disentanglement for unsupervised domain adaptation
    Wang, Shanshan
    Xiao, Qian
    Wang, Keyang
    Yang, Xun
    Zhang, Xingyi
    MULTIMEDIA SYSTEMS, 2024, 30 (06)
  • [39] UNSUPERVISED SPEECH ENHANCEMENT WITH SPEECH RECOGNITION EMBEDDING AND DISENTANGLEMENT LOSSES
    Viet Anh Trinh
    Braun, Sebastian
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 391 - 395
  • [40] Stylistic Chinese Poetry Generation via Unsupervised Style Disentanglement
    Yang, Cheng
    Sun, Maosong
    Yi, Xiaoyuan
    Li, Wenhao
    2018 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2018), 2018, : 3960 - 3969