Deformable Generator Networks: Unsupervised Disentanglement of Appearance and Geometry

被引:5
|
作者
Xing, Xianglei [1 ]
Gao, Ruiqi [3 ]
Han, Tian [2 ]
Zhu, Song-Chun [3 ]
Wu, Ying Nian [3 ]
机构
[1] Harbin Engn Univ, Coll Automat, Harbin 150001, Heilongjiang, Peoples R China
[2] Stevens Inst Technol, Comp Sci Dept, Hoboken, NJ 07030 USA
[3] Univ Calif Los Angeles, Dept Stat, Los Angeles, CA 90095 USA
关键词
Generators; Deformable models; Data models; Shape; Interpolation; Analytical models; Image color analysis; Unsupervised learning; deep generative model; deformable model; REPRESENTATION;
D O I
10.1109/TPAMI.2020.3013905
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present a deformable generator model to disentangle the appearance and geometric information for both image and video data in a purely unsupervised manner. The appearance generator network models the information related to appearance, including color, illumination, identity or category, while the geometric generator performs geometric warping, such as rotation and stretching, through generating deformation field which is used to warp the generated appearance to obtain the final image or video sequences. Two generators take independent latent vectors as input to disentangle the appearance and geometric information from image or video sequences. For video data, a nonlinear transition model is introduced to both the appearance and geometric generators to capture the dynamics over time. The proposed scheme is general and can be easily integrated into different generative models. An extensive set of qualitative and quantitative experiments shows that the appearance and geometric information can be well disentangled, and the learned geometric generator can be conveniently transferred to other image datasets that share similar structure regularity to facilitate knowledge transfer tasks.
引用
收藏
页码:1162 / 1179
页数:18
相关论文
共 50 条
  • [1] Unsupervised Disentangling of Appearance and Geometry by Deformable Generator Network
    Xing, Xianglei
    Han, Tian
    Gao, Ruiqi
    Zhu, Song-Chun
    Wu, Ying Nian
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 10346 - 10355
  • [2] Motion-Based Generator Model: Unsupervised Disentanglement of Appearance, Trackable and Intrackable Motions in Dynamic Patterns
    Xie, Jianwen
    Gao, Ruiqi
    Zheng, Zilong
    Zhu, Song-Chun
    Wu, Ying Nian
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 12442 - 12451
  • [3] Unsupervised Disentanglement of Pose, Appearance and Background from Images and Videos
    Dundar, Aysegul
    Shih, Kevin
    Garg, Animesh
    Pottorff, Robert
    Tao, Andrew
    Catanzaro, Bryan
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (07) : 3883 - 3894
  • [4] Controllable Image Generation with Semi-supervised Deep Learning and Deformable-Mean-Template Based Geometry-Appearance Disentanglement *
    Wadhwani, Krishna
    Awate, Suyash P.
    PATTERN RECOGNITION, 2021, 118
  • [5] When is Unsupervised Disentanglement Possible?
    Horan, Daniella
    Richardson, Eitan
    Weiss, Yair
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021,
  • [6] Unsupervised Hierarchical Disentanglement for Video Prediction
    Motallebi, Mohammad Reza
    Westfechtel, Thomas
    Li, Yang
    Harada, Tatsuya
    2022 26TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2022, : 470 - 476
  • [7] DG-Font: Deformable Generative Networks for Unsupervised Font Generation
    Xie, Yangchen
    Chen, Xinyuan
    Sun, Li
    Lu, Yue
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 5126 - 5136
  • [8] Triple-Input-Unsupervised neural Networks for deformable image registration
    Qu, Lei
    Wan, Wan
    Guo, Kaixuan
    Liu, Yu
    Tang, Jun
    Li, Xiaolei
    Wu, Jun
    PATTERN RECOGNITION LETTERS, 2021, 151 : 332 - 339
  • [9] Deformable CT Image Registration Using Unsupervised Deep Learning Networks
    Lei, Y.
    Fu, Y.
    Tian, Z.
    Wang, T.
    Zhang, J.
    Dai, X.
    Zhou, J.
    Roper, J.
    McDonald, M.
    Yu, D.
    Bradley, J.
    Liu, T.
    Yang, X.
    MEDICAL PHYSICS, 2022, 49 (06) : E527 - E527
  • [10] DisDiff: Unsupervised Disentanglement of Diffusion Probabilistic Models
    Yang, Tao
    Wang, Yuwang
    Lu, Yan
    Zheng, Nanning
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,