Self-Supervised 3D Representation Learning of Dressed Humans From Social Media Videos

被引:0
|
作者
Jafarian, Yasamin [1 ]
Park, Hyun Soo [1 ]
机构
[1] Univ Minnesota, Minneapolis, MN 55455 USA
关键词
Depth estimation; dataset; high fidelity human reconstruction; normal estimation; single view 3D reconstruction; self-supervised learning;
D O I
10.1109/TPAMI.2022.3231558
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A key challenge of learning a visual representation for the 3D high fidelity geometry of dressed humans lies in the limited availability of the ground truth data (e.g., 3D scanned models), which results in the performance degradation of 3D human reconstruction when applying to real-world imagery. We address this challenge by leveraging a new data resource: a number of social media dance videos that span diverse appearance, clothing styles, performances, and identities. Each video depicts dynamic movements of the body and clothes of a single person while lacking the 3D ground truth geometry. To learn a visual representation from these videos, we present a new self-supervised learning method to use the local transformation that warps the predicted local geometry of the person from an image to that of another image at a different time instant. This allows self-supervision by enforcing a temporal coherence over the predictions. In addition, we jointly learn the depths along with the surface normals that are highly responsive to local texture, wrinkle, and shade by maximizing their geometric consistency. Our method is end-to-end trainable, resulting in high fidelity depth estimation that predicts fine geometry faithful to the input real image. We further provide a theoretical bound of self-supervised learning via an uncertainty analysis that characterizes the performance of the self-supervised learning without training. We demonstrate that our method outperforms the state-of-the-art human depth estimation and human shape recovery approaches on both real and rendered images.
引用
收藏
页码:8969 / 8983
页数:15
相关论文
共 50 条
  • [31] SegContrast: 3D Point Cloud Feature Representation Learning Through Self-Supervised Segment Discrimination
    Nunes, Lucas
    Marcuzzi, Rodrigo
    Chen, Xieyuanli
    Behley, Jens
    Stachniss, Cyrill
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2022, 7 (02) : 2116 - 2123
  • [32] CMD: Self-supervised 3D Action Representation Learning with Cross-Modal Mutual Distillation
    Mao, Yunyao
    Zhou, Wengang
    Lu, Zhenbo
    Deng, Jiajun
    Li, Houqiang
    COMPUTER VISION - ECCV 2022, PT III, 2022, 13663 : 734 - 752
  • [33] Self-supervised Learning of Morphological Representation for 3D EM Segments with Cluster-Instance Correlations
    Zhang, Chi
    Chen, Qihua
    Chen, Xuejin
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2022, PT VIII, 2022, 13438 : 99 - 108
  • [34] Self-Supervised Representation Learning from Flow Equivariance
    Xiong, Yuwen
    Ren, Mengye
    Zeng, Wenyuan
    Urtasun, Raquel
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 10171 - 10180
  • [35] SELF-SUPERVISED REPRESENTATION LEARNING FROM ELECTROENCEPHALOGRAPHY SIGNALS
    Banville, Hubert
    Albuquerque, Isabela
    Hyvarinen, Aapo
    Moffat, Graeme
    Engemann, Denis-Alexander
    Gramfort, Alexandre
    2019 IEEE 29TH INTERNATIONAL WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING (MLSP), 2019,
  • [36] Whitening for Self-Supervised Representation Learning
    Ermolov, Aleksandr
    Siarohin, Aliaksandr
    Sangineto, Enver
    Sebe, Nicu
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139
  • [37] Learning High Fidelity Depths of Dressed Humans by Watching Social Media Dance Videos
    Jafarian, Yasamin
    Park, Hyun Soo
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 12748 - 12757
  • [38] SELF-SUPERVISED 3D SKELETON REPRESENTATION LEARNING WITH ACTIVE SAMPLING AND ADAPTIVE RELABELING FOR ACTION RECOGNITION
    Wang, Guoquan
    Liu, Hong
    Guo, Tianyu
    Guo, Jingwen
    Wang, Ti
    Li, Yidi
    2023 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2023, : 56 - 60
  • [39] Self-Supervised Representation Learning for CAD
    Jones, Benjamin T.
    Hu, Michael
    Kodnongbua, Milin
    Kim, Vladimir G.
    Schulz, Adriana
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 21327 - 21336
  • [40] Exploring Self-Supervised Learning for 3D Point Cloud Registration
    Yuan, Mingzhi
    Huang, Qiao
    Shen, Ao
    Huang, Xiaoshui
    Wang, Manning
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2025, 10 (01): : 25 - 31