Learning High Fidelity Depths of Dressed Humans by Watching Social Media Dance Videos

被引:29
|
作者
Jafarian, Yasamin [1 ]
Park, Hyun Soo [1 ]
机构
[1] Univ Minnesota, Minneapolis, MN 55455 USA
基金
美国国家科学基金会;
关键词
D O I
10.1109/CVPR46437.2021.01256
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A key challenge of learning the geometry of dressed humans lies in the limited availability of the ground truth data (e.g., 3D scanned models), which results in the performance degradation of 3D human reconstruction when applying to real-world imagery. We address this challenge by leveraging a new data resource: a number of social media dance videos that span diverse appearance, clothing styles, performances, and identities. Each video depicts dynamic movements of the body and clothes of a single person while lacking the 3D ground truth geometry. To utilize these videos, we present a new method to use the local transformation that warps the predicted local geometry of the person from an image to that of another image at a different time instant. This allows self-supervision as enforcing a temporal coherence over the predictions. In addition, we jointly learn the depth along with the surface normals that are highly responsive to local texture, wrinkle, and shade by maximizing their geometric consistency. Our method is end-to-end trainable, resulting in high fidelity depth estimation that predicts fine geometry faithful to the input real image. We demonstrate that our method outperforms the state-of-the-art human depth estimation and human shape recovery approaches on both real and rendered images.
引用
收藏
页码:12748 / 12757
页数:10
相关论文
共 36 条
  • [31] MOBILE AUGMENTED REALITY APPLICATION THROUGH METAVERSE APPROACH AS SOCIAL STUDIES LEARNING MEDIA IN JUNIOR HIGH SCHOOL
    Ruhimat, Mamat
    Logayah, Dina Siti
    Darmawan, Rizal Akbar
    JOURNAL OF ENGINEERING SCIENCE AND TECHNOLOGY, 2023, 18 : 176 - 185
  • [32] Hybrid grass bee optimization-multikernal extreme learning classifier: Multimodular fusion strategy and optimal feature selection for multimodal sentiment analysis in social media videos
    Alqahtani, Abdullah Saleh
    Saravanan, Pandiaraj
    Maheswari, Murali
    Alshmrany, Sami
    Alsarrayrih, Haytham
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2021, 33 (16):
  • [33] Digital Media Support Design Learning Behavior-A Social Networking Instructional Model for Teaching and Learning Design and Technology at High School in China
    Jiang, Hao
    Liu, Xiao-li
    Peng, Xiang
    Tang, Ming-xi
    2015 INTERNATIONAL CONFERENCE ON EDUCATION, MANAGEMENT AND SYSTEMS ENGINEERING (EMSE 2015), 2015, : 151 - 156
  • [34] Students' critical awareness of the internet and social media use as resources for Islamic learning in Indonesian public senior high schools
    Jusubaidi
    Mujahidin, Anwar
    Abdullah, Irwan
    Choirul Rofiq, Ahmad
    BRITISH JOURNAL OF RELIGIOUS EDUCATION, 2025, 47 (02) : 140 - 155
  • [35] Hidden emotional trends on social media regarding the Thailand-China high-speed railway project: a deep learning approach with ChatGPT integration
    Nokkaew, Manussawee
    Nongpong, Kwankamol
    Yeophantong, Tapanan
    Ploykitikoon, Pattravadee
    Arjharn, Weerachai
    Phonak, Duangkamol
    Siritaratiwat, Apirat
    Surawanitkun, Chayada
    SOCIAL NETWORK ANALYSIS AND MINING, 2024, 14 (01)
  • [36] TALENTED HIGH SCHOOL STUDENTS FROM ALL ACROSS RUSSIA RECEIVE AN OPPORTUNITY TO ENGAGE IN WEBINARS AND LEARNING THROUGH SOCIAL MEDIA: AN EYE-OPENING, YET STILL NOT A MIND OPENING EXPERIENCE
    Vasilyeva, Irina
    INTED2014: 8TH INTERNATIONAL TECHNOLOGY, EDUCATION AND DEVELOPMENT CONFERENCE, 2014, : 6689 - 6692