Holoported Characters: Real-time Free-viewpoint Rendering of Humans from Sparse RGB Cameras

被引:1
|
作者
Shetty, Ashwath [1 ,2 ]
Habermann, Marc [1 ,3 ]
Sun, Guoxing [1 ]
Luvizon, Diogo [1 ,3 ]
Golyanik, Vladislav [1 ]
Theobalt, Christian [1 ,3 ]
机构
[1] Max Planck Inst Informat, Saarland Informat Campus, Saarbrucken, Germany
[2] Saarland Univ, Saarbrucken, Germany
[3] Saarbrucken Res Ctr Visual Comp Interact & AI, Saarbrucken, Germany
关键词
VIDEO;
D O I
10.1109/CVPR52733.2024.00121
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present the first approach to render highly realistic free-viewpoint videos of a human actor in general apparel, from sparse multi-view recording to display, in real-time at an unprecedented 4K resolution. At inference, our method only requires four camera views of the moving actor and the respective 3D skeletal pose. It handles actors in wide clothing, and reproduces even fine-scale dynamic detail, e.g. clothing wrinkles, face expressions, and hand gestures. At training time, our learning-based approach expects dense multi-view video and a rigged static surface scan of the actor. Our method comprises three main stages. Stage 1 is a skeleton-driven neural approach for high-quality capture of the detailed dynamic mesh geometry. Stage 2 is a novel solution to create a view-dependent texture using four testtime camera views as input. Finally, stage 3 comprises a new image-based refinement network rendering the final 4K image given the output from the previous stages. Our approach establishes a new benchmark for real-time rendering resolution and quality using sparse input camera views, unlocking possibilities for immersive telepresence. Code and data is available on our project page.
引用
收藏
页码:1206 / 1215
页数:10
相关论文
共 50 条
  • [1] Real-time, free-viewpoint video rendering from volumetric geometry
    Goldlücke, B
    Magnor, M
    VISUAL COMMUNICATIONS AND IMAGE PROCESSING 2003, PTS 1-3, 2003, 5150 : 1152 - 1158
  • [2] Real-time microfacet billboarding for free-viewpoint video rendering
    Goldlücke, B
    Magnor, M
    2003 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOL 3, PROCEEDINGS, 2003, : 713 - 716
  • [3] FlexNeRF: Photorealistic Free-viewpoint Rendering of Moving Humans from Sparse Views
    Jayasundara, Vinoj
    Agrawal, Amit
    Heron, Nicolas
    Shrivastava, Abhinav
    Davis, Larry S.
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 21118 - 21127
  • [4] Real-time free-viewpoint video generation using multiple cameras and a PC-cluster
    Ueda, M
    Arita, D
    Taniguchi, RI
    PROCEEDINGS OF THE SEVENTH IASTED INTERNATIONAL CONFERENCE ON COMPUTER GRAPHICS AND IMAGING, 2004, : 87 - 92
  • [5] Real-time free-viewpoint video generation using multiple cameras and a PC-cluster
    Ueda, M
    Arita, D
    Taniguchi, RI
    ADVANCES IN MULTIMEDIA INFORMATION PROCESSING - PCM 2004, PT 1, PROCEEDINGS, 2004, 3331 : 418 - 425
  • [6] Free-Viewpoint RGB-D Human Performance Capture and Rendering
    Phong Nguyen-Ha
    Sarafianos, Nikolaos
    Lassner, Christoph
    Heikkila, Janne
    Tung, Tony
    COMPUTER VISION - ECCV 2022, PT XVI, 2022, 13676 : 473 - 491
  • [7] Real-time free viewpoint from multiple moving cameras
    Nozick, Vincent
    Saito, Hideo
    ADVANCED CONCEPTS FOR INTELLIGENT VISION SYSTEMS, PROCEEDINGS, 2007, 4678 : 72 - 83
  • [8] A low-cost, practical acquisition and rendering pipeline for real-time free-viewpoint video communication
    Sverker Rasmuson
    Erik Sintorn
    Ulf Assarsson
    The Visual Computer, 2021, 37 : 553 - 565
  • [9] A low-cost, practical acquisition and rendering pipeline for real-time free-viewpoint video communication
    Rasmuson, Sverker
    Sintorn, Erik
    Assarsson, Ulf
    VISUAL COMPUTER, 2021, 37 (03): : 553 - 565
  • [10] Real-time free viewpoint video from a range sensor and color cameras
    Pelletier, Stephane
    Cooperstock, Jeremy R.
    MACHINE VISION AND APPLICATIONS, 2013, 24 (04) : 739 - 751