Performance capture from sparse multi-view video

Cited by: 331
Authors
de Aguiar, Edilson [1]
Stoll, Carsten [1]
Theobalt, Christian [2]
Ahmed, Naveed [1]
Seidel, Hans-Peter [1]
Thrun, Sebastian [2]
Affiliations
[1] MPI Informat, Saarbrucken, Germany
[2] Stanford Univ, Stanford, CA 94305 USA
Source
ACM TRANSACTIONS ON GRAPHICS | 2008 / Vol. 27 / Issue 03
Keywords
performance capture; marker-less scene reconstruction; multi-view video analysis;
DOI
10.1145/1360612.1360697
Chinese Library Classification number
TP31 [Computer Software];
Discipline classification code
081202 ; 0835 ;
Abstract
This paper proposes a new marker-less approach to capturing human performances from multi-view video. Our algorithm can jointly reconstruct spatio-temporally coherent geometry, motion and textural surface appearance of actors that perform complex and rapid moves. Furthermore, since our algorithm is purely mesh-based and makes as few prior assumptions as possible about the type of subject being tracked, it can even capture performances of people wearing wide apparel, such as a dancer wearing a skirt. To serve this purpose, our method efficiently and effectively combines the power of surface- and volume-based shape deformation techniques with a new mesh-based analysis-through-synthesis framework. This framework extracts motion constraints from video and makes the laser scan of the tracked subject mimic the recorded performance. Small-scale, time-varying shape detail is also recovered by applying model-guided multi-view stereo to refine the model surface. Our method delivers captured performance data at a high level of detail, is highly versatile, and is applicable to many complex types of scenes that could not be handled by alternative marker-based or marker-free recording techniques.
Pages: 10