STAF: 3D Human Mesh Recovery From Video With Spatio-Temporal Alignment Fusion

被引:0
|
作者
Yao, Wei [1 ]
Zhang, Hongwen [2 ]
Sun, Yunlian [1 ]
Tang, Jinhui [1 ]
机构
[1] Nanjing Univ Sci & Technol, Sch Comp Sci & Engn, Nanjing 210094, Peoples R China
[2] Beijing Normal Univ, Sch Artificial Intelligence, Beijing 100875, Peoples R China
基金
中国国家自然科学基金;
关键词
3D human mesh recovery; temporal coherence; feature pyramid; attention model; POSE;
D O I
10.1109/TCSVT.2024.3410400
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
The recovery of 3D human mesh from monocular images has significantly been developed in recent years. However, existing models usually ignore spatial and temporal information, which might lead to mesh and image misalignment and temporal discontinuity. For this reason, we propose a novel Spatio-Temporal Alignment Fusion (STAF) model. As a videobased model, it leverages coherence clues from human motion by an attention-based Temporal Coherence Fusion Module (TCFM). As for spatial mesh-alignment evidence, we extract fine-grained local information through predicted mesh projection on the feature maps. Based on the spatial features, we further introduce a multi-stage adjacent Spatial Alignment Fusion Module (SAFM) to enhance the feature representation of the target frame. In addition to the above, we propose an Average Pooling Module (APM) to allow the model to focus on the entire input sequence rather than just the target frame. This method can remarkably improve the smoothness of recovery results from video. Extensive experiments on 3DPW, MPII3D, and H36M demonstrate the superiority of STAF. We achieve a state-of-the-art trade-off between precision and smoothness. Our code and more video results are on the project page https://yw0208.github.io/staf/.
引用
收藏
页码:10564 / 10577
页数:14
相关论文
共 50 条
  • [1] STAF: 3D Human Mesh Recovery from Video with Spatio-Temporal Alignment Fusion
    Yao, Wei
    Zhang, Hongwen
    Sun, Yunlian
    Tang, Jinhui
    arXiv,
  • [2] Spatio-temporal fusion of multiple view video rate 3D surfaces
    Collins, G
    Hilton, A
    FIFTH INTERNATIONAL CONFERENCE ON 3-D DIGITAL IMAGING AND MODELING, PROCEEDINGS, 2005, : 142 - 149
  • [3] Spatio-Temporal Reconstruction for 3D Motion Recovery
    Yang, Jingyu
    Guo, Xin
    Li, Kun
    Wang, Meiyuan
    Lai, Yu-Kun
    Wu, Feng
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2020, 30 (06) : 1583 - 1596
  • [4] Spatio-temporal registration techniques for relightable 3D video
    Ahmed, Naveed
    Theobalt, Christian
    Magnor, Marcus
    Seidel, Hans-Peter
    2007 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOLS 1-7, 2007, : 1065 - +
  • [5] Spatio-temporal reflectance sharing for relightable 3D video
    Ahmed, Naveed
    Theobalt, Christian
    Seidel, Hans-Peter
    COMPUTER VISION/COMPUTER GRAPHICS COLLABORATION TECHNIQUES, 2007, 4418 : 47 - +
  • [6] Spatio-Temporal Dynamic Interlaced Network for 3D human pose estimation in video
    Xu, Feiyi
    Wang, Jifan
    Sun, Ying
    Qi, Jin
    Dong, Zhenjiang
    Sun, Yanfei
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2025, 251
  • [7] 3D SPATIO-TEMPORAL GRAPH CUTS FOR VIDEO OBJECTS SEGMENTATION
    Tian, Zhiqiang
    Xue, Jianru
    Zheng, Nanning
    Lan, Xuguang
    Li, Ce
    2011 18TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2011,
  • [8] SPATIO-TEMPORAL MODELING OF VISUAL ATTENTION FOR STEREOSCOPIC 3D VIDEO
    Iatsun, Iana
    Larabi, Mohamed-Chaker
    Fernandez-Maloigne, Christine
    2014 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2014, : 5397 - 5401
  • [9] A 3D spatio-temporal motion estimation algorithm for video coding
    Lee, Gwo Giun
    Wang, Ming-Jiun
    Lin, He-Yuan
    Su, Drew Wei-Chi
    Lin, Bo-Yun
    2006 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO - ICME 2006, VOLS 1-5, PROCEEDINGS, 2006, : 741 - +
  • [10] A Graph Attention Spatio-temporal Convolutional Network for 3D Human Pose Estimation in Video
    Liu, Junfa
    Rojas, Juan
    Li, Yihui
    Liang, Zhijun
    Guan, Yisheng
    Xi, Ning
    Zhu, Haifei
    2021 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2021), 2021, : 3374 - 3380