Utilizing motion segmentation for optimizing the temporal adjacency matrix in 3D human pose estimation

被引：0

作者：

Wang, Yingfeng ^{[1
]}

Li, Muyu ^{[3
]}

Yan, Hong ^{[1
,2
]}

机构：

[1] Hong Kong Sci Pk, Ctr Intelligent Multidimens Data Anal, Hong Kong, Peoples R China

[2] City Univ Hong Kong, Dept Elect Engn, Hong Kong, Peoples R China

[3] Dalian Univ Technol, Inst Intelligent Sci & Technol, Sch Control Sci & Engn, Dalian, Peoples R China

来源：

NEUROCOMPUTING | 2024年 / 600卷

关键词：

3D human pose estimation; Temporal adjacency matrix; Motion segmentation;

D O I：

10.1016/j.neucom.2024.128153

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In monocular 3D human pose estimation, modeling the temporal relation of human joints is crucial for prediction accuracy. Currently, most methods utilize transformer to model the temporal relation among joints. However, existing transformer-based methods have limitations. The temporal adjacency matrix utilized within the self-attention of the temporal transformer inaccurately models the temporal relationships between frames, particularly in cases where distinct motions exhibit significant correlation despite having different physical interpretations and large temporal spans. To address this issue, we construct an artificial temporal adjacency matrix based on input data and introduce a temporal adjacency matrix hybrid module to blend this matrix with the model's inherent temporal adjacency matrix, resulting in a novel composite temporal adjacency matrix. Through extensive experiments on Human3.6M and MPI-INF-3DHP datasets using state-of-the-art methods as benchmarks, our proposed method demonstrates a maximum improvement of up to 5.6% compared to the original approach.

引用

页数：12

共 50 条

[31] Bidirectional temporal feature for 3D human pose and shape estimation from a video
Sun, Libo
Tang, Ting
Qu, Yuke
Qin, Wenhu
COMPUTER ANIMATION AND VIRTUAL WORLDS, 2023, 34 (3-4)
[32] STRFormer: Spatial-Temporal-ReTemporal Transformer for 3D human pose estimation
Liu, Xing
Tang, Hao
IMAGE AND VISION COMPUTING, 2023, 140
[33] Global and Local Spatio-Temporal Encoder for 3D Human Pose Estimation
Wang, Yong
Kang, Hongbo
Wu, Doudou
Yang, Wenming
Zhang, Longbin
IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 4039 - 4049
[34] Occluded Joints Recovery in 3D Human Pose Estimation based on Distance Matrix
Guo, Xiang
Dai, Yuchao
2018 24TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2018, : 1325 - 1330
[35] Rank estimation in 3D multibody motion segmentation
Julia, C.
Sappa, A. D.
Lumbreras, F.
Serrat, J.
Lopez, A.
ELECTRONICS LETTERS, 2008, 44 (04) : 279 - 280
[36] Occlusion Resilient 3D Human Pose Estimation
Roy, Soumava Kumar
Badanin, Ilia
Honari, Sina
Fua, Pascal
2024 INTERNATIONAL CONFERENCE IN 3D VISION, 3DV 2024, 2024, : 1198 - 1207
[37] A survey on monocular 3D human pose estimation
Ji X.
Fang Q.
Dong J.
Shuai Q.
Jiang W.
Zhou X.
Virtual Reality and Intelligent Hardware, 2020, 2 (06): : 471 - 500
[38] Precise 3D Pose Estimation of Human Faces
Pernek, Akos
Hajder, Levente
PROCEEDINGS OF THE 2014 9TH INTERNATIONAL CONFERENCE ON COMPUTER VISION, THEORY AND APPLICATIONS (VISAPP 2014), VOL 3, 2014, : 618 - 625
[39] A survey on deep 3D human pose estimation
Neupane, Rama Bastola
Li, Kan
Boka, Tesfaye Fenta
ARTIFICIAL INTELLIGENCE REVIEW, 2024, 58 (01)
[40] Deep 3D human pose estimation: A review
Wang, Jinbao
Tan, Shujie
Zhen, Xiantong
Xu, Shuo
Zheng, Feng
He, Zhenyu
Shao, Ling
COMPUTER VISION AND IMAGE UNDERSTANDING, 2021, 210

← 1 2 3 4 5 →