3D Human Pose Estimation from multi-view thermal vision sensors

Cited by: 5

Authors:

Lupion, Marcos [1]

Polo-Rodriguez, Aurora [2]

Medina-Quero, Javier [3]

Sanjuan, Juan F. [1]

Ortigosa, Pilar M. [1]

Affiliations:

[1] Univ Almeria, Dept Informat, CeIA3, Almeria 04120, Andalucia, Spain

[2] Univ Jaen, Dept Comp Sci, Campus Lagunillas, Jaen 23071, Andalucia, Spain

[3] Univ Granada, Higher Tech Sch Comp Engn & Telecommun, Dept Comp Engn Automat & Robot, E-18071 Granada, Andalucia, Spain

Source:

INFORMATION FUSION | 2024 / Vol. 104

Keywords:

Thermal vision; 3D human pose estimation; Convolutional neural networks;

DOI:

10.1016/j.inffus.2023.102154

Chinese Library Classification:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Human Pose Estimation from images allows the recognition of key daily activity patterns in Smart Environments. Current state-of-the-art (SOTA) 3D pose estimators are built on visible-spectrum images, which can raise privacy concerns in Ambient Assisted Living solutions. Thermal vision sensors are being deployed in these environments, as they preserve privacy and operate in low-light conditions. Furthermore, multi-view setups provide the most accurate 3D pose estimation, because images from different perspectives overcome the occlusion problem. Nevertheless, no solutions in the literature use thermal vision sensors in a multi-view scheme. In this work, a multi-view setup consisting of low-cost devices is deployed in the Smart Home of the University of Almeria. Thermal and visible images are paired using homography, and SOTA solutions such as YOLOv3 and Blazepose are used to annotate the bounding box and 2D pose in the thermal images. ThermalYOLO is built by fine-tuning YOLOv3 and outperforms YOLOv3 by 5% in bounding box recognition and by 1% in IoU value. Furthermore, InceptionResNetV2 is found to be the most appropriate architecture for 2D pose estimation. Finally, a 3D pose estimator is built by comparing input approaches and convolutional architectures. Results show that the most appropriate architecture processes three single-channel thermal images with independent convolutional backbones (ResNet50 in this case) and then fuses their output with the 2D poses. The resulting convolutional neural network shows excellent behaviour under occlusion, [...]-view SOTA in the visible [...]
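The annotation-transfer idea in the abstract (pairing views with a homography, then scoring boxes by IoU) can be illustrated with a minimal sketch. Everything below is an assumption for illustration only: the function names, the box format `(x_min, y_min, x_max, y_max)`, and the example matrix are hypothetical and are not taken from the paper, whose actual calibration and evaluation code is not shown in this record.

```python
from typing import Sequence, Tuple

Box = Tuple[float, float, float, float]  # (x_min, y_min, x_max, y_max), assumed format
Matrix3 = Sequence[Sequence[float]]      # 3x3 homography, row-major


def apply_homography(H: Matrix3, x: float, y: float) -> Tuple[float, float]:
    """Map one pixel (x, y) through H using homogeneous coordinates."""
    xh = H[0][0] * x + H[0][1] * y + H[0][2]
    yh = H[1][0] * x + H[1][1] * y + H[1][2]
    w = H[2][0] * x + H[2][1] * y + H[2][2]
    return xh / w, yh / w


def transfer_box(H: Matrix3, box: Box) -> Box:
    """Map the four corners of a visible-image box and return the
    axis-aligned box that encloses them in the thermal image."""
    x0, y0, x1, y1 = box
    corners = [apply_homography(H, x, y)
               for x, y in ((x0, y0), (x1, y0), (x1, y1), (x0, y1))]
    xs = [c[0] for c in corners]
    ys = [c[1] for c in corners]
    return (min(xs), min(ys), max(xs), max(ys))


def iou(a: Box, b: Box) -> float:
    """Intersection over Union of two axis-aligned boxes."""
    inter_w = max(0.0, min(a[2], b[2]) - max(a[0], b[0]))
    inter_h = max(0.0, min(a[3], b[3]) - max(a[1], b[1]))
    inter = inter_w * inter_h
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


# Hypothetical homography: scale by 0.5 and shift by (10, 5) pixels.
H_EXAMPLE = [[0.5, 0.0, 10.0],
             [0.0, 0.5, 5.0],
             [0.0, 0.0, 1.0]]

thermal_box = transfer_box(H_EXAMPLE, (100.0, 200.0, 300.0, 600.0))
```

In a real setup the homography would be estimated from point correspondences between the two cameras rather than written by hand, and a box transferred this way could then be compared against a detector's prediction with `iou`.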


Pages: 15
