A Multi-view RGB-D Approach for Human Pose Estimation in Operating Rooms

被引:29
|
作者
Kadkhodamohammadi, Abdolrahim [1 ]
Gangi, Afshin [1 ,2 ]
de Mathelin, Michel [1 ]
Padoy, Nicolas [1 ]
机构
[1] Univ Strasbourg, CNRS, IHU Strasbourg, ICube, Strasbourg, France
[2] Univ Hosp Strasbourg, Radiol Dept, Strasbourg, France
关键词
PICTORIAL STRUCTURES;
D O I
10.1109/WACV.2017.47
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Many approaches have been proposed for human pose estimation in single and multi-view RGB images. However, some environments, such as the operating room, are still very challenging for state-of-the-art RGB methods. In this paper, we propose an approach for multi-view 3D human pose estimation from RGB-D images and demonstrate the benefits of using the additional depth channel for pose refinement beyond its use for the generation of improved features. The proposed method permits the joint detection and estimation of the poses without knowing a priori the number of persons present in the scene. We evaluate this approach on a novel multi-view RGB-D dataset acquired during live surgeries and annotated with ground truth 3D poses.
引用
收藏
页码:363 / 372
页数:10
相关论文
共 50 条
  • [1] Multi-View Inpainting for RGB-D Sequence
    Li, Feiran
    Ricardez, Gustavo Alfonso Garcia
    Takamatsu, Jun
    Ogasawara, Tsukasa
    2018 INTERNATIONAL CONFERENCE ON 3D VISION (3DV), 2018, : 464 - 473
  • [2] Pictorial Structures on RGB-D Images for Human Pose Estimation in the Operating Room
    Kadkhodamohammadi, Abdolrahim
    Gangi, Afshin
    de Mathelin, Michel
    Padoy, Nicolas
    MEDICAL IMAGE COMPUTING AND COMPUTER-ASSISTED INTERVENTION - MICCAI 2015, PT I, 2015, 9349 : 363 - 370
  • [3] A Self-Calibration Approach for Multi-View RGB-D Sensing
    Petitti, Antonio
    Vulpi, Fabio
    Marani, Roberto
    Milella, Annalisa
    MULTIMODAL SENSING AND ARTIFICIAL INTELLIGENCE: TECHNOLOGIES AND APPLICATIONS II, 2021, 11785
  • [4] Pseudo View Representation Learning for Monocular RGB-D Human Pose and Shape Estimation
    Zhu, Armando
    Li, Jiefeng
    Lu, Cewu
    IEEE SIGNAL PROCESSING LETTERS, 2022, 29 : 712 - 716
  • [5] Dynamic Depth-Supervised NeRF for Multi-view RGB-D Operating Room Videos
    Gerats, Beerend G. A.
    Wolterink, Jelmer M.
    Broeders, Ivo A. M. J.
    PREDICTIVE INTELLIGENCE IN MEDICINE, PRIME 2023, 2023, 14277 : 218 - 230
  • [6] An RGB-D multi-view perspective for autonomous agricultural robots
    Vulpi, Fabio
    Marani, Roberto
    Petitti, Antonio
    Reina, Giulio
    Milella, Annalisa
    COMPUTERS AND ELECTRONICS IN AGRICULTURE, 2022, 202
  • [7] HUMAN POSE ESTIMATION USING TWO RGB-D SENSORS
    Xu, Wanxin
    Su, Po-chang
    Cheung, Sen-ching S.
    2016 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2016, : 1279 - 1283
  • [8] 3D Background Modeling in Multi-view RGB-D Video
    Huang, Yung-Lin
    Wei, Ku-Chu
    Chien, Shao-Yi
    MM'15: PROCEEDINGS OF THE 2015 ACM MULTIMEDIA CONFERENCE, 2015, : 1051 - 1054
  • [9] MVSalNet: Multi-view Augmentation for RGB-D Salient Object Detection
    Zhou, Jiayuan
    Wang, Lijun
    Lu, Huchuan
    Huang, Kaining
    Shi, Xinchu
    Liu, Bocong
    COMPUTER VISION, ECCV 2022, PT XXIX, 2022, 13689 : 270 - 287
  • [10] Model-Based Multi-view Registration for RGB-D Sensors
    Saval-Calvo, Marcelo
    Azorin-Lopez, Jorge
    Fuster-Guillo, Andres
    ADVANCES IN COMPUTATIONAL INTELLIGENCE, PT II, 2013, 7903 : 496 - 503