HMD-EgoPose: head-mounted display-based egocentric marker-less tool and hand pose estimation for augmented surgical guidance

被引:12
|
作者
Doughty, Mitchell [1 ,2 ]
Ghugre, Nilesh R. [1 ,2 ,3 ]
机构
[1] Univ Toronto, Dept Med Biophys, Toronto, ON, Canada
[2] Sunnybrook Hlth Sci Ctr, Schulich Heart Program, Toronto, ON, Canada
[3] Sunnybrook Res Inst, Phys Sci Platform, Toronto, ON, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
Single-shot pose estimation; Marker-less; Deep learning; Head-mounted displays; Augmented reality; REALITY;
D O I
10.1007/s11548-022-02688-y
中图分类号
R318 [生物医学工程];
学科分类号
0831 ;
摘要
Purpose The success or failure of modern computer-assisted surgery procedures hinges on the precise six-degree-of-freedom (6DoF) position and orientation (pose) estimation of tracked instruments and tissue. In this paper, we present HMD-EgoPose, a single-shot learning-based approach to hand and object pose estimation and demonstrate state-of-the-art performance on a benchmark dataset for monocular red-green-blue (RGB) 6DoF marker-less hand and surgical instrument pose tracking. Further, we reveal the capacity of our HMD-EgoPose framework for performant 6DoF pose estimation on a commercially available optical see-through head-mounted display (OST-HMD) through a low-latency streaming approach. Methods Our framework utilized an efficient convolutional neural network (CNN) backbone for multi-scale feature extraction and a set of subnetworks to jointly learn the 6DoF pose representation of the rigid surgical drill instrument and the grasping orientation of the hand of a user. To make our approach accessible to a commercially available OST-HMD, the Microsoft HoloLens 2, we created a pipeline for low-latency video and data communication with a high-performance computing workstation capable of optimized network inference. Results HMD-EgoPose outperformed current state-of-the-art approaches on a benchmark dataset for surgical tool pose estimation, achieving an average tool 3D vertex error of 11.0 mm on real data and furthering the progress towards a clinically viable marker-free tracking strategy. Through our low-latency streaming approach, we achieved a round trip latency of 199.1 ms for pose estimation and augmented visualization of the tracked model when integrated with the OST-HMD. Conclusion Our single-shot learned approach, which optimized 6DoF pose based on the joint interaction between the hand of a user and a rigid surgical drill, was robust to occlusion and complex surfaces and improved on current state-of-the-art approaches to marker-less tool and hand pose estimation. Further, we presented the feasibility of our approach for 6DoF object tracking on a commercially available OST-HMD.
引用
收藏
页码:2253 / 2262
页数:10
相关论文
共 4 条
  • [1] HMD-EgoPose: head-mounted display-based egocentric marker-less tool and hand pose estimation for augmented surgical guidance
    Mitchell Doughty
    Nilesh R. Ghugre
    International Journal of Computer Assisted Radiology and Surgery, 2022, 17 : 2253 - 2262
  • [2] SurgeonAssist-Net: Towards Context-Aware Head-Mounted Display-Based Augmented Reality for Surgical Guidance
    Doughty, Mitchell
    Singh, Karan
    Ghugre, Nilesh R.
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2021, PT IV, 2021, 12904 : 667 - 677
  • [3] Improved Deep Learning-Based Efficientpose Algorithm for Egocentric Marker-Less Tool and Hand Pose Estimation in Manual Assembly
    Niu, Zihan
    Xia, Yi
    Zhang, Jun
    Wang, Bing
    Chen, Peng
    ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, ICIC 2023, PT V, 2023, 14090 : 288 - 298
  • [4] Architecture of a Hybrid Video/Optical See-through Head-Mounted Display-Based Augmented Reality Surgical Navigation Platform
    Carbone, Marina
    Cutolo, Fabrizio
    Condino, Sara
    Cercenelli, Laura
    D'Amato, Renzo
    Badiali, Giovanni
    Ferrari, Vincenzo
    INFORMATION, 2022, 13 (02)