HMD-EgoPose: head-mounted display-based egocentric marker-less tool and hand pose estimation for augmented surgical guidance

Cited by: 12

Authors:

Doughty, Mitchell [1,2]

Ghugre, Nilesh R. [1,2,3]

Affiliations:

[1] Univ Toronto, Dept Med Biophys, Toronto, ON, Canada

[2] Sunnybrook Hlth Sci Ctr, Schulich Heart Program, Toronto, ON, Canada

[3] Sunnybrook Res Inst, Phys Sci Platform, Toronto, ON, Canada

Source:

INTERNATIONAL JOURNAL OF COMPUTER ASSISTED RADIOLOGY AND SURGERY | 2022 / Vol. 17 / Issue 12

Funding:

Natural Sciences and Engineering Research Council of Canada (NSERC);

Keywords:

Single-shot pose estimation; Marker-less; Deep learning; Head-mounted displays; Augmented reality; REALITY;

DOI:

10.1007/s11548-022-02688-y

Chinese Library Classification (CLC) number:

R318 [Biomedical Engineering];

Discipline classification code:

0831;

Abstract:

Purpose The success or failure of modern computer-assisted surgery procedures hinges on the precise six-degree-of-freedom (6DoF) position and orientation (pose) estimation of tracked instruments and tissue. In this paper, we present HMD-EgoPose, a single-shot learning-based approach to hand and object pose estimation and demonstrate state-of-the-art performance on a benchmark dataset for monocular red-green-blue (RGB) 6DoF marker-less hand and surgical instrument pose tracking. Further, we reveal the capacity of our HMD-EgoPose framework for performant 6DoF pose estimation on a commercially available optical see-through head-mounted display (OST-HMD) through a low-latency streaming approach.
Methods Our framework utilized an efficient convolutional neural network (CNN) backbone for multi-scale feature extraction and a set of subnetworks to jointly learn the 6DoF pose representation of the rigid surgical drill instrument and the grasping orientation of the hand of a user. To make our approach accessible to a commercially available OST-HMD, the Microsoft HoloLens 2, we created a pipeline for low-latency video and data communication with a high-performance computing workstation capable of optimized network inference.
Results HMD-EgoPose outperformed current state-of-the-art approaches on a benchmark dataset for surgical tool pose estimation, achieving an average tool 3D vertex error of 11.0 mm on real data and furthering the progress towards a clinically viable marker-free tracking strategy. Through our low-latency streaming approach, we achieved a round trip latency of 199.1 ms for pose estimation and augmented visualization of the tracked model when integrated with the OST-HMD.
Conclusion Our single-shot learned approach, which optimized 6DoF pose based on the joint interaction between the hand of a user and a rigid surgical drill, was robust to occlusion and complex surfaces and improved on current state-of-the-art approaches to marker-less tool and hand pose estimation. Further, we presented the feasibility of our approach for 6DoF object tracking on a commercially available OST-HMD.
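
Illustrative note: the abstract reports an "average tool 3D vertex error" in millimetres but does not define it. The sketch below assumes the common interpretation, namely the mean Euclidean distance between the tool model's vertices placed with the predicted pose and the same vertices placed with the ground-truth pose. The function names, the toy vertex set, and the poses are hypothetical and are not taken from the paper.

```python
import math

def apply_pose(vertices, rotation, translation):
    """Apply a rigid transform (3x3 rotation as nested lists, 3-vector translation)."""
    transformed = []
    for v in vertices:
        transformed.append(tuple(
            sum(rotation[i][j] * v[j] for j in range(3)) + translation[i]
            for i in range(3)
        ))
    return transformed

def mean_vertex_error(vertices, pose_pred, pose_true):
    """Mean Euclidean distance between vertices under the predicted and true poses.

    Assumed definition of the metric; the abstract only names it.
    """
    pred = apply_pose(vertices, *pose_pred)
    true = apply_pose(vertices, *pose_true)
    return sum(math.dist(p, t) for p, t in zip(pred, true)) / len(vertices)

IDENTITY = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]

# Hypothetical toy tool model: four vertices, units in millimetres.
toy_vertices = [(0.0, 0.0, 0.0), (100.0, 0.0, 0.0), (0.0, 50.0, 0.0), (0.0, 0.0, 20.0)]
pose_true = (IDENTITY, (0.0, 0.0, 0.0))
pose_pred = (IDENTITY, (3.0, 4.0, 0.0))  # prediction is off by a pure 5 mm translation

error_mm = mean_vertex_error(toy_vertices, pose_pred, pose_true)
print(f"{error_mm:.1f} mm")  # prints "5.0 mm"
```

With a pure translation offset every vertex moves by the same 5 mm, so the mean equals 5 mm; a rotation error would instead contribute more for vertices far from the rotation axis, which is why a vertex-based metric captures both position and orientation error in one number.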


Pages: 2253-2262

Page count: 10

4 items in total

[1] HMD-EgoPose: head-mounted display-based egocentric marker-less tool and hand pose estimation for augmented surgical guidance
Mitchell Doughty
Nilesh R. Ghugre
International Journal of Computer Assisted Radiology and Surgery, 2022, 17 : 2253 - 2262
[2] SurgeonAssist-Net: Towards Context-Aware Head-Mounted Display-Based Augmented Reality for Surgical Guidance
Doughty, Mitchell
Singh, Karan
Ghugre, Nilesh R.
MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2021, PT IV, 2021, 12904 : 667 - 677
[3] Improved Deep Learning-Based Efficientpose Algorithm for Egocentric Marker-Less Tool and Hand Pose Estimation in Manual Assembly
Niu, Zihan
Xia, Yi
Zhang, Jun
Wang, Bing
Chen, Peng
ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, ICIC 2023, PT V, 2023, 14090 : 288 - 298
[4] Architecture of a Hybrid Video/Optical See-through Head-Mounted Display-Based Augmented Reality Surgical Navigation Platform
Carbone, Marina
Cutolo, Fabrizio
Condino, Sara
Cercenelli, Laura
D'Amato, Renzo
Badiali, Giovanni
Ferrari, Vincenzo
INFORMATION, 2022, 13 (02)
